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PREFACE 


Over the past fifty years, graph theory has been one of the most rapidly growing 
areas of mathematics. Since 1960, more than 10,000 different authors have published 
papers classified as graph theory by Math Reviews, and for the past decade, more than 
1000 graph theory papers have been published each year. Not surprisingly, this Second 
Edition is about 450 pages longer than the First Edition, which appeared in 2004. 


This Handbook is intended to provide as comprehensive a view of graph theory 
as is feasible in a single volume. Many of our chapters survey areas that have large 
research communities, with hundreds of active mathematicians, and which could be 
developed into independent handbooks. The 89 contributors to this volume, 31 of 
whom are new to this edition, collectively represent perhaps as much as 90% or more 
of the main topics in pure and applied graph theory. Thirteen of the sections in the 
Second Edition cover newer topics that did not appear in the First Edition. 


Format 


In order to achieve this kind of comprehensiveness, we challenged our contributors 
to restrict their expository prose to a bare minimum, by adhering to the ready-reference 
style of the CRC Handbook series, which emphasizes quick accessibility for the non- 
expert. We thank the contributors for responding so well to this challenge. 


The 13 chapters of the Handbook are organized into 65 sections. Within each 
section, several major topics are presented. For each topic, there are lists of the essential 
definitions and facts, accompanied by examples, tables, remarks, and in some cases, 
conjectures and open problems. Each section ends with a bibliography of references 
tied directly to that section. In many cases, these bibliographies are several pages long, 
providing extensive guides to the research literature and pointers to monographs. 


To ensure that each section be reasonably self-contained, we encouraged contrib- 
utors to include some definitions that may have appeared in earlier sections. Each 
contributor was also asked to include a glossary with his or her section. These section 
glossaries were then merged by the editors into 13 chapter glossaries. 


Terminology and Notation 


Graph theory has attracted mathematicians and scientists from diverse disciplines 
and, accordingly, is blessed (and cursed) with a proliferation of terminology and no- 
tations. Since the Handbook objective is to survey topics for persons whose expertise 
may be elsewhere, either on other topics, or outside of graph theory, we asked our 
contributors to tilt toward the general usage in the mathematical community, rather 
than staying strictly within the idioms of their specialties. But to understand graph 
theory literature, it helps to accept the legacy of history. As editors, we tried to strike 
a balance between preserving the notation and terminology that evolved from each 
area’s rich history and our desire create a cohesive, uniform body of material. 


xili 


Some uniformity of usage came easily. In general, the word graph is used inclusively 
to refer to graphs with directed edges and/or to graphs with multi-edges and self-loops. 
In most sections, G denotes a graph and V and E denote its vertex- and edge-sets, 
respectively. 


However, some words are used differently by different graph theory communities. 
For instance, to an algebraic graph theorist, a Cayley graph is simple, connected, and 
undirected, but to a topological graph theorist, it may be non-connected, possibly 
directed, and have multi-edges and/or self-loops. To some graph theorists, a clique is 
a complete subgraph, maximal under set inclusion, and to others maximality is not 
required. 


Consistency in notation was also problematic. In the literature of graph coloring, 
the Greek letter y is used as the chromatic number, but to an algebraic topologist, it 
means the Euler characteristic. 


Notes regarding terminology and notation were added to make these variations 
explicit, thereby improving cross-chapter compatibility. 
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INTRODUCTION 


Configurations of nodes and connections occur in a great diversity of applications. 
They may represent physical networks, such as electrical circuits, roadways, or organic 
molecules. They are also used in representing less tangible interactions as might occur in 
ecosystems, sociological relationships, databases, or in the flow of control in a computer 


program. 


1.1.1. Graphs and Digraphs 


Any mathematical object involving points and connections between them may be 
called a graph. If all the connections are unidirectional, it is called a digraph. Our highly 
inclusive definition in this initial section of the Handbook permits fluent discussion of 
almost any particular modification of the basic model that has ever been called a graph. 


Basic Terminology 


DEFINITIONS 


D1: A graph G = (V,£) consists of two sets V and E. 


e The elements of V are called vertices (or nodes). 


e The elements of F are called edges. 


e Each edge has a set of one or two vertices associated to it, which are called its 
endpoints. An edge is said to join its endpoints. 
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NOTATION: The subscripted notations V¢ and Eg (or V(G) and E(G)) are used for the 
vertex- and edge-sets when G is not the only graph under consideration. 


D2: If vertex v is an endpoint of edge e, then v is said to be tnczdent on e, and e is 
incident on v. 


D3: A vertex u is adjacent to vertex v if they are joined by an edge. 

D4: Two adjacent vertices may be called netghbors. 

D5: Adjacent edges are two edges that have an endpoint in common. 

D6: A proper edge is an edge that joins two distinct vertices. 

D7: A multi-edge is a collection of two or more edges having identical endpoints. 


D8: A simple adjacency between vertices occurs when there is exactly one edge 
between them. 


D9: The edge-multiplicity between a pair of vertices u and v is the number of edges 
between them. 


D10: A self-loop is an edge that joins a single endpoint to itself. 


TERMINOLOGY 


e An alternative term for self-loop is loop. This can be used in contexts in which 
loop has no other meanings. 


e In computer science, the term graph is commonly used either to mean a graph as 
defined here, or to mean a computer-represented data structure whose value is a 
graph. 


EXAMPLE 


El: A line drawing of a graph G = (V, E) is shown in Figure 1.1.1. It has vertex-set 
V = {u,v,w,x} and edge-set E = {a,b,c,d,e, f}. The set {a,b} is a multi-edge with 
endpoints u and v, and edge c is a self-loop. 


Figure 1.1.1: A graph. 


REMARKS 


R1: A graph is realized in a plane or in 3-space as a set of points, representing the 
vertices, and a set of curved or straight line segments, representing the edges. The 
curvature or length of such a line segment is irrelevant to the meaning. However, if a 
direction is indicated, that is significant. 


R2: Occasionally, a graph is parametrized so that each edge is regarded as the home- 
omorphic image of the real interval [0,1] (except that for a self-loop, the endpoints 0 
and 1 have the same image). 
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Simple Graphs 


Most of theoretical graph theory is concerned with simple graphs. This is partly because 
many problems regarding general graphs can be reduced to problems about simple 
graphs. 


DEFINITIONS 
D11: A simple graph is a graph that has no self-loops or multi-edges. 
D12: A trivial graph is a graph consisting of one vertex and no edges. 


D13: A null graph is a graph whose vertex- and edge-sets are empty. 


Edge Notation for Simple Adjacencies and for Multi-Edges 


NOTATION: An edge joining vertices u and v of a graph may be denoted by the juxta- 
position uv if it is the only such edge. Occasionally, the ordered pair (u,v) is used in 
this situation, instead of uv. To avoid ambiguities when multi-edges exist, or whenever 
else desired, the edges of a general graph may be given their own names, as in Figure 
1.1.1 above. 

EXAMPLE 

E2: The simple graph shown in Figure 1.1.2 has edge-set F = {uv, vw, vz, wa}. 


Ww 


Figure 1.1.2: A simple graph. 


General Graphs 


Many applications require non-simple graphs as models. Moreover, some non-simple 
graphs serve an essential role in theoretical constructions, especially in constructing 
graph drawings (simple and non-simple) on surfaces (see Chapter 7). 


TERMINOLOGY NOTE: Although the term “graph” means that self-loops and multi-edges 
are allowed, sometimes, for emphasis, the term general graph is used. 


DEFINITIONS 


D14: A loopless graph is a graph that has no self-loops. (It might have multi-edges. ) 
Sometimes a loopless graph is referred to as a multigraph. 


D15: The dipole D,, is a loopless graph with two vertices and n edges joining them. 


D16: The bouquet B,, is a graph with one vertex and n self-loops. 
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EXAMPLES 


E3: The loopless graph in Figure 1.1.3 depicts the benzene molecule CgH¢. 


‘ / 
ae es 
‘ Ss < 
/ \ 


Figure 1.1.3: Graph model for a benzene ring. 


E4: The dipole D3 is shown in Figure 1.1.4. 


cars 5 
Figure 1.1.4: The loopless graph D3. 


E5: Two graphs with self-loops are shown in Figure 1.1.5. 


Figure 1.1.5: The dumbbell graph and the bouquet By. 


Attributes 


Allowing graphs to have additional attributes beyond vertices and edges enables them 
to serve as mathematical models for a wide variety of applications. Two of the most 
common additional edge attributes, both described in great detail later in the Handbook, 
are edge direction (e.g., Chapters 3 and 11) and edge weight (e.g., Chapters 4 and 11). 
Another common attribute (for edges or vertices) is color. Graph coloring is discussed 
in Chapter 5. 


DEFINITIONS 


D17: A vertex attribute is a function from the vertex-set to some set of possible 
attribute values. 


D18: An edge attribute is a function from the edge-set to some set of possible 
attribute values. 
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Digraphs 


An edge between two vertices creates a connection in two opposite senses at once. As- 
signing a direction makes one of these senses forward and the other backward. Viewing 
direction as an edge attribute is partly motivated by its impact on computer imple- 
mentations of graph algorithms. Moreover, from a mathematical perspective, regarding 
directed graphs as augmented graphs makes it easier to view certain results that tend 
to be established separately for graphs and for digraphs as a single result that applies 
to both. The attribute of edge direction is developed extensively in Chapter 3 and 
elsewhere in this Handbook. 


DEFINITIONS 


D19: A directed edge (or arc) is an edge e, one of whose endpoints is designated 
as the tazl, and whose other endpoint is designated as the head. They are denoted 
head(e) and tail(e), respectively. 


TERMINOLOGY: A directed edge is said to be directed from its tail and directed to 
its head. (The tail and the head of a directed self-loop are the same vertex.) 


NOTATION: In a line drawing, the arrow points toward the head. 

D20: A multi-arc is a set of two or more arcs having the same tail and same head. 
D21: A digraph (or directed graph) is a graph each of whose edges is directed. 
D22: A simple digraph is a digraph with no self-loops and no multi-arcs. 


D23: A mixed graph (or partially directed graph) is a graph that has both undi- 
rected and directed edges. In a mixed graph, using the unmodified term edge avoids 
specifying whether the edge is directed or undirected. 


D24: The underlying graph of a directed or partially directed graph G is the graph 
that results from removing all the designations of head and tail from the directed edges 
of G (ie., deleting all the edge-directions). 


Ordered-Pair Representation of Arcs 


NOTATION: In a simple digraph, an arc from vertex u to vertex v is commonly denoted 
(u,v) (or sometimes uv). When multi-arcs are possible, using distinct names is often 
necessary. 


COMPUTATIONAL NOTE: (A caution to software designers) From the perspective of 
object-oriented software design, the ordered-pair representation of arcs in a digraph 
treats digraphs as a different class of objects from graphs. This could seriously under- 
mine software reuse. Large portions of computer code might have to be rewritten in 
order to adapt an algorithm that was originally designed for a digraph to work on an 
undirected graph. 


The ordered-pair representation could also prove awkward in implementing algorithms 
for which the graphs or digraphs are dynamic structures (i.e., they change during the 
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algorithm). Whenever the direction on a particular edge must be reversed, the associ- 
ated ordered pair has to be deleted and replaced by its reverse. Even worse, if a directed 
edge is to become undirected, then an ordered pair must be replaced with an unordered 
pair. Similarly, the undirected and directed edges of a partially directed graph would 
require two different types of objects. 


EXAMPLES 


E6: The digraph on the left in Figure 1.1.6 has the undirected graph on the right as 
its underlying graph. The digraph has two multi-arcs: {a,b} and {f, h}. 


Figure 1.1.6: A digraph and its underlying graph. 


E7: A simple digraph can have one arc in each direction between two vertices, as 
illustrated in Figure 1.1.7. 


Figure 1.1.7: A simple digraph whose underlying graph is not simple. 


Vertex-Coloring 


When the vertex-set of a graph is partitioned, the cells of the partition are commonly 
assigned distinct colors. This is developed at length in Chapter 5. 


DEFINITIONS 


D25: A vertex-coloring of a graph G is a function from its vertex-set Vg vertices to 
a set C whose elements are called colors. 


D26: A vertex-coloring is proper if two adjacent vertices are always assigned different 
colors. 


D27: A graph is c-colorable if it has a proper vertex-coloring with c or fewer colors. 


D28: The (vertex) chromatic number of a graph G, denoted x(G), is the smallest 
number c of colors such that G is c-colorable. 
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REMARK 


R3: Definitions of edge-coloring, c-edge-colorable, and edge-chromatic number, 
denoted y’(G), are obtained by simply replacing the word “vertices” with the word 
“edges” in the definitions above. 


EXAMPLE 


E8: The graph G in Figure 1.1.8 is shown with a 3-coloring of its vertex-set. Since 
it is not 2-colorable, its chromatic number is 3. Also, the graph is easily seen to be 
3-edge-colorable and clearly is not 2-edge-colorable; hence, .’(G) = 3. 


3 


Figure 1.1.8: A graph G with y(G) = x/(G) =3. 


1.1.2 Degree and Distance 


Two of the most fundamental notions in graph theory are those of the degree of a 
vertex and the distance between two vertices. Distance is developed fully in Chapter 9. 


Degree 


DEFINITIONS 


D29: The degree (or valence) of a vertex v in a graph G, denoted deg(v), is the 
number of proper edges incident on v plus twice the number of self-loops. (For simple 
graphs, of course, the degree is simply the number of neighbors.) 


TERMINOLOGY: Applications of graph theory to physical chemistry motivate the use of 
the term valence as an alternative to degree. Thus, a vertex of degree d is also called 
a d-valent vertex. 


D30: The degree sequence of a graph is the sequence formed by arranging the vertex 
degrees into non-decreasing order. 


D31: The indegree of a vertex v in a digraph is the number of arcs directed to v; the 
outdegree of vertex v is the number of arcs directed from v. Each self-loop at v counts 
one toward the indegree of v and one toward the outdegree. 


D32: An isolated vertex in a graph is a vertex of degree 0. 


EXAMPLES 


E9: The graph in Figure 1.1.9 has degree sequence < 0,1,1,4,6,6 >. Vertices u and v 
both have degree 6. 
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Figure 1.1.9: A graph with degree sequence < 0,1,1,4,6,6 >. 


E10: Figure 1.1.10 below shows the indegrees and outdegrees of a digraph. 


vertex 


indegree 


outdegree 


Figure 1.1.10: The indegrees and outdegrees of the vertices of a digraph. 


FACTS 


For proofs of the following elementary facts, see [GrYe06, $1.1] or other basic texts. 


F1: (Euler) The sum of the degrees of the vertices of a graph is twice the number of 
edges. 


F2: In every graph, the number of vertices having odd degree is an even number. 


F3: A non-trivial simple graph G must have at least one pair of vertices whose degrees 
are equal. 


F4: In a digraph, the sum of the indegrees and the sum of the outdegrees both equal 
the number of edges. 


F5: The degree sequence of a graph is a finite, non-decreasing sequence of non-negative 
integers whose sum is even. 


F6: Conversely, any non-decreasing, non-negative sequence of integers whose sum is 
even is the degree sequence of some graph, but not necessarily of a simple graph. 
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Walks, Trails, and Paths 


DEFINITIONS 
D33: A walk in a graph G is an alternating sequence of vertices and edges, 
W= U0; €1, U1, €1,+++5€n;Un 


such that for 7 =1,...,n, the vertices v;_1 and v; are the endpoints of the edge e;. If, 
moreover, the edge e; is directed from v;_; to vj, then W is a directed walk. 


e In a simple graph, a walk may be represented simply by listing a sequence of 
vertices: W = v9, V1,.-.,Un such that for 7 = 1,...,n, the vertices vj_1 and v; 
are adjacent. 


The initial vertex is vo. 


The final vertex (or terminal vertex) is Un. 


e An internal vertex is a vertex that is neither initial nor final. 
D34: The length of a walk is the number of edges (counting repetitions). 


D35: A walk is closed if the initial vertex is also the final vertex; otherwise, it is 
open. 


D36: A trail in a graph is a walk such that no edge occurs more than once. 


D37: An eulerian trail in a graph G is a walk that contains each edge of G exactly 
once. (See §4.2.) 


D38: A path in a graph is a trail such that no internal vertex is repeated. 
D39: A cycle is a closed path of length at least 1. 


D40: A trivial walk, trail, or path consists of a single vertex and no edges. 


EXAMPLE 


E11: In the graph shown in Figure 1.1.11, the vertex sequence (u,v, 2z,v, z) represents 
a walk that is not a trail, and the vertex sequence (u,v, x,y, U, Z) represents a trail that 
is not a path. 


x y 


Figure 1.1.11: A graph. 
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Distance and Connectivity 


DEFINITIONS 


D41: The distance between two vertices in a graph is the length of the shortest 
walk between them. 


D42: The directed distance from a vertex u to a vertex v in a digraph is the length 
of the shortest directed walk from wu to v. 


D43: A graph is connected if between every pair of vertices there is a walk. 
D44: A digraph is (weakly) connected if its underlying graph is connected. 


D45: A digraph is strongly connected if from each vertex to each other vertex there 
is a directed walk. 


D46: The eccentricity of a vertex v in a connected graph is its distance to a vertex 
farthest from v. 


D47: The radius of a connected graph is its minimum eccentricity. 


D48: The diameter of a connected graph is its maximum eccentricity. 


EXAMPLE 


E12: The digraph shown on the left in Figure 1.1.12 is strongly connected; the digraph 
on the right is connected but not strongly connected. 


Figure 1.1.12: A strongly connected digraph and a weakly connected one. 


1.1.3. Basic Structural Concepts 


We are concerned with the possible equivalence of two graphs, with the symmetries 
of an individual graph, and with the possible appearance of one graph within another 
graph. 


Isomorphism 


In concept, two graphs are tsomorphic if they are structurally identical, which means 
that they correspond in all structural details. A formal vertex-to-vertex and edge-to- 
edge correspondence is called an isomorphism. 
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DEFINITIONS 


D49: An isomorphism between two simple graphs G and H is a vertex bijection 
@: Ve — Vy such that for u,v € Veg, the vertex u is adjacent to the vertex v in graph 
G if and only if d(u) is adjacent to ¢(v) in graph H. Implicitly, there is also an edge 
bijection Eg > Ey such that uv + ¢(u)d(v). 


D50: An isomorphism between two general graphs G and H is a pair of bijections 
oy : Vg > Vy and ¢g : Eg > Ey such that for every pair of vertices u,v € Va, the set 
of edges in Eg joining u and v is mapped bijectively to the set of edges in Fy joining 
the vertices ¢(u) and $(v). 


D51: We say that G and H are tsomorphic graphs and we write G = H if there is 
an isomorphism G > H. 


D52: An adjacency matriz for a simple graph G whose vertices are explicitly ordered 
U1, V2,-+-+,Un is the n x n matrix Ag such that 


fy “a aang ane adacent (1.1.1) 
0 otherwise 


Agi, J) = 


D53: A property associated with all graphs is an isomorphism invariant if it has 
the same value (or is the same) for any two isomorphic graphs. 


EXAMPLES 
E13: The two graphs in Figure 1.1.13 are isomorphic under the mapping 


Uy? Uy UQE? UL UZ UA U4 VB 


Vo 


Vy V3 


Figure 1.1.13: Two isomorphic graphs. 


If one flips vertex u4 of graph G downward to the bottom and rotates the figure a 
quarter-turn counterclockwise, then the resulting image of graph G “looks just like” 
graph H. Their adjacency matrices are: 


Uy U2 UZ U4 Ui U2 U3 V4 
Ut 0 1 1 0 V1 0 1 0 1 

a U2 1 0 I 1 _ v2 1 0 1 1 
AG He te ot GSS cia vy} 0 1 0 1 
UWA 0 1 1 0 U4 1 1 1 0O 


We observe that transposing rows u3 and u,4 and also transposing columns u3 and ua 
transforms the matrix Ag into matrix Ay. 
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E14: The two graphs in Figure 1.1.14 are isomorphic, even if the drawings look quite 
different. The vertex-labels indicate an isomorphism. 


ESN 
<A> 


Figure 1.1.14: Two isomorphic graphs that look quite different. 


E15: Figure 1.1.15 shows two non-isomorphic graphs with identical degree sequences. 
(It is easy to show that connectedness is an isomorphism invariant.) 


| G = H 
Figure 1.1.15: Two graphs whose degree sequences are both (2, 2,2, 3,3, 3,3). 


FACTS 


F7: Considering all possible bijections of the vertex-sets of two n-vertex graphs requires 
O(n!) steps. 


F8: Although some fast heuristics are known (see §2.2), there is no known polynomial- 
time algorithm for testing graph isomorphism. 


F9: The number of vertices, the number of edges, and the degree sequence are all 
isomorphism invariants. On the other hand, having the same values for all three of these 
invariants does not imply that two graphs are isomorphic, as illustrated by Example 15. 


F10: Each row sum (and column sum) in an adjacency matrix equals the degree of the 
corresponding vertex. 


Automorphisms 


The notion of symmetry in a graph is formalized in terms of isomorphisms of the graph 
to itself. 


DEFINITIONS 


D54: A graph automorphism is an isomorphism of the graph to itself. 
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D55: The orbit of a vertex u of a graph G is the set of all vertices v € Ve such that 
there is an automorphism ¢ such that ¢(u) = v. 


D56: The orbit of an edge d of a graph G is the set of all edges e € Eg such that 
there is an automorphism ¢ such that ¢(d) = e. 


D57: A graph is vertex-transitive if all the vertices are in the same orbit. 


D58: A graph is edge-transitive if all the edges are in the same orbit. 


FACTS 
F11: The vertex orbits partition the vertex-set of a graph. 


F12: The edge orbits partition the edge-set of a graph. 


EXAMPLE 


E16: For the graph on the left in Figure 1.1.16, the vertex orbits are {u1,u4} and 
{uz,u3}, and the edge orbits are {u ue, u1u3, Ugu4, ugua} and {ugu3}. The graph on 
the right is vertex-transitive and edge-transitive. 


Figure 1.1.16: The graph K,4— e and the Petersen graph. 


Subgraphs 


DEFINITIONS 


D59: A subgraph of a graph G is a graph H such that Vy C Vo and Ey C Eg. 
(Usually, any graph isomorphic to a subgraph of G is also said to be a subgraph of G.) 


D60: In a graph G, the induced subgraph on a set of vertices W = {w1,..., we}, 
denoted G(W), has W as its vertex-set, and it contains every edge of G whose endpoints 
are in W. That is, 


V(G(W)) = W and E(G(W)) = {e € E(G) | the endpoints of edge e are in W} 


D61: A subgraph H of a graph G is a spanning subgraph if V(H) = V(G). (Also, 
if H is isomorphic to a spanning subgraph of G, we may say that H spans G.) 


D62: A component of a graph G is a connected subgraph H such that no subgraph 
of G that properly contains H is connected. In other words, a component is a maximal 
connected subgraph. 
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G H, He 


Figure 1.1.17: A spanning subgraph H, and an induced subgraph H9. 


EXAMPLE 


E17: For the graph G in Figure 1.1.17, Hy is a spanning subgraph but not an induced 
subgraph, and Hy is an induced subgraph but not a spanning subgraph. 


FACTS 


F13: Let ¢: G— H bea graph isomorphism, and let J be a subgraph of G. Then 
the restriction of ¢ to the subgraph J is an isomorphism onto its image ¢(J). 


F14: Ifa graph J is a subgraph of a graph G but not a subgraph of a graph H, then 
G # H. This is a corollary of Fact F13. 


Graph Operations 


The operations of adding and deleting vertices and edges of a graph are regarded as 
primary operations, because they are the foundation for other operations, which may 
be called secondary operations. 


DEFINITIONS 


D63: The operation of adding the vertex u to a graph G = (V, E), such that u ¢ V, 
yields a new graph with vertex-set V U {u} and edge-set E, which is denoted GU {u}. 
(The new vertex u has no neighbors.) 


D64: The operation of deleting the vertex u from a graph G = (V,F) not only 
removes the vertex u but also removes every edge of which wu is an endpoint. The 
resulting graph is denoted G — u. 


D65: The operation of adding an edge d (or uv) to a graph G = (V, £) joining 
the vertices u and v yields a new graph with vertex-set V and edge-set E U {d} (or 
EU {uv}), which is denoted GU {d} (or GU {uv}). 


D66: The operation of deleting an edge d (or uv) from a graph G = (V, E) removes 
only that edge. The resulting graph is denoted G — d (or G— uv). 


D67: A cut-vertex (or cutpoint) is a vertex whose removal increases the number of 
components. 


D68: A cut-edge is an edge whose removal increases the number of components. 


D69: The edge-complement of a simple graph G is the graph G (alternatively 
denoted G°) that has the same vertex-set as G’, such that uv is an edge of G if and only 
if it is not an edge of G. 
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D70: The join (or suspension) of two graphs G and H is denoted by G+ H. It has 
the following vertex-set and edge-set: 


V(G+ H) =V(G)UV(A) 
E(G+ H) = E(G)U E(A)U {uv | u € V(G) and v € V(#A)} 

D71: The cartesian product (or product) of two graphs G and H is denoted by 
G x H. Its vertex-set and edge-set are as follows: 

V(G x H) = V(G) x V(A) 

E(G x H) = E(G) x V(H)UV(G) x E(#) 

The endpoints of the edge (d,v) € E(G) x V(H) are the vertices (x,v) and (y,v), 
where x and y are the endpoints of edge d € E(G). The endpoints of the edge (u,e) € 


V(G) x E(H) are the vertices (u,s) and (u,t), where s and ¢ are the endpoints of edge 
e€ E(A). 

D72: The graph union of two graphs G and H is the graph GU H whose vertex-set 
and edge-set are the disjoint unions, respectively, of the vertex-sets and the edge-sets of 


G and H. 


D73: The m-fold self-union mG is the iterated disjoint union GU---UG of m copies 
of the graph G. 


EXAMPLES 


E18: Figure 1.1.18 illustrates the operation of edge-complementation. 


=A 


Figure 1.1.18: Edge-complementation. 


G 


E19: Figure 1.1.19 illustrates the join operation. 


I. Be 


G H 


Figure 1.1.19: Join operation. 


E20: In Figure 1.1.18, the vertex in the upper left corner of the drawing of the graph 
G is a cut-vertex, and the edge from that vertex to the center vertex is a cut-edge. 
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E21: Figure 1.1.20 illustrates the product operation. 


"ag 


G H GxH 
Figure 1.1.20: Cartesian product. 


1.1.4 Trees 


Trees are important to the structural understanding of graphs and to the algorith- 
mics of information processing, and they play a central role in the design and analy- 
sis of connected networks. A standard characterization theorem for trees appears in 
Chapter 2. 


Acyclic Graphs 


DEFINITIONS 
D74: A tree is a connected graph with no cycles (i.e., acyclic). 
D75: A forest is a (not necessarily connected) graph with no cycles. 


D76: A central vertex in a graph is a vertex whose eccentricity equals the radius of 
the graph. 


D77: The center of a graph is the subgraph induced on its set of central vertices. 
TERMINOLOGY NOTE: Classically (see Chapter 3), the words center and bicenter were 
used to mean the set of central vertices of a tree, when there was only one vertex or two 
vertices, respectively. (See Fact F15 below.) 


EXAMPLE 


E22: The graph on the left in Figure 1.1.21 is a tree; the other two graphs are not. 


tree non-tree non-tree 


Figure 1.1.21: A tree and two non-trees. 
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FACT 


F15: The center of a tree is isomorphic to K; or to Ky. (See §1.3 for information 
about the historical context of this fact.) 


Trees as Subgraphs 


Several different problem-solving algorithms involve growing a tree within a graph, one 
edge and one vertex at a time. All these techniques are refinements and extensions of 
the same basic tree-growing scheme given in this section. 


DEFINITIONS 


TERMINOLOGY: For a given tree J in a graph G, the edges and vertices of T are called 
tree edges and tree vertices, and the edges and vertices of G that are not in T are 
called non-tree edges and non-tree vertices. 


D78: A frontier edge for a given tree T in a graph is a non-tree edge with one 
endpoint in T and one endpoint not in T’. 


D79: A spanning tree of a graph G is a spanning subgraph of G that is a tree. 


EXAMPLE 


E23: For the graph in Figure 1.1.22, the tree edges of a tree T are drawn in bold. The 
tree vertices are black, and the non-tree vertices are white. The frontier edges for T, 
appearing as dashed lines, are edges a, b, c, and d. The plain edges are the non-tree 
edges that are not frontier edges for T. 


Figure 1.1.22: A tree with frontier edges a, b, c, and d. 


Observe that when any one of the frontier edges in Figure 1.1.22 is added to the tree T, 
the resulting subgraph is still a tree. This property holds in general, and applying it 
iteratively forms the core of the tree-growing scheme of this section. 


FACT 


F16: Let T be a tree in a graph G, and let e be a frontier edge for T. Then the subgraph 
of G formed by adding edge e to tree T is a tree. (Formally, adding frontier edge e to 
a tree involves adding a new vertex to current tree T, i.e., its non-tree endpoint.) 
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Basic Tree-Growing Algorithm 


The basic tree-growing scheme uses vertex labels to keep track of the order in which 
vertices are added to the tree. 

TERMINOLOGY NOTE: A standard (0-based) vertex-labeling of an n-vertex graph is 
a one-to-one assignment of the integers 0,1,...,2— 1 to the vertices of that graph. 


Algorithm 1.1.1: Basic Tree-Growing with Vertex Labels 


Input: a graph G and a starting vertex v € Va. 
Output: a spanning tree T of Cg(v) and a standard vertex-labeling of C¢(v). 


Initialize tree T as vertex v. 
Write label 0 on vertex v. 
Initialize label counter i := 1 
While tree T does not yet span component Cg(v) 
Choose a frontier edge e for tree T. 
Let w be the endpoint of edge e that lies outside of T. 
Add edge e and vertex w to tree T’. 
Write label 2 on vertex w. 
t:=ti+1 
Return tree T and vertex-labeling of Ce(v). 


REMARK 
R4: Uniqueness of the Output Tree from Tree-Growing 


Without a rule for choosing a frontier edge (including a way to break ties), the output 
tree from Algorithm 1.1.1 would not be unique (in which case, many computer scientists 
would hesitate to use the term algorithm). The uniqueness of the output depends on 
some default priority based on the ordering of the edges (and vertices) in the data 
structure chosen to implement the algorithm. The default priority is used whenever no 
other rule is given and as a way of breaking ties left from other rules. 


FACTS 


F17: If an execution of the basic tree-growing algorithm starts at vertex v of a graph 
G, then the subgraph consisting of the labeled vertices and tree edges is a spanning tree 
of the component Ce(v). 


F18: A graph is connected if and only if the basic tree-growing algorithm labels all its 
vertices. 


Prioritizing the Edge Selection 


The edge-prioritized tree-growing algorithm, Algorithm 1.1.2, is a refinement of basic 
tree-growing. 
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Algorithm 1.1.2: Edge-Prioritized Tree-Growing 


Input: a connected graph G, a starting vertex v € Ve, 
and a rule for prioritizing frontier edges. 
Output: a spanning tree T and a standard vertex-labeling of Vg. 


Initialize tree T as vertex v. 
Initialize the set of frontier edges for tree T as empty. 
Write label 0 on vertex v. 
Initialize label counter i := 1 
While tree T does not yet span G 
Update the set of frontier edges for T. 
Let e be the frontier edge for T’ of highest priority. 
Let w be the unlabeled endpoint of edge e. 
Add edge e (and vertex w) to tree T. 
Write label 7 on vertex w. 
t:=t4+1 
Return tree T with its vertex-labeling. 


FACT 


F19: Different rules for prioritizing the frontier edges give rise to different span- 
ning trees: the depth-first search tree (last-in-first-out priority), the breadth-first search 
tree (first-in-first-out priority), the Prim tree (least-cost priority), and the Dijkstra tree 
(closest-to-root priority). See §10.1 
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INTRODUCTION 


Whenever a property of graphs is defined, a family of graphs — those with that 
property — results. Consequently, we focus on basic families. Along with the definitions 
of families, we include characterizations where appropriate. [ReWi98] offers a detailed 
catalog of the members of various graph and digraph families. 


1.2.1 Building Blocks 


Some simple graphs have as few edges or as many as possible for a given number 
of vertices. Some multigraphs and general graphs have as few vertices as possible for a 
given number of edges. 


DEFINITIONS 


D1: A simple graph is a complete graph if every pair of vertices is joined by an edge. 
The complete graph with n vertices is denoted Kp. 


D2: The empty graph K,, is defined to be the graph with n vertices and no edges. 
D3: The null graph Ko is the graph with no vertices or edges. 

D4: The trivial graph K, is the graph with one vertex and no edges. 

D5: The bouquet B,, is the general graph with one vertex and n self-loops. 


D6: The dipole D,, is the multigraph with two vertices and n edges. 
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D7: A simple digraph is a complete digraph if between every pair of vertices there 
is an arc in each direction. The complete digraph with n vertices is denoted K ae 

D8: The path graph P,, is the n-vertex graph with n — 1 edges, all on a single open 
path. (Quite commonly elsewhere, the subscript of the notation P,, denotes the number 


of edges.) 


D9: The cycle graph C,, is the n-vertex graph with n edges, all on a single cycle. 


REMARKS 


R1: Although the empty graph may seem to some a “pointless” concept, it is the 
default initial value in computer representations of graph-valued variables. 


R2: Whereas a “path” and a “cycle” are alternating sequences of vertices and edges, 
a “path graph” and a “cycle graph” are kinds of graphs. 


EXAMPLES 


E1: Figure 1.2.1 shows the complete graph Ky, and the complete digraph Ke 


YEN 


Figure 1.2.1: A complete graph and a complete digraph. 


E2: Figure 1.2.2 shows a path graph and a cycle graph. 


ROM “| 


Figure 1.2.2: A path graph and a cycle graph. 


1.2.2 Symmetry 


Graphs with various kinds of symmetry are of particular interest. 


Local Symmetry: Regularity 


Regularity of a graph is an elementary form of local symmetry. 
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DEFINITIONS 


D10: A graph is regular if every vertex is of the same degree. 


e It is k-regular if every vertex is of degree k. 


D11: A k-factor of a graph G is a k-regular spanning subgraph. 


FACT 


F1: All vertez-transitive graphs (see §1.1) are regular. 


EXAMPLES 
E3: For k = 0,1, 2,3, there is exactly one k-regular simple graph with 4 vertices. 


E4: The only regular simple graphs with 5 vertices are the empty graph Ks (degree 0), 
the cycle graph Cs (degree 2), and the complete graph Ks (degree 4). 


E5: [ReWi98] There are exactly two 3-regular simple graphs with 6 vertices. 


>< x 


Figure 1.2.3: The two 3-regular simple graphs with 6 vertices. 


E6: The disjoint union of the complete graphs K3 and K, is a 2-regular simple 7-vertex 
graph that is not vertex-transitive. Its edge-complement is a 4-regular connected simple 
7-vertex graph that is not vertex-transitive. 


E7: Of the five 3-regular connected simple graphs with 8 vertices, two are vertex- 


ie aunts 


Figure 1.2.4: The five 3-regular connected simple graphs with 8 vertices. 


24 Chapter 1. Introduction to Graphs 


Global Symmetry: Vertex-Transitivity 


Often vertex-transitivity arises from algebra or geometry. See §6.2 for further discussion 
of Cayley graphs and circulant graphs. 


DEFINITIONS 


D12: The Cayley graph C(A,X) for a group A with generating set X has the 
elements of A as vertices and has an edge directed from a to ax for every a € A and 
x € X. We assume that vertices are labeled by elements of A and that edges are labeled 
by elements of X. 
e We note that an involution x gives rise to a pair of oppositely directed edges 
between a and az, for each a € A; sometimes we identify each such pair of 
directed edges to a single undirected edge labeled «x. 


D13: A circulant graph Circ(n; X) is defined for a positive integer n and a subset 
X of the integers 1,2,...,| 4], called the connections. 
e The vertex set is Z,, the integers modulo n. 


e There is an edge joining two vertices j and k if and only if the difference |j — k| is 
in the set X. A circulant graph is a special case of a Cayley graph; an involution 
in the connection set gives rise to a single edge. 


D14: The 1-skeleton (often in graph theory, the skeleton) of a k-complex K is the 
graph consisting of the vertices and the edges of K. 


D15: The d-hypercube graph Q, (or d-cube graph) is the 1-skeleton of the d- 
dimensional hypercube {(x1,...,@n) | 0 < a; < 1}. This graph has 2? vertices and is 
regular of degree d. 


D16: The d-octahedral graph Og, is defined recursively: 


Ky ifn=1 
Og = oe 
Og1+Ko ifn>2 


D17: The Petersen graph is the 10-vertex 3-regular graph depicted in Figure 1.2.5. 


Figure 1.2.5: The Petersen graph. 


EXAMPLES 


E8: The n-simplex is the convex hull of n + 1 affinely independent points in n- 
dimensional space. Its 1-skeleton is isomorphic to the complete graph K,,. 


E9: A Platonic graph is the 1-skeleton of one of the five Platonic solids: the tetra- 
hedron, the cube, the octahedron, the dodecahedron, and the icosahedron. 
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E10: The Petersen graph is vertex-transitive, since there is an automorphism that 
swaps the pentagram (i.e., the star) with the pentagon. It is not a Cayley graph of 
either of the two groups of order 10, i.e., of the cyclic group Z19 or of the dihedral group 
Ds, and thus, not a Cayley graph. 


E11: The octahedral graph Og is isomorphic to dK9. 


FACTS 


F2: [Hypercube Characterization Theorem] The graph whose vertices are the binary 
sequences of length d in which two vertices are adjacent if their sequences differ in 
exactly one place is isomorphic to Qg. 


F3: We can construct the d-dimensional hypercube Qg recursively, using the cartesian 


product operation: 
Ky ifd=0 
Qa = 
Qa-1 x Ko if d > 1 


1.2.3 Integer-Valued Invariants 


Some of the most useful graph properties are provided by integer-valued invariants of 
isomorphism type. Such invariants partition all graphs into an infinite list of subclasses. 
Often the subclasses with low invariant values are of special interest. 


Cycle Rank 


The connected graphs of cycle rank 0 are of great special interest, since they are the 
trees (see §1.1). 


DEFINITION 

D18: The cycle rank of a connected graph G = (V, E) is the number |E|—|V|+1. (See 
§6.4 for an interpretation of cycle rank as the rank of a vector space.) More generally, for 
a graph G with c(G) components, the cycle rank is the number |E(G)| —|V(G)| + c(G). 
EXAMPLE 


E12: The connected graphs of cycle rank 0 are the trees. The smallest trees are shown 
in Figure 1.2.6. 


; £3 LY ec: 


n=l n=2 n=3 n=5 


Figure 1.2.6: The trees with up to five vertices. 
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FACTS 


F4:; [Tree Characterization Theorem] The following statements are equivalent for a 
graph T with n vertices (e.g., see [GrYe06, Theorem 3.1.8]): 

e T is a tree (that is, G is connected and has no cycles). 

e T is connected and has n— 1 edges. 

e T has no cycles and has n — 1 edges. 


e Any two vertices of T’ are connected by exactly one path. 


F5: [Inductive (Recursive) Definition of Trees] Let T be the family of graphs defined 
as follows: 


(i) Ky eT. 


(ii) If T € T and T’ can be obtained by adding a new vertex and joining it to a vertex 
of T, then T’ € T. 


Then 7 is the family of all trees. 
(Several more classes of recursively defined graphs are presented in §2.4.) 


F6: The cycle rank of a graph is the sum of the cycle ranks of its components. 


F7: A forest is a graph such that every component is a tree. 


Chromatic Number and k-Partite Graphs 


In a proper coloring of a graph, no two vertices with the same color are adjacent, and 
thus, every edges joins vertices in different color classes. The graphs with a proper 
2-coloring are of special interest. Graph coloring is covered extensively in §5.1 and 85.2. 


DEFINITIONS 


D19: A simple graph or multigraph is bipartite if its vertices can be partitioned into 
two sets (called partite sets) in such a way that no edge joins two vertices in the same 
set. (For technical reasons, this includes the graph Ky, in this definition.) If r and s are 
the orders of the partite sets, then the graph is said to be an r-by-s bipartite graph. 


D20: A complete bipartite graph is a simple bipartite graph in which each vertex in 
one partite set is adjacent to all the vertices in the other partite set. If the two partite 
sets have cardinalities r and s, then this graph is denoted K,.s. 


D21: A graph is k-partite if its vertices can be partitioned into k sets (called partite 
sets) in such a way that no edge joins two vertices in the same set. 


D22: A complete k-partite graph is a simple k-partite graph in which two vertices 
are adjacent if and only if they are in different partite sets. All such graphs are called 
complete multipartite graphs. If the k partite sets have orders n1,n2,...,nx, then 
the graph is denoted Ky, »no,...,.n,, and if each partite set has order r, then Ky). 


peas 
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EXAMPLES 
E13: Every tree is bipartite. 


E14: Every cycle with an even number of vertices is bipartite, and no cycle with an 
odd number is bipartite. 


E15: The complete d-partite graph K 42) is isomorphic to the d-octahedral graph Og. 
The first four complete d-partite graphs are shown in Figure 1.2.7. 


Figure 1.2.7: The complete d-partite graphs K4q2), for d=1,...,4. 


FACTS 


F8: [Bipartite Graph Characterization Theorem] A graph is bipartite if and only if the 
length of each of its cycles is even (e.g., see [GrYe06, Theorem 1.5.4]). 


F9: A graph is k-colorable if and only if it is k-partite. 


F10: For k > 3, the problem of deciding whether a graph is k-partite is NP-complete. 


K-Connectivity and K-Edge-Connectivity 


Graphs can be categorized according to their connectivity and their edge-connectivity. 
There are analogues for strong connectedness in digraphs. See $4.1 and §4.7 for extensive 
coverage of connectivity. 


DEFINITIONS 


D23: The (vertex-)connectivity of a graph G, denoted &,(G), is the minimum 
number of vertices whose removal from G leaves a non-connected or trivial graph. 


D24: The edge-connectivity of a nontrivial graph G, denoted «,.(G) is the minimum 
number of edges whose removal from G results on a non-connected graph. 


NOTATION: The subscripted “G” is often suppressed when the graph G is understood. 
Elsewhere, the notation « and X are used instead of k, and Ke, respectively. 


D25: A graph G with connectivity k, > k > 1 is called k-connected. Equivalently, G 
is k-connected if the removal of k — 1 or fewer vertices leaves neither a non-connected 
graph nor a trivial one. 
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D26: A graph G with edge-connectivity kK. > k > 1 is called k-edge-connected. 
That is, the removal of k — 1 or fewer edges from a k-edge-connected graph results in a 
connected graph. 


D27: A digraph is strongly k-connected (or k-strong) if the result of removing any 
set of fewer than k vertices is strongly connected and non-trivial. 


D28: A digraph is strongly k-arc-connected (or k-arc-strong) if the result of 
removing any set of fewer than k arcs is strongly connected and non-trivial. 


Minimum Genus 


Graphs can be categorized according to their topological properties. 


DEFINITIONS 


D29: The minimum genus (or simply the genus) of a connected graph G is the 
smallest number g such that G can be drawn on the orientable surface S, (see §7.1) 
without any edge-crossings. 


D30: A graph of genus 0 is planar. 


1.2.4 Criterion Qualification 


A graph family is also specified as the set of all graphs or digraphs that match a 
stated criterion, e.g., traversibility and various forms of minimality and maximality. 


DEFINITIONS 


D31: A graph is eulerian if it has a closed walk that contains every edge exactly 
once. (See §1.3 for the history of eulerian graphs and §4.2 for an extensive discussion.) 


D32: A graph is hamiltonian if it has a spanning cycle. (See §1.3 for the history of 
hamiltonian graphs and §4.5 for an extensive discussion.) 


D33: A k-chromatic graph is critically k-chromatic if its chromatic number would 
decrease if any edge were removed. (See §5.1.) 


D34: A k-connected graph is critically k-connected if its connectivity would decrease 
if any vertex were removed. (See §4.1.) 


D35: A k-edge-connected graph is critically k-edge-connected if its edge-connectiv- 
ity would decrease if any edge were removed. (See §4.1.) 


D36: A tournament is a digraph in which there is exactly one arc between each pair 
of vertices. (See §3.3.) 
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D37: The line graph L(G) of a graph G has the edges of G as its vertices; two vertices 
of L(G) are adjacent if the edges in G to which they correspond have a common vertex. 
A graph and its line graph are illustrated in Figure 1.2.8. Also, a graph H is said to be 
a line graph if there exists a graph G such that H is isomorphic to L(G). 


G L(G) 


Figure 1.2.8: A graph and its line graph. 


FACTS 


F11: [Line Graph Characterization] The following statements are equivalent: 
e Gis a line graph. 


e [Kr43] The edges of G can be partitioned into complete subgraphs in such a way 
that no vertex is in more than two. 


e [Be70] None of the nine graphs in Figure 1.2.9 is an induced subgraph of G. 


Figure 1.2.9: The nine forbidden induced subgraphs. 


F12: A strongly connected tournament contains a directed spanning cycle. 


EXAMPLE 


E16: The eight tournaments with one to four vertices are shown in Figure 1.2.10. 


Figure 1.2.10: All tournaments with one to four vertices. 
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INTRODUCTION 


Although the first mention of a graph was not until 1878, graph-theoretical ideas 
can be traced back to 1735 when Leonhard Euler (1707-83) presented his solution of 
the Konigsberg bridges problem. This chapter summarizes some important strands in 
the development of graph theory since that time. Further information can be found in 
[BiLIWi98] or [Wi99]. 


1.3.1 Traversability 


The origins of graph theory can be traced back to Euler’s work on the Konigsberg 
bridges problem (1735), which subsequently led to the concept of an eulerian graph. 
The study of cycles on polyhedra by the Revd. Thomas Penyngton Kirkman (1806-95) 
and Sir William Rowan Hamilton (1805-65) led to the concept of a Hamiltonian graph. 


The Konigsberg Bridges Problem 


The Konigsberg bridges problem, pictured in Figure 1.3.1, asks whether there is a con- 
tinuous walk that crosses each of the seven bridges of Konigsberg exactly once — and if 
so, whether a closed walk can be found. See §4.2 for more extensive discussion of issues 
concerning eulerian graphs. 


Figure 1.3.1: The seven bridges of Konigsberg. 


32 Chapter 1. Introduction to Graphs 


FACTS [BiLIWi98, Chapter 1] 


F1: On 26 August 1735 Leonhard Euler gave a lecture on “The solution of a problem 
relating to the geometry of position” to the Academy of Sciences of St. Petersburg, 
Russia, proving that there is no such continuous walk across the seven bridges. 


F2: In 1736, Euler communicated his solution to several other mathematicians, out- 
lining his views on the nature of the problem and on its situation in the geometry of 
position [HoWi04]. 


F3: Euler [Eu:1736] sent his solution of the problem to the Commentarii Academii Sci- 
entiarum Imperialis Petropolitanae under the title “Solutio problematis ad geometriam 
ad geometriam situs pertinentis”. Although dated 1736, it did not appear until 1741, 
and was later republished in the new edition of the Commentarii in 1752. 


F4: Euler’s paper is divided into 21 sections, of which 9 are on the Kénigsberg bridges 
problem, and the remainder are concerned with general arrangements of bridges and 
land areas. 


F5: Euler did not draw a graph in order to solve the problem, but he reformulated the 
problem as one of trying to find a sequence of eight letters A, B, C, or D (the land areas) 
such that the pairs AB and AC are adjacent twice (corresponding to the two bridges 
between A and B and between A and C), and the pairs AD, BD, and CD are adjacent 
just once (corresponding to the remaining bridges). He showed by a counting argument 
that no such sequence exists, thereby proving that the Konigsberg bridges problem has 
no solution. 


F6: In discussing the general problem, Euler first observed that the number of bridges 
written next to the letters A, B, C, etc. together add up to twice the number of bridges. 
This is the first appearance of what some graph-theorists now call the “handshaking 
lemma”, that the sum of the vertex-degrees in a graph is equal to twice the number of 
edges. 


F7: Euler’s main conclusions for the general situation were as follows: 
e If there are more than two areas to which an odd number of bridges lead, then 
such a journey is impossible. 
e If the number of bridges is odd for exactly two areas, then the journey is possible 
if it starts in either of these two areas. 


e If, finally, there are no areas to which an odd number of bridges lead, then the 
required journey can be accomplished starting from any area. 


These results correspond to the conditions under which a graph has an eulerian, or 
semi-eulerian, trail. 


F8: Euler noted the converse result, that if the above conditions are satisfied, then a 
route is possible, and gave a heuristic reason why this should be so, but did not prove it. 
A valid demonstration did not appear until a related result was proved by C. Hierholzer 
[Hi:1873] in 1873. 
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Diagram-Tracing Puzzles 


A related area of study was that of diagram-tracing puzzles, where one is required to 
draw a given diagram with the fewest possible number of connected strokes. Such 
puzzles can be traced back many hundreds of years — for example, there are some early 
African examples. 


FACTS [BiLIWi98, Chapter 1] 


F9: In 1809 L. Poinsot [Po:1809] wrote a memoir on polygons and polyhedra in which 
he posed the following problem: 


Given some points situated at random in space, it is required to arrange a 
single flexible thread uniting them two by two in all possible ways, so that 
the two ends of the thread join up and the total length is equal to the sum 
of all the mutual distances. 


Poinsot noted that a solution is possible only when the number of points is odd, and 
gave a method for finding such an arrangement for each possible value. In modern 
terminology, the question is concerned with eulerian trails in complete graphs of odd 
order. 


F10: Other diagram-tracing puzzles were posed and solved by T. Clausen [C1:1844] 
and J. B. Listing [Li:1847]. The latter appeared in the book Vorstudien zur Topologie, 
the first place that the word “topology” appeared in print. 


F1l: In 1849, O. Terquem asked for the number of ways of laying out a complete 
ring of dominoes. This is essentially the problem of determining the number of eulerian 
tours in the complete graph K7, and was solved by M. Reiss [Re:1871-3] and later by 
G. Tarry. 


F12: The connection between the Konigsberg bridges problem and diagram-tracing 
puzzles was not recognized until the end of the 19th century. It was pointed out by 


W. W. Rouse Ball [Ro:1892] in Mathematical Recreations and Problems. Rouse Ball 
seems to have been the first to use the graph in Figure 1.3.2 to solve the problem. 


B 


Figure 1.3.2: Rouse Ball’s graph of the Konigsberg bridges problem. 
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Hamiltonian Graphs 


A type of graph problem that superficially resembles the eulerian problem is that of 
finding a cycle that passes just once through each vertex of a given graph. Because of 
Hamilton’s influence, such graphs are now called hamiltonian graphs (see $4.5), instead 
of more justly being named after Kirkman, who, prior to Hamilton’s consideration of 
the dodecahedron, as discussed below, considered the more general problem. 


FACTS [BiLIWi98, Chapter 2] 


F13: An early example of such a problem is the knight’s tour problem, of finding a 
succession of knight’s moves on a chessboard, visiting each of the 64 squares just once 
and returning to the starting point. This problem can be dated back many hundreds 
of years, and systematic solutions were given by Euler [Eu:1759], A.-T. Vandermonde 
[Va:1771], and others. 


F14: In 1856 Kirkman [Ki:1856] wrote a paper investigating those polyhedra for which 
one can find a cycle passing through all the vertices just once. He proved that every 
polyhedron with even-sided faces and an odd number of vertices has no such cycle, and 
gave as an example the polyhedron obtained by “cutting in two the cell of a bee” (see 
Figure 1.3.3). 


Figure 1.3.3: Kirkman’s “cell of a bee” example. 


F15: Arising from his work on non-commutative algebra, Hamilton considered cycles 
passing through all the vertices of a dodecahedron. He subsequently invented a game, 
called the icosian game (see Figure 1.3.4), in which the player was challenged to find 
such cycles on a solid dodecahedron, satisfying certain extra conditions. 


Figure 1.3.4: Hamilton’s icosian game. 
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F16: In 1884, P. G. Tait asserted that every 3-valent polyhedron has a hamiltonian 
cycle. This assertion was subsequently disproved by W. T. Tutte [Tu46] in 1946 (see 
Figure 1.3.5). 


Figure 1.3.5: Tutte’s 3-valent non-hamiltonian polyhedron. 


F17: Sufficient conditions for a graph to be hamiltonian were later obtained by G. A. 
Dirac [Di52], O. Ore [Or60], J. A. Bondy and V. Chvatal [BoCh76], and others. 


F18: Hamiltonian digraphs have also been investigated, by A. Ghouila-Houri (1960), 
H. Meyniel (1973), and others. 


1.3.2 Trees 


The concept of a tree, a connected graph without cycles, appeared implicitly in 
the work of Gustav Kirchhoff (1824-87), who employed graph-theoretical ideas in the 
calculation of currents in electrical networks. Later, trees were used by Arthur Cayley 
(1821-95), James Joseph Sylvester (1806-97), Georg Polya (1887-1985), and others, in 
connection with the enumeration of certain chemical molecules. 


Counting Trees 


Enumeration techniques involving trees first arose in connection with a problem in the 
differential calculus, but they soon came to be fundamental tools in the counting of 
chemical molecules, as well as providing a fascinating topic of interest in their own 
right. Enumeration of various kinds of graphs is discussed in §6.3. 


FACTS [BiLIWi98, Chapter 3] [P6Re87] 


F19: While working on a problem inspired by some work of Sylvester on “differential 
transformation and the reversion of serieses”, Cayley [Ca:1857] was led to the enumer- 
ation of rooted trees. 
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F20: Cayley’s method was to take a rooted tree and remove its root, thereby obtaining 
a number of smaller rooted trees (see Figure 1.3.6). 


root 


Figure 1.3.6: Splitting a rooted tree. 


Letting A, be the number of rooted trees with n branches, Cayley proved that the 
generating function 


Tp Aya t+ Aga? +: Age? + cs 
is equal to the product 
(l1—a)-?-(1—2?)-41.(1—23)-*.... 


Using this equality, he was able to calculate the first few numbers A, one at a time. 


F21: Around 1870, Sylvester and C. Jordan independently defined the center/bicenter 
and the centroid/bicentroid of a tree. 


F22: In 1874, Cayley [Ca:1874] found a method for solving the more difficult problem 
of counting unrooted trees. This method, which he applied to chemical molecules, 
consisted essentially of starting at the center or centroid of the tree or molecule and 
working outwards. 


F23: In 1889, Cayley [Ca:1889] presented his n”~? formula for the number of labeled 
trees with n vertices. He explained why the formula holds when n = 6, but he did not 
give a proof in general. The first accepted proof was given by H. Priifer [Pr18]: his 
method was to establish a one-to-one correspondence between such labeled trees and 
sequences of length n — 2 formed from the numbers 1, 2,...,n. 


F24: In a fundamental paper of 1937, Pélya [P637] combined the classical idea of 
a generating function with that of a permutation to obtain a powerful theorem that 
enabled him to enumerate certain types of configuration under the action of a group 
of symmetries. Some of Pélya’s work was anticipated by J. H. Redfield [Re27], but 
Redfield’s paper was obscurely written and had no influence on the development of the 
subject. 


F25: Later results on the enumeration of trees were derived by R. Otter [Ot48] and 
others. The field of graphical enumeration (see [HaPa73]) was subsequently further 
developed by F. Harary [Ha55], R. C. Read [Re63], and others. 
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Chemical Trees 


By 1850 it was already known that chemical elements combine in fixed proportions. 
Chemical formulas such as CH, (methane) and C2.H;OH (ethanol) were known, but it 
was not understood how the elements combine to form such substances. Around this 
time, chemical ideas of valency began to be established, particularly when Alexander 
Crum Brown presented his graphic formulae for representing molecules. Figure 1.3.7 
presents his representation of ethanol, the usual drawing, and the corresponding tree 
graph. 


(1) @) 4 
OO-O-O-D "-e-¢-0-# 
(H) (H) H H 


Figure 1.3.7: Representations of ethanol. 


FACTS [BiLIWi98, Chapter 4] 


F26: Crum Brown’s graphic notation explained for the first time the phenomenon 
of isomerism, whereby there exist pairs of molecules (isomers) with the same chemical 
formula but different chemical properties. Figure 1.3.8 shows isomers with chemical 
formula C4H jo. 


i 
H H H H H=C=$H 
1 tf tot | 
H—C—C—C—C—H 
Cc 
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H H H H H. H LH 
Cc Cc 
H’ H H’% Su 


Figure 1.3.8: Two isomers: butane and isobutane. 


F27: Cayley [Ca:1874] used tree-counting methods to enumerate paraffins (alkanes) 
with up to 11 carbon atoms, as well as various other families of molecules; the followiing 
table gives the number of isomers of alkanes for n = 1,...,8. 


Formula CH, CoHe C3Hg C4Hio CsHy9 CeHi4 C7Hi6 CgHi8 
Number 1 1 1 2 3 5 9 18 


F28: W. K. Clifford and Sylvester believed that a connection could be made between 
chemical atoms and binary quantics in invariant theory, a topic to which Cayley and 
Sylvester had made significant contributions. In 1878, Sylvester [Sy:1877-8] wrote a 
short note in Nature about this supposed connection, remarking that: 


Every invariant and covariant thus becomes expressible by a graph precisely 
identical with a Kekuléan diagram or chemicograph. 


This was the first appearance of the word graph in the graph-theoretic sense. 
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F29: In 1878, Sylvester [Sy:1878] wrote a lengthy article on the graphic approach to 
chemical molecules and invariant theory in the first volume of the American Journal of 
Mathematics, which he had recently founded. 


F30: Little progress was made on the enumeration of isomers until the 1920s and 
1930s. A. C. Lunn and J. K. Senior [LuSe29] recognized the importance of permutation 
groups for this area, and Pdlya’s above-mentioned paper solved the counting problem 
for several families of molecules. 


1.3.3 Topological Graphs 


Euler’s polyhedron formula [Eu:1750] was the foundation for topological graph the- 
ory, since it holds also for planar graphs. It was later extended to surfaces other than 
the sphere. In 1930, a fundamental characterization of graphs imbeddable in the sphere 
was given by Kazimierz Kuratowski (1896-1980), and recent work — notably by Neil 
Robertson, Paul Seymour, and others — has extended these results to the higher order 
surfaces. 


Euler’s Polyhedron Formula 


The Greeks were familiar with the five regular solids, but there is no evidence that they 
knew the simple connection between the numbers V of vertices, E of edges, and F of 
faces of a polyhedron: 

V-E+F=2 


In the 17th century, René Descartes studied polyhedra, and he obtained results from 
which Euler’s formula could later be derived. However, since Descartes had no concept 
of an edge, he was unable to make such a deduction. 


FACTS [BiLIWi98, Chapter 5] [Cr99] [BeWi09] 


F31: The first appearance of the polyhedron formula appeared in a letter, dated 14 
November 1750, from Euler to C. Goldbach. Denoting the number of faces, solid angles 

(vertices) and joints (edges) by H, S, and A, he wrote: 
e In every solid enclosed by plane faces the aggregate of the number of faces and 
the number of solid angles exceeds by two the number of edges, or H+S = A+2. 


F32: Euler was unable to prove his formula. In 1752 he attempted a proof by dissection, 
but it was deficient. The first valid proof was given by A.-M. Legendre [Le:1794] in 1794, 
using metrical properties of spherical polygons. 


F33: In 1813, A.-L. Cauchy [Ca:1813] obtained a proof of Euler’s formula by stereo- 
graphically projecting the polyhedron onto a plane and considering a triangulation of 
the resulting planar graph. 


F34: Around the same time, S.-A.-J. Lhuilier [Lh:1811] gave a topological proof that 
there are only five regular convex polyhedra, and he anticipated the idea of duality 
by noting that four of them occur in reciprocal pairs. He also found three types of 
polyhedra for which Euler’s formula fails — those with indentations in their faces, those 


Section 1.3. History of Graph Theory 39 


with an interior cavity, and ring-shaped polyhedra drawn on a torus (that is, polyhedra 
with a ‘tunnel’ through them). For such ring-shaped polyhedra, Lhuilier derived the 
formula 


V-E+F=0 


and extended his discussion to prove that, if g is the number of tunnels in a surface on 
which a polyhedral map is drawn, then 


V-E+F=2-2% 


The number g is now called the genus of the surface, and the value of the quantity 
2 — 2g is called the Euler characteristic. (See §7.1.) 


F35: In 1861-2, Listing [Li:1861—2] wrote Der Census réumliche Complexe, an exten- 
sive investigation into complexes, and studied how their topological properties affect 
the generalization above of Euler’s formula. This work proved to be influential in the 
subsequent development of topology. In particular, H. Poincaré took up Listing’s ideas 
in his papers of 1895-1904 that laid the foundations for algebraic topology. 


F36: Poincaré’s work was instantly successful, and it appeared in an article by M. Dehn 
and P. Heegaard [DeHe07] on analysis situs (topology) in the ten-volume Encyklopadie 
der Mathematischen Wissenschaften. His ideas were further developed by O. Veblen 
[Ve22] in a series of colloquium lectures on analysis situs for the American Mathematical 
Society in 1916. 


Planar Graphs 


The study of planar graphs originated in two recreational problems involving the com- 
plete graph Ks and the complete bipartite graph K33. These graphs (shown in Figure 
1.3.9) are the main obstructions to planarity, as was subsequently demonstrated by 
Kuratowski. 


Figure 1.3.9: The Kuratowski graphs Ks and K33. 


FACTS [BiLIWi98, Chapter 8] 


F37: Around the year 1840, A. F. Mobius presented the following puzzle to his students: 


There was once a king with five sons. In his will he stated that after his death 
the sons should divide the kingdom into five regions so that the boundary 
of each region should have a frontier line in common with each of the other 
four regions. Can the terms of the will be satisfied? 
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This question asks whether one can draw five mutually neighboring regions in the plane. 
The connection with graph theory can be seen from its dual version, later formulated 
by H. Tietze: 


The king further stated that the sons should join the five capital cities of his 
kingdom by roads so that no two roads intersect. Can this be done? 


In this dual formulation, the problem is that of deciding whether the graph Ks; is planar. 


F38: An old problem, whose origins are obscure, is the utilities problem, or gas—water— 
electricity problem, mentioned by H. Dudeney [Du13] in the Strand Magazine of 1913: 


The puzzle is to lay on water, gas, and electricity, from W, G, and E, to 
each of the three houses, A, B, and C, without any pipe crossing another 
(see Figure 1.3.10). 


This problem is that of deciding whether K3,3 is planar. 


Figure 1.3.10: The gas—water-electricity problem. 


F39: In 1930 Kuratowski [Ku30] published a celebrated paper proving that every 
nonplanar graph has a subgraph homeomorphic to Ks or K3,3; this result was obtained 
independently by O. Frink and P. A. Smith. 


F40: In 1931 H. Whitney [Wh31] discovered an abstract definition of duality that is 
purely combinatorial and agrees with the geometrical definition of duality for planar 
graphs. He proved that, with this general definition of duality, a graph is planar if and 
only if it has an abstract dual. Related results were obtained by S. MacLane and others. 


F41: In 1935 Whitney [Wh35] generalized the idea of independence in graphs and 
vector spaces to the concept of a matroid. The dual of a matroid extends and clarifies 
the duality of planar graphs, and Tutte [Tu59] used these ideas in the late 1950s to 
obtain a Kuratowski-type criterion for a matroid to arise from a graph (see §6.6). 


Graphs on Higher Surfaces 


A graph drawn without crossings on a plane corresponds (by stereographic projection) 
to a graph similarly drawn on the surface of a sphere. This leads to the idea of graphs 
drawn on surfaces other than the sphere. The initial work in this area was carried out, in 
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the context of coloring maps, by Percy Heawood (1861-1955) and Lothar Heffter (1862— 
1962) for orientable surfaces, and by Heinrich Tietze (1880-1964) for non-orientable 
surfaces, but the basic problems in the area were not solved until Gerhard Ringel and 
Ted Youngs solved the Heawood conjecture in the 1960s and Neil Robertson and Paul 
Seymour generalized Kuratowski’s theorem to other surfaces in the 1980s. 


FACTS [BiLIWi98, Chapter 7; Ri74] 


F42: In 1890, Heawood [He:1890] presented an imbedding of the complete graph K7 on 
a torus. He also derived a formula for the genus of a surface on which a given complete 
graph can be imbedded, but his attempted proof of this formula was deficient. 


F43: In 1891, L. Heffter [He:1891] investigated the imbedding of complete graphs on 
orientable surfaces other than the sphere and the torus, and he proved that Heawood’s 
formula is correct for orientable surfaces of low genus and certain other surfaces. 


F44: In 1910, H. Tietze [Til0] extended Heffter’s considerations to certain non-orient- 
able surfaces, such as the Mobius band and the projective plane, and stated a corre- 
sponding Heawood formula. He was unable to prove it for the Klein bottle, but this 
case was settled in 1934 by P. Franklin [Fr34], who found that it was an exception to the 
formula. In 1935, I. N. Kagno [Ka35] proved the formula for surfaces of non-orientable 
genus 3, 4, and 6. 


F45: The Heawood formula for general non-orientable surfaces was proved in 1952 by 
Ringel. The proof for orientable surfaces proved to be much more difficult, involving 
300 pages of consideration of 12 separate cases. Most of these were settled in the mid- 
1960s, and the proof was completed in 1968 by Ringel and Youngs [RiYo68], using 
W. Gustin’s [Gu63] combinatorial inspiration in 1963 of a current graph. Since then, 
the transformation by J. L. Gross [Gr74] of numerous types of specialized combinatorial 
current graphs into a unified topological object, with its dualization to a voltage graph 
(see §7.4), has led to simpler solutions (see Gross and T. W. Tucker [GrTu74]). 


F46: In a sequence of papers in the 1980s of great mathematical depth, Robertson 
and Seymour [RoSe85] proved that, for each orientable genus g, the set of “forbidden 
subgraphs” is finite (see §7.7). However, apart from the sphere, the number of forbidden 
subgraphs runs into hundreds, even for the torus. For non-orientable surfaces, there is 
a similar result, and in 1979 H. H. Glover, J. P. Huneke, and C. S. Wang [GlHuWa79] 
obtained a set of 103 forbidden subgraphs for the projective plane. 


1.3.4 Graph Colorings 


Early work on colorings concerned the coloring of the countries of a map and, in 
particular, the celebrated four-color problem. This was first posed by Francis Guthrie 
in 1852, and a celebrated (incorrect) “proof” by Alfred Bray Kempe appeared in 1879. 
The four-color theorem was eventually proved by Kenneth Appel and Wolfgang Haken 
in 1976, building on the earlier work of Kempe, George Birkhoff, Heinrich Heesch, 
and others, and a simpler proof was subsequently produced by Neil Robertson, Daniel 
Sanders, Paul Seymour, and Robin Thomas [1997]. Meanwhile, attention had turned 
to the dual problem of coloring the vertices of a planar graph and of graphs in general. 
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There was also a parallel development in the coloring of the edges of a graph, starting 
with a result of Tait [1880], and leading to a fundamental theorem of V. G. Vizing 
in 1964. As mentioned earlier, the corresponding problem of coloring maps on other 
surfaces was settled by Ringel and Youngs in 1968. 


The Four-Color Problem 


Many developments in graph theory can be traced back to attempts to solve the cele- 
brated four-color problem on the coloring of maps. 


FACTS [BiLIWi98, Chapter 6] [Wi02] 


F47: The earliest known mention of the four-color problem occurs in a letter from 
A. De Morgan to Hamilton, dated 23 October 1852. De Morgan described how a student 
had asked him whether every map can be colored with just four colors in such a way that 
neighbouring countries are colored differently. The student later identified himself as 
Frederick Guthrie, giving credit for the problem to his brother Francis, who formulated 
it while coloring the counties of a map of England. Hamilton was not interested in the 
problem. 


F48: De Morgan wrote to various friends, outlining the problem and trying to describe 
where the difficulty lies. On 10 April 1860, the problem appeared in print, in an unsigned 
book review in the Athenaeum, written by De Morgan. This review was read in the U.S. 
by C. S. Peirce, who developed a life-long interest in the problem. An earlier printed 
reference, signed by “F.G.”, appeared in the Athenaeum in 1854 [McK12]. 


F49: On 13 June 1878, at a meeting of the London Mathematical Society, Cayley 
asked whether the problem had been solved. Shortly after, he published a short note 
describing where the difficulty might lie, and he showed that it is sufficient to restrict 
one’s attention to trivalent maps. 


F50: In 1879, Kempe [Ke:1879], a former Cambridge student of Cayley, published 
a purported proof of the four-color theorem in the American Journal of Mathematics, 
which had recently been founded by Sylvester. Kempe showed that every map must 
contain a country with at most five neighbours, and he showed how any coloring of the 
rest of the map can be extended to include such a country. His solution included a 
new technique, now known as a Kempe-chain argument, in which the colors in a two- 
colored section of the map are interchanged. Kempe’s proof for a map that contains a 
digon, triangle, or quadrilateral was correct, but his argument for the pentagon (where 
he used two simultaneous color-interchanges) was fallacious. 


F51: In 1880, Tait [Ta:1878-80] presented “improved proofs” of the four-color theorem, 
all of them fallacious. Other people interested in the four-color problem at this time 
were C. L. Dodgson (Lewis Carroll), F. Temple (Bishop of London), and the Victorian 
educator J. M. Wilson. 


F52: In 1890, Heawood [He:1890] published a paper in the Quarterly Journal of Pure 
and Applied Mathematics, pointing out the error in Kempe’s proof, salvaging enough 
to deduce the five-color theorem, and generalizing the problem in various ways, such as 
for other surfaces (see §1.1.3). Heawood subsequently published another six papers on 
the problem, the last while he was in his 90th year. Kempe admitted his error, but he 
was unable to put it right. 
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F53: During the first half of the 20th century two ideas emerged, each of which 
finds its origin in Kempe’s paper. The first is that of an unavoidable set — a set 
of configurations, at least one of which must appear in any map. Unavoidable sets 
were produced by P. Wernicke [We:1904] (see Figure 1.3.11), by P. Franklin, and by H. 
Lebesgue. 


uadrilateral 
i two pentagons pentagon and hexagon 


Figure 1.3.11: Wernicke’s unavoidable set. 


The second is that of a reducible configuration — a configuration of countries with the 
property that any coloring of the rest of the map can be extended to the configuration: 
no such configuration can appear in any counter-example to the four-color theorem. 
Birkhoff [Bi13] showed that the arrangement of four pentagons in Figure 1.3.12 (known 
as the Birkhoff diamond) is a reducible configuration. 


Figure 1.3.12: The Birkhoff diamond. 


F54: In 1912, Birkhoff [Bil2] investigated the number of ways of coloring a given map 
with & colors, and he showed that this is always a polynomial in k, now called the 
chromatic polynomial of the map. 


F55: In 1922, Franklin [Fr22] presented further unavoidable sets and reducible config- 
urations, and he deduced that the four-color theorem is true for all maps with up to 25 
countries. This number was later increased several times by other authors. 


F56: Around 1950 Heesch started to search for an unavoidable set of reducible con- 
figurations. Over the next few years, Heesch [He69] produced thousands of reducible 
configurations. 


F57: In 1976, Appel and Haken [ApHa77, ApHako77], with the assistance of J. 
Koch, obtained an unavoidable set of 1482 reducible configurations, thereby proving 
the four-color theorem. Their solution required substantial use of a computer to test 
the configurations for reducibility. 


F58: Around 1994, Robertson, Sanders, Seymour, and Thomas [RoSaSeTh97] pro- 
duced a more systematic proof. Using a computer to assist with both the unavoidable 
set and the reducible configuration parts of the solution, they systematized the Appel-— 
Haken approach, and they obtained an unavoidable set of 633 reducible configurations. 
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Other Graph Coloring Problems 


Arising from work on the four-color problem, progress was being made on other graph 
problems involving the coloring of edges or vertices. 


FACTS [BiLIWi98, Chapter 6] [FiWi77] [JeTo95] 


F59: In his 1879 paper on the coloring of maps, Kempe [Ke:1879] outlined the dual 
problem of coloring the vertices of a planar graph in such a way that adjacent vertices 
are colored differently. This dual approach to map-coloring was later taken up by 
H. Whitney in a fundamental paper of 1932 and by most subsequent workers on the 
four-color problem. 


F60: In 1880, Tait [Ta:1878-80] proved that the four-color theorem is equivalent to 
the statement that the edges of every trivalent map can be colored with three colors in 
such a way that each color appears at every vertex. 


F61: In 1916, D. K6nig [K616] proved that the edges of any bipartite graph with 
maximum degree d can be colored with d colors. (See §11.3.) 


F62: The idea of coloring the vertices of a graph so that adjacent vertices are colored 
differently developed a life of its own in the 1930s, mainly through the work of Whitney, 
who wrote his Ph.D. thesis on the coloring of graphs. 


F63: In 1941, L. Brooks [Br41] proved that the chromatic number of any simple graph 
with maximum degree d is at most d+ 1, with equality only for odd cycles and odd 
complete graphs. (See 85.1.) 


F64: In the 1950s, substantial progress on vertex-colorings was made by G. A. Dirac, 
who introduced the idea of a critical graph. 


F65: In 1964, V. G. Vizing [Vi64] proved that the edges of any simple graph with 
maximum degree d can always be colored with d+ 1 colors. In the following year, 
Vizing produced many further results on edge-colorings. 


F66: The concepts of the chromatic number and edge-chromatic number of a graph 
have been generalized by a number of writers — for example, M. Behzad and others 
introduced total colorings in the 1960s, and P. Erdos and others introduced list colorings. 


Factorization 


A graph is k-regular if each of its vertices has degree k. Such graphs can sometimes 
be split into regular subgraphs, each with the same vertex-set as the original graph. A 
k-factor in a graph is a k-regular subgraph that contains all the vertices of the original 
graph. Fundamental work on factors in graphs was carried out by Julius Petersen 
[1839-1910] and W. T. Tutte [1914-2002]. (See §5.4.) 


FACTS [BiLIWi98, Chapter 10] 


F67: In 1891, Petersen [Pe:1891] wrote a fundamental paper on the factorization of 
regular graphs, arising from a problem in the theory of invariants. In this paper he 
proved that if k is even, then any k-regular graph can be split into 2-factors. He also 
proved that any 3-regular graph possesses a 1-factor, provided that it has not more than 
two “leaves”; a leaf is a subgraph joined to the rest of the graph by a single edge. 
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F68: In 1898, Petersen [Pe:1898] produced a trivalent graph with no leaves, now called 
the Petersen graph (see Figure 1.3.13), which cannot be split into three 1-factors; 
it can, however, be split into a 1-factor (the spokes) and a 2-factor (the pentagon and 
pentagram). 


Figure 1.3.13: The Petersen graph. 


F69: In 1947, Tutte [Tu47] produced a characterization of graphs that contain a 
1-factor. Five years later he extended his result to a characterization of graphs that 
contain a k-factor, for any k. 


1.3.5 Graph Algorithms 


Graph theory algorithms can be traced back to the 19th century, when Fleury gave 
a systematic method for tracing an eulerian graph and G. Tarry showed how to escape 
from a maze (see §4.2). The 20th century saw algorithmic solutions to such problems 
as the minimum connector problem, the shortest and longest path problems, and the 
Chinese Postman Problem (see §4.3), as well as to a number of problems arising in 
operational research. In each of these problems we are given a network, or weighted 
graph, to each edge (and/or vertex) of which has been assigned a number, such as its 
length or the time taken to traverse it. 


FACTS [Da82] [LLRS85] [LoP186] 


F70: The Traveling Salesman Problem, in which a salesman wishes to make a cyclic 
tour of a number of cities in minimum time or distance, appeared in rudimentary form 
in 1831. It reappeared in mathematical circles in the early 1930s, at Princeton, and was 
later popularized at the RAND Corporation. This led to a fundamental paper of G. B. 
Dantzig, D. R. Fulkerson, and S. M. Johnson [DaFuJo54] that included the solution of 
a traveling salesman problem with 49 cities. In the 1980s a problem with 2392 cities 
was settled by Padberg and Rinaldi [PaRi87]. (See §4.6.) 


F71: The greedy algorithm for the minimum connector problem, in which one seeks a 
minimum-length spanning tree in a weighted graph, can be traced back to O. Boruvka 
[Bo26] and was later rediscovered by J. B. Kruskal [Kr56]. A related algorithm, due to 
V. Jarnik (1931), was rediscovered by R. C. Prim (1957). (See §10.1.) 


F72: Graph algorithms were developed by D. R. Fulkerson and G. B. Dantzig [FuDa55] 
for finding the maximum flow of a commodity between two nodes in a capacitated 
network, and by R. E. Gomory and T. C. Hu [GoHu61] for determining maximum flows 
in multi-terminal networks. 
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F73: Finding a longest path, or critical path, in an activity network dates from the 
1940s and 1950s, with PERT (Program Evaluation and Review Technique) used by the 
U.S. Navy for problems involving the building of submarines and CPM (Critical Path 
Method) developed by the Du Pont de Nemours Company to minimize the total cost of 
a project. (See §3.2.) 


F74: There are several efficient algorithms for finding the shortest path in a given 
network, of which the best known is due to E. W. Dijkstra [Di59]. (See §10.1.) 


F75: The Chinese postman problem, for finding the shortest route that covers each 
edge of a given weighted graph, was originated by Meigu Guan (Mei-Ku Kwan) [Gu60] 
in 1960. (See §4.3.) 


F76: In matching and assignment problems one wishes to assign people as appropriately 
as possible to jobs for which they are qualified. This work developed from work of Konig 
and from a celebrated result on matching due to Philip Hall [Ha35], later known as the 
“marriage theorem” [HaVa50]. These investigations led to the subject of polyhedral 


combinatorics and were combined with the newly emerging study of linear programming. 
(See §11.3.) 


F77: By the late 1960s it became clear that some problems seemed to be more diffi- 
cult than others, and Edmonds [Ed65] discussed problems for which a polynomial-time 
algorithm exists. Cook [Co71], Karp [Ka72], and others later developed the concept of 
NP-completeness. The assignment, transportation, and minimum spanning-tree prob- 
lems are all in the polynomial-time class P, while the traveling salesman and Hamiltonian 
cycle problems are NP-hard. It is not known whether P = NP. Further information can 
be found in [GaJo79]. 
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Glossary for Chapter 1 


bipartite graph: a graph whose vertices can be partitioned into two sets (called the 
partite sets) in such a way, that no edge joins two vertices in the same set. (For 
technical reasons, this includes the graph Ky in this definition.) 

bouquet B,,: the general graph with one vertex and n self-loops. 

Cayley graph C(A, X) — for a group A with generating set X: the digraph whose 
vertex-set is A with an edge directed from a to ax for every a € A and every 
xz € X. Sometimes two oppositely directed edges corresponding to an involution x 
are merged into a single undirected edge. (The underlying undirected graph of a 
Cayley graph is also commonly called a Cayley graph.) 

circulant graph Circ(n; X): a Cayley graph for a cyclic group Zp. 

complete k-partite graph K,,, »,,....n,: @Simple k-partite graph such that two vertices 
are adjacent if and only if they are in different partite sets. All such graphs are called 
complete multipartite graphs. 

complete bipartite graph: a simple bipartite graph such that each vertex in one 
partite set is adjacent to all the vertices in the other partite set. If the two partite 
sets have cardinalities r and s, then this graph is denoted K,.s. 


complete digraph K n: the simple digraph on n vertices such that between every pair 
of vertices, there is an arc in both directions. 

complete graph K,,: the simple graph with n vertices in which every pair of vertices 
is joined by an edge. 

k-connected graph: a graph such that the result of removing fewer than k vertices is 
connected and nontrivial, for all possible choices of the vertices. 

connectivity of a graph G: the largest number k such that G is k-connected. It is 
denoted «(G) or Ky (G). 

critically k-chromatic graph: a graph of chromatic number k whose chromatic num- 
ber would decrease if any edge were removed. (See §5.1.) 

critically k-connected graph: a graph of connectivity k whose connectivity would 
decrease if any vertex were removed. (See §4.1.) 

critically k-edge-connected graph: a graph of edge-connectivity k whose edge- 
connectivity would decrease if any edge were removed. (See §4.1.) 

cube: see hypercube. 

cube graph: see hypercube graph. 

cycle graph C,,: the n-vertex graph with n edges, such that every edge lies on a single 
cycle. 

cycle rank — for a graph G = (V,£) with c(G) components: the number |E(G)| — 
|V(G)| + c(G). 

dipole D,,: the multigraph with two vertices and n edges. 

edge-connectivity of a graph G: the largest number & such that G is k-edge- 
connected. It is denoted «’(G) or Kp(G). 
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k-edge-connected graph: a graph such that the result of removing fewer than k edges 
is connected and nontrivial, for all possible choices of the edges. 

empty graph K,,: the graph with n vertices and no edges. 

eulerian graph: a graph with a closed walk that contains every edge exactly once. 
(See §1.3 for the history of eulerian graphs and §4.2 for an extensive discussion.) 

k-factor of a graph G: a k-regular subgraph. 

genus: see minimum genus. 

hamiltonian graph: a graph that has a spanning cycle. (See Section 1.3 for the history 
of hamiltonian graphs and Section 4.6 for an extensive discussion.) 

hypercube of dimension d: {(71,...,2a) | 0 > x;)}. 

hypercube graph Qz,: the 1-skeleton of a d-dimensional hypercube. 

line graph of a graph G: the simple graph L(G) whose vertex-set is the edge-set of G, 
and in which two vertices are adjacent if the edges in G to which they correspond 
have a common vertex. Also, a graph H is said to be a line graph if there exists a 
graph G such that H is isomorphic to L(G). 

minimum genus (or genus) of a connected graph G: the smallest number g such 
that G can be drawn on the orientable surface S, (see Section 7.1) without any 
edge-crossings. Notation: Ymin(G) or y(G). 

null graph Ko: the graph with no vertices and no edges. 

octahedral graph Oj ,: the edge-complement of a 1-factor in Koq. 

partite sets: see k-partite graph. 

p-partite graph: a graph whose vertex-set can be partitioned into p subsets (called 
the partite sets) in such a way that no edge joins two vertices in the same subset. 

Petersen graph: a 10-vertex 3-regular graph, commonly depicted as a 5-pointed star 
inside a pentagon, with a 1-factor joining the vertices of the pentagon to the points 
of the star. 

path graph P,,: the n-vertex graph with n — 1 edges, such that every edge lies on 
a single open path. (Quite commonly elsewhere, the subscript of the notation P,, 
denotes the number of edges.) 

planar graph: a graph of minimum genus 0, i.e., a graph that can be drawn in the 
sphere or plane with no edge crossings. 

platonic graph: the skeleton of any of the five platonic solids. 

platonic solid: any of the five regular 3-dimensional polyhedra — tetrahedron, cube, 
octahedron, dodecaheron, icosahedron. 

regular graph: a graph in which every vertex is of the same degree. It is k-regular if 
every vertex is of degree k. 

simplex: the convex hull of a set S of affinely independent points in Euclidean space. 
It is a k-simplez if |S| =k +1. 

skeleton (or 1-skeleton) of a k-complex K: the graph consisting of the vertices and the 
edges of K. 

trivial graph K,: the graph with one vertex and no edges. 
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INTRODUCTION 


Many problems in science and engineering can be modeled in terms of directed 
and undirected graphs. The data structures and algorithms used to represent graphs 
can have a significant impact on the size of problems that can be implemented on a 
computer and the speed with which they can be solved. This section presents the 
fundamental representations used in computer programs for graphs and illustrates the 
tradeoffs among the representations using key algorithms for some of the most common 
graph problems. 

Throughout this section we use the notation |X| to denote the number of elements 
in a set X. The graphs and digraphs in this section are assumed to be simple. 


2.1.1 Basic Representations for Graphs 

The two most basic representations for a graph are the adjacency matrix and the 
adjacency list. 
DEFINITIONS 


D1: A directed graph or digraph G = (V,E) consists of a finite, nonempty set of 
vertices V and a set of edges E. Each edge is an ordered pair (v, w) of vertices. 


D2: An undirected graph G = (V, E) consists of a finite, nonempty set of vertices V 
and a set of edges E. Each edge is a set {v, w} of vertices. 


D3: In a directed graph G = (V, E), vertex w is adjacent to vertex v if (v,w) is an 
edge in EF. The number of vertices adjacent to v is called the out-degree of v. 


D4: In an undirected graph G = (V, E), vertex w is adjacent to vertex v if {v,w} is 
an edge in EF. The number of vertices adjacent to v is called the degree of v. 
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D5: A path in a directed or undirected graph is a seqence of edges (v1, v2), (v2, U3),-- +; 
(Un—1, Un). This path is from vertex v1 to vertex v, and has length n — 1. 


D6: A graph G = (V, E) is dense when the number of edges is close to |V|?. 

D7: A graph G = (V, £) is sparse when the number of edges is much less than |V|?. 
D8: An adjacency matrix representation for a simple graph or digraph G = (V, E) 
is a |V| x |V| matrix A, where Ali, j] = 1 if there is an edge from vertex i to vertex J; 


Ali, j] = 0 otherwise. 


D9: An adjacency list representation for a graph or digraph G = (V, E£) is an array 
L of |V| lists, one for each vertex in V. For each vertex i, there is a pointer L; to a 
linked list containing all vertices 7 adjacent to 7. A linked list is terminated by a nil 
pointer. 


D10: An incidence matrix representation for a simple digraph G = (V,F) is a 
|V| x |E| matrix I, where 


—1_ if edge e is directed to vertex v 
I[v,e] = 41 _ if edge e is directed from vertex v 
0 otherwise 


For an undirected graph, I[v,e] = 1 if e is incident on v and 0 otherwise. 


EXAMPLES 


E1: Figure 2.1.1 shows the adjacency matrix and adjacency list representations of a 
directed graph. 


ee (2) List Headers 


Oma | 
(a) 2 = nil 


RWON Fr 


(b) (c) 


Figure 2.1.1: (a) A directed graph G. (b) Adjacency matrix for G. 
(c) Adjacency list representation for G. 
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E2: An incidence matrix for the digraph of Figure 2.1.1 is shown below. 
(1,2) (1,4) (2,3) (4,2) (4,3) 


1 1 1 0 0 0 
Hee ah ee =i 0 1 =a 0 
aos A 0 0 =] 0 =f 

4 0 £4 0 i 1 


FACTS 


F1: An adjacency matrix representation for a graph G = (V, E) always takes O(|V|?) 
space. 


F2: An adjacency list representation for a graph G = (V, E) takes O(|V| +|E]) space. 


REMARKS 


R1: Fora more detailed discussion of graph representations, see [AhHoU174, AhHoU183, 
CoLeRiSt09, Ev79, Ta83]. 


R2: As a general rule, an adjacency list representation is preferred when a graph is 
sparse, because it takes space that is linearly proportional to the number of vertices and 
edges. 


R3: When a graph G = (V, £) is dense, both an adjacency matrix and an adjacency 
list representation require O(|V|?) space. However, with the adjacency matrix, we can 
determine whether an edge exists in constant time, whereas with the adjacency list we 
may need O(|V]|) time. For this reason, adjacency matrix representations are often used 
with dense graphs. 


RA: Note that in an adjacency list representation of an undirected graph, an edge 2, 7 
appears on two adjacency lists: the list for vertex 7 and the list for vertex 7. 


2.1.2 Graph Traversal Algorithms 


One of the most fundamental tasks in algorithms involving graphs is visiting the 
vertices and edges of a graph in a systematic order. Depth-first and breadth-first search 
are frequently used traversal techniques for both directed and undirected graphs. For 
both these techniques, the adjacency list representation of a graph works well. 


Depth-First Search 


ALGORITHM 


Depth-first search systematically visits all the vertices of a graph. Initially, all vertices 
are marked “new”. When a vertex is visited, it is marked “old”. Depth-first search works 
by selecting a new vertex v, marking it old, and then calling itself recursively on each of 
the vertices adjacent to v. The algorithm below is called “depth-first search” because it 
searches along a path in the forward (deeper) direction looking for new vertices as long 
as it can. 
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Algorithm 2.1.1 Depth-First Search 


Input: A graph G = (V, E), where V = {1,2,...,n} 
and L[v] is a pointer to the list of vertices adjacent to vertex v. 
Output: Traversal of all vertices in V in a depth-first order. 


procedure DepthFirstSearch(G){ 
for v:=1tondo 
mark|v] :-= new; 
for v:=1tondo 
if mark[v] = new; then 
df s(v); 
} 
procedure dfs(v){ 
mark{v] := old; 
for each vertex w on L{v| do 
if mark[w] = new then 


df s(w); 


DEFINITIONS 


During the course of its traversal, depth-first search partitions the graph into a collection 
of depth-first trees that make up a depth-first forest. The forest and its trees are 
determined by the edges, which are partitioned by the search into four sets: 


D11: Tree edges are those edges (v, w) where w is first encountered by exploring edge 
(v,w). 


D12: Back edges are those edges (v, w) that connect a vertex v to an ancestor w in 
a depth-first tree. 


D13: Forward edges are those nontree edges (v,w) that connect a vertex uv to a 
proper descendant in a depth-first tree. 


D14: Cross edges are the remaining edges. They connect vertices that are neither 
ancestors nor descendants of one another. 


FACTS 


F3: Depth-first search takes O(|V| + ||) time on a graph G = (V, E). 


F4: If we represent the first visit of a vertex v with a left parenthesis “(v” and its 
last visit by a right parenthesis “v)”, then the sequence of first and last visits forms an 
expression in which the parentheses are properly nested. 
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F5: In a depth-first search of an undirected graph, every edge is either a tree edge or 
a back edge. 


REMARKS 


R5: Depth-first search is a fundamental graph algorithm that has been in use since 
the 1950s. [Ta72, HoTa73] developed several efficient graph algorithms using depth-first 
search. 


R6: Depth-first search forms the basis of many important graph algorithms such as 
determining the biconnected components of an undirected graph and finding the strongly 
connected components of a directed graph. 


Breadth-First Search 


Breadth-first search is another fundamental technique for exploring a graph G. It starts 
from a specified source vertex s from which it constructs a breadth-first tree consisting 
of all vertices of G reachable from s. In the process it computes a breadth-first tree 
rooted at s such that if a vertex v is reachable from s in G, there is a path in the tree 
from the root to s. The path in the tree is a shortest path from s to v in G. 


ALGORITHM 


Breadth-first search uses the abstract data type queue to hold vertices as they are being 
processed. The operation enqueue(s,@Q) places vertex s on the back of the queue Q. 
The operation dequeue(Q) removes the element at the front of the queue Q. 


Breadth-first search (Algorithm 2.1.2) visits the vertices of a graph G uniformly across 
the breadth of the frontier of its search, visiting all vertices at distance d from s, before 
looking for vertices at distance d+ 1. In contrast, depth-first search plunges as deeply 
into the graph along a path as it can before backtracking to visit nodes closer to s. 


DEFINITION 


D15: Let BFT be the tree with root s, vertices v such that parent{v] is not nil, and 
edges {(parent|v],v)|parent[v] is not nil}. BFT is the breadth-first tree constructed 
by BreadthFirstSearch(G, s). 


FACTS 
F6: Breadth-first search takes O(|V|+ ||) time on a graph G = (V, E). 


F7: BreadthFirstSearch(G,s) computes the length of the shortest path from s to v 
in distance|v]. 
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Algorithm 2.1.2 Breadth-First Search 


Input: A graph G = (V, F), where V = {1,2,...,n}, Lv] is a pointer to the list 
of vertices adjacent to vertex v, and where s is a specified source vertex. 
Output: A breadth-first tree consisting of root s and all vertices in V 
that are reachable from s. 


procedure BreadthFirstSearch(G, s) { 
for v:= 1 to n do { 
mark|v] :-= new; 
distance|v] := 00; 
parent{v] := nil; 


mark|s] := visited; 
distance|s] := 0; 
initialize queue Q; 
enqueue(s, Q); 
while Q is not empty do { 
v := dequeue(Q); 
for each vertex w on L/v] do 
if mark|w] = visited then { 


mark|w] := visited; 
distance[w] := distance[w] + 1; 
parent[w] := v; 


enqueue(w, Q); 


REMARKS 


R7: Like depth-first search, breadth-first search has been used since the 1950s. Early 
applications of breadth-first search included maze searching and routing wires on printed 
circuit boards. 


R8: The ideas found in breadth-first search are the building blocks of many other 
graph algorithms such as Dijkstra’s single-source shortest-paths algorithm and Prim’s 
algorithm for finding minimal spanning trees. 


2.1.3 All-Pairs Problems 


This section considers two algorithms: one for computing the shortest paths between 
all pairs of vertices in a directed graph and the other for computing the transitive closure 
of a directed graph. For both algorithms the adjacency matrix is a natural representation 
for the graph. 
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All-Pairs Shortest-Paths Algorithm 


Suppose that we have a schedule that tells us the driving time between n cities at a 
given time of day and that we wish to compute the shortest driving time between all 
pairs of cities. This is an instance of the all-pairs shortest-paths problem. We could 
iterate through every pair of cities and compute the shortest path between each using 
a single-source shortest-path algorithm such as Dijkstra’s algorithm. 


ALGORITHM 


An easier way is to use the Floyd—Warshall algorithm below. The natural representation 
for a graph in the Floyd—Warshall algorithm is an adjacency matrix. Assume that we are 
given a directed graph G = (V, £) and that the vertices in V are numbered 1,2,...,n. 
Further assume that we are given a matrix C[i, j] that tells us the cost of edge (2,7). If 
there is no edge C[i, 7], then we assume C|i, j] is set to infinity. We assume all other 
costs are non-negative. 


Algorithm 2.1.3 Floyd—Warshall 


Input: A directed graph G = (V, £), where V = {1,2,...,n}; 
and a cost matrix Cli, j]. 

Output: Cost matrix A[1..n,1..n] where A[2,j] is the cost of 
the cheapest path from 7 to 7. 


procedure FloydWarshall(G) { 
for 1:=1tondo 
for j :=1 ton do 
Ali, j] = Cli, 3]; 
for 1:=1tondo 
Ali, i] := 0; 
for k :=1 ton do 
for i1:=1tondo 
for j :=1tondo 
if Ali, k] + A[k, 7] < Alt, 7] then 


The Floyd—Warshall algorithm computes a cheapest-cost array A, where Ali, 7] gives 
the cheapest cost of any path from vertex 7 to vertex 7. For the algorithm to work 
correctly, it is important that there are no negative cost cycles in the graph. 


FACT 


F8: The Floyd—Warshall algorithm computes the cost matrix of the cheapest paths 
between all pairs of vertices of a directed graph G = (V, E) in O(|V|8) time and O(|V |?) 
space. 


REMARKS 


R9: For additional discussion of the Floyd—Warshall algorithm and its variants see 
[AhHoU174] and [CoLeRiSt09]. 
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R10: Let A*|i, 7] be the cost of the cheapest path from vertex i to vertex j that does 
not pass through a vertex numbered higher than k, except possibly for the endpoints. 
We can prove by induction on k that A* [i,j] = min(A*~!, AF-1[i, k] + A¥—1[k, j]). In 
the next section we see that the Floyd—Warshall algorithm is a special case of Kleene’s 
algorithm. 


Transitive Closure 


In some problems we may just want to know whether there exists a path from vertex 
i to vertex j of length one or more in a graph G = (V,E). We call this the problem 
of computing the transitive closure of G. Given a directed graph G = (V, E£) with 
adjacency matrix A, we want to compute a Boolean matrix T such that T[i,j] is 1 if 
there is a path from i to 7 of length 1 or more, and 0 otherwise. We call T the transitive 
closure of the adjacency matrix. 

The transitive-closure algorithm below is similar to the Floyd—Warshall algorithm 
except that it uses the Boolean operation and to conclude that if there is a path from 
i to k and one from & to j, then there is a path from ¢ to j. 


Algorithm 2.1.4 Transitive Closure 


Input: A directed graph G = (V, F), with V = {1,2,...,n} 
and adjacency matrix Ali, j]. 
Output: Boolean transitive-closure matrix T[1..n, 1..n] where Ti, j] is 
1 if there is a path from 7 to 7 of length 1 or more, and 0 otherwise. 


procedure TransitiveClosure(G) { 
for 1:=1 ton do 
for j := 1 ton do 
Ti, j] = Ali, 3]; 
for k :=1tondo 
for 1:=1tondo 
for j :=1tondo 
if Ali, 7] = false then 
Aji, j] := Ali, klorAlk, 9]; 


FACT 


F9: The algorithm TransitiveClosure(G) computes the transitive closure of G in 
O(|V|?) time and O(|V|?) space. 


REMARKS 
R11: The transitive-closure algorithm is due to S. Warshall [Wa62]. 


R12: Let T*[i,j] = 1 if there is a path of length one or more from vertex i to vertex j 
that does not pass through an intermediate vertex numbered higher than k, except for 
the endpoints. We can prove by induction on k that 


C*fi,j] =C*"[i, j] or CPi, k] and C™~"[k, j] 
where and and or are the Boolean and and or operators. In the next subsection we 
will see the transitive-closure algorithm is a special case of Kleene’s algorithm. 
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2.1.4 Applications to Pattern Matching 


Graphs play a major role in problems arising in the specification and translation of 
programming languages. A special kind of graph called a finite automaton is used 
in language theory to specify and recognize sets of strings called regular expressions. 
Regular expressions are used to specify the lexical structure of many programming 
language constructs. They are also widely used in many string-pattern-matching appli- 
cations. 

This section presents an algorithm due to S. C. Kleene to construct representations 
called regular expressions for all paths between the vertices of a directed graph. 


DEFINITIONS 


D16: A nondeterministic finite automaton (NFA) is a labeled, directed graph 
G = (V, E) in which 


1. one vertex is distinguished as the start vertex 
2. a set of vertices are distinguished as final vertices 


3. each edge is labeled by a symbol from a set © U {e} where 
x is a finite set of alphabet symbols, and 
€ is a special symbol denoting the empty string 


D17: An NFA G accepts a string x if there is a path in G from the start vertex to a 
final vertex whose edge labels spell out x. 


D18: The set of strings accepted by an NFA G is called the language defined by G. 


D19: If Rand S are sets of strings, then their concatenation R-S is the set of strings 
{xy|x is in r and y is in S}. 


D20: Let S be a set of strings. Define S° = {ce} and S*' = $-S*! fori > 1. The 
Kleene closure of S, denoted S*, is defined to be U%)S". 


D21: Let © bea finite set of alphabet symbols. The regular expressions over © and 
the languages they denote are defined recursively as follows: 


1. @ is a regular expression that denotes the empty set. 
2. € is a regular expression that denotes {e}. 
3. For each a in &, a is a regular expression that denotes {a}. 


4. If r and s are regular expressions denoting the languages R and S, then 
(r +s) is a regular expression denoting the language RU S, 
rs is a regular expression denoting R- S, and 
(r*) is a regular expression denoting R*. 
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We can avoid writing many parentheses in a regular expression by adopting the conven- 
tion that the Kleene closure operator * has higher precedence than concatenation or +, 
and that concatenation has higher precedence than +. For example, ((a(b*)) + c) may 
be written ab* + c. This regular expression denotes the set of strings {ab’|i > 0} U {c}. 


Kleene’s Algorithm 


S. C. Kleene presented an algorithm for constructing a regular expression from a non- 
deterministic finite automaton. This algorithm, shown below, includes the Floyd— 
Warshall algorithm and the transitive-closure algorithm as special cases. 


ALGORITHM 


Let G = (V,£) be an NFA in which the vertices are numbered 1,2,...,n. Kleene’s 
algorithm (Algorithm 2.1.5) works by constructing a sequence of matrices C* in which 
the entry C*{i, j] is a regular expression for all paths from vertex i to vertex j with no 
intermediate vertex on the path (except possibly for the endpoints) that is numbered 
higher than k. 


Algorithm 2.1.5 Kleene’s Algorithm 


Input: A directed graph G = (V, F), where V = {1,2,...,n}, 
and a label matrix L[#, j]. 

Output: Matrix C[1..n, 1..n] where C[i, 7] is a regular expression describing 
all paths from % to j. 


procedure Kleene(G) { 
for 1:=1tondo 
for j :=1 ton do 
Cli, J] = Lii, J]; 
for i :=1tondo 
C° li, i] = € + Cli, 7d]; 
for k := 1 ton do 
for i:=1tondo 
for j :=1tondo 
Crd] = Cray + Chi, k] ; (C*[k, k])* : OPN hg]: 
for 1:=1tondo 
for j :=1 ton do 
Cli, J] = C™[i, 9]; 


REMARKS 
R13: Kleene’s algorithm appeared in [K156]. 


R14: To prove the correctness of Kleene’s algorithm, we can prove by induction on 
k that C*[i,7] is the set of path labels of all paths from vertex i to vertex j with 
no intermediate vertex numbered higher than k, excluding the endpoints. The term 
C*—11j, k] in the inner loop represents the labels of all paths from vertex i to vertex k that 
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do not have an intermediate vertex numbered higher than k—1. The term (C*~!{k, k])* 
represents the labels of all paths that go from vertex k to vertex k zero or more times 
without passing through an intermediate vertex numbered higher than k — 1. The 
term C*~!{k, j] represents the labels of all paths from vertex k to vertex j that do not 
have an intermediate vertex numbered higher than k — 1. Thus, the term C*~1{i, j] - 
(C*—1[k, k])*-C*—1{k, j]) represents the path labels of all paths with the segments: from 
i to k, from k to k zero or more times, and from k to 7 with no intermediate vertex 
numbered higher than k — 1 on any of the segments. 


R15: The Floyd—Warshall algorithm is a special case of Kleene’s algorithm with the 
inner loop replaced by C*[i, j] := min(C*1[i, j],C*—"[i, k] + C*—1[k, j]. In the Floyd— 
Warshall algorithm we don’t need to consider paths from k to k since we assume the edge 
costs are non-negative. Also, in the Floyd—Warshall algorithm the operator representing 
concatenation (-) is arithmetic addition. 


R16: The transitive-closure algorithm is a special case of Kleene’s algorithm with the 
inner loop replaced by O*/i, j] := C*~1[i, j] + C*—1 [i,k] C*"[k, 7] where + represents 
Boolean or and - represents Boolean and. 


R17: Aho, Hopcroft, and Ullman present Kleene’s algorithm in the general setting of 
a closed semiring [AhHoUI174]. 


R18: One of the key results of formal language theory is that the set of languages de- 
fined by NFAs is exactly the same as the set of languages defined by regular expressions. 
These languages are called regular sets. 


R19: For applications of finite automata and regular expressions to string pattern 
matching and compiling see [Ah90, AhSeU186]. 
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INTRODUCTION 


Isomorphism between graphs and related objects is a fundamental concept in graph 
theory and its applications to other parts of mathematics. The problem also occupies 
a central position in complexity theory as a proposed occupant of the region that must 
exist between the polynomial-time and NP-complete problems if PANP. Due to its many 
practical applications a considerable number of algorithms for graph isomorphism have 
been proposed. 


2.2.1 Isomorphisms and Automorphisms 


Informally, two graphs are isomorphic if they are the same except for the names of 
their vertices and edges. Formally, this relationship is defined by means of bijections 
between them. 


Basic Terminology 


DEFINITIONS 


D1: Let G, = (Vi, 1) and G2 = (Ve, E2) be simple graphs. An isomorphism from 
G, to G is a bijection ¢ : Vi + V2 such that vw € Fy if and only if d(v)¢(w) € Eo. 


D2: A second way to define an isomorphism is that there are two bijections ¢ : V; > Vo 
and ¢ : E, — Ep», such that the incidence relation between vertices and edges is 
preserved. That is, v € Vi is incident to e € EF; if and only if ¢(v) is incident to ¢(e). 
This method is preferred if edges have additional attributes that should be preserved 
by the mapping. However, we will use the previous definition where it applies. 


D3: An isomorphism from a graph to itself is called an automorphism or symmetry. 
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D4: The set of automorphisms of a graph G form a group under the operation of 
composition, called the automorphism group Aut(G). The automorphism group of 
a simple graph is a subgroup of the symmetric group acting on the vertex set of the 
graph. 


EXAMPLE 


E1: Figure 2.2.1 shows an isomorphism between two graphs and gives the automor- 
phism group of the first graph. 


7 6 

CR ¢ 
(0 2)(4 7) 

4 3 (1 5)(3 6) 
(0 2)(4 7)(1 5)(3 6) 
(0 5)(1 2)(3 4)(6 7) 
(0 1 2 5)(3 4 6 7) 
(0 5 2 1)(3 7 6 4) 
(0 1)(2 5)(3 7)(4 6) 


Figure 2.2.1: An isomorphism between two graphs and the automorphism group of the 
first graph. 


DEFINITION 


D5: Closely related to isomorphism is the concept of canonical labeling. Arbitrarily 
choose one member of each isomorphism class of graphs, and call it the canonical 
form of that isomorphism class. Replacing a graph by the canonical form of its isomor- 
phism class is called canonical labeling or canonizing the graph. Two graphs are 
isomorphic if and only if their canonical forms are identical, as shown in Figure 2.2.2. 


7 2 1 6 5 3 1 0 
| different | | 
<_>> 
4 0 5 3 4 2 6 7 

| canonize | canonize 
3 7 5 1 3 7 4 0 
| same | | 
7" 
2 6 4 0 2 6 5 1 


Figure 2.2.2: Isomorphism between graphs becomes equality when they are canonized. 
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REMARKS 


R1: The labeled graph which gives the lexicographically greatest adjacency matrix is 
an example of an explicitly defined canonical form. In practice more complex definitions 
are used to assist efficient computation. 


R2: Canonical labeling has central importance to practical applications. One task is 
to determine whether a graph is isomorphic to any graph in a database of graphs. This 
is best achieved by storing the canonical forms of graphs in the database and comparing 
them to the canonical form of the new graph. Another task is to remove isomorphs 
from a large collection of graphs. This is best achieved by applying a sorting algorithm 
to the canonical forms of the graphs. Both tasks are very expensive if only pair-wise 
isomorphism testing is available. 


Related Isomorphism Problems 


Many types of isomorphism problem can be modeled as isomorphism between simple 
graphs or digraphs. 


FACTS 


F1: Vertex colors that must be preserved by isomorphisms can be modeled by attaching 
gadgets to the vertices, a different gadget for each color. However, this is such an 
important generalization that most software can handle vertex colors directly. 


F2: Edge colors can be modeled using layers, once vertex colors are available. Figure 
2.2.3 illustrates one approach. The edge colors are assigned numbers according to the 
table in the center. The vertices of the original graph are assigned to vertical paths, 
with the first layer identified by vertex color. Then the original edges with each color c 
are represented by horizontal edges within those layers where the binary expansion of c 
has ones. 


ON OW 
ooo Fe 
rFOoOW © 
ono oO 


1 2 3 4 


Figure 2.2.3: Modeling of graphs with colored edges. 


F3: Hypergraphs and other types of incidence structures like block designs and finite 
geometries can be represented by bipartite graphs. One color class consists of the 
vertices of the hypergraph, while the other has a vertex for each of the hyperedges of 
the hypergraph. An edge of the bipartite graph represents a vertex of the hypergraph 
and a hyperedge it lies in. 


F4: Other types of isomorphism easily modeled by graph isomorphism include equiva- 
lence of matrices defined by permutation of rows and columns, Hadamard equivalence, 
and isotopy (such as for Latin squares) [McPil2b]. 
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2.2.2 Complexity Theory 


The problem of determining whether two graphs are isomorphic, called GI or ISO, 
has received a great deal of interest from theorists due to its unsolved nature. 


FACTS 


F5: GI is not known to have a polynomial-time algorithm, nor to be NP-complete. 
While obviously in NP, its presence in co-NP is also undecided. Indeed, it is considered 
a prime candidate for the intermediate territory between P and NPC that must exist if 
PANP. One reason for this is that the NP-completeness of GI would imply the collapse 
of the polynomial-time hierarchy [GoMiWi91]. 


F6: The fastest proven running time for GI has stood for three decades at e0(V” 8”) 
[BaKaLug3]. 


F7: On the other hand, many special classes of graph are known to have polynomial- 
time isomorphism tests. The most general such classes are those defined by a for- 
bidden minor [Po88, Gr10] or by a forbidden topological minor [Gr12]. These classes 
include many earlier classes, including graphs of bounded degree [Lu82], bounded genus 
[FiMa8g0, Mi80], and bounded tree-width [Bo90]. However, very few of these polynomial 
algorithms are practical. 


DEFINITION 


D6: A decision problem is called isomorphism-complete if it is polynomial-time 
equivalent to GI. 


FACTS 


F8: All of the isomorphism problems noted in the previous subsection are isomorphism- 
complete. Many other examples are known, including isomorphism of semigroups and 
finite automata [Bo78], homeomorphism of 2-complexes [STPi94], and polytope isomor- 
phism [KaSc03]. 


F9: Isomorphism of linear codes (vector spaces over finite fields) defined by permutation 
of the coordinate positions, where the codes are presented as generator matrices, is at 
least as hard as graph isomorphism but might be harder [PeRo97]. 


F10: Isomorphism of groups given as multiplication tables is at least as easy as GI 
but might be easier [Bo78]. The best algorithm is very elementary and takes time 
nO los”) [Bo78, Mi78]). 


F11: Some problems similar to GI are NP-complete. The best known is the subgraph 
isomorphism problem: given two graphs, is the first isomorphic to a subgraph of the 
second? Another is the presence of an automorphism without fixed points, or of such 
an automorphism of order 2 [Lu81]. 


DEFINITION 
D7: A graph invariant is a property of graphs that is equal for isomorphic graphs. 


A complete graph invariant, also called a certificate, is an invariant that always 
distinguishes between non-isomorphic graphs. 
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FACTS 


F12: Examples of invariants include the degree sequence and the eigenvalue set of the 
adjacency matrix. However, neither of those invariants is complete. Nevertheless, even 
incomplete invariants can sometimes be used as a short proof of non-isomorphism. 


F13: An example of a complete invariant is a canonical form. However, it is not known 
if there is a complete invariant computable in polynomial time. In fact, it is not even 
known if there is a complete invariant checkable in polynomial time (which would place 
GI in co-NP). 


2.2.3. Algorithms 


The development of computer programs for graph isomorphism has been such a 
popular pursuit that already in 1976 it was called a “disease” [ReCo77]. Literally 
hundreds of algorithms have been published (many wrong). 


FACTS 


F14: The earliest software appeared in the 1960s. The approach which has been the 
most successful is the “individualization-refinement” paradigm introduced by Parris and 
Read [PaRe69] and further developed by Corneil and Gotlieb [CoGo70] and Arlazarov 
et al. [ArZuUsFa74]. This genre is now represented by the author’s nauty and other 
software mentioned below. 


REMARK 


R3: We will focus our attention on canonical labeling, which is the method used by 
the most useful modern algorithms. 


DEFINITIONS 


Ds: A key routine is that of partition refinement, which is any process of making 
a partition finer (i.e., breaking its cells into smaller cells) by detecting combinatorial 
differences between the vertices. For isomorphism purposes, only properties independent 
of the numbering of the vertices may be used. This implies that vertices equivalent under 
the action of an automorphism fixing the input partition cannot be separated. 


D9: An equitable partition is a partition of the vertices of a graph into cells such 
that, for any two vertices v,w in the same cell, and any cell C, we have that v and w 
are adjacent to the same number of vertices in C. 


EXAMPLE 


E2: Figure 2.2.4 shows a graph with an equitable partition of two cells. Each black 
vertex is adjacent to no black and two white vertices, while each white vertex is adjacent 
to one white and two black vertices. As this example shows, vertices in the same cell of 
an equitable partition do not need to be equivalent under the automorphism group of 
the graph. 
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Figure 2.2.4: An equitable partition with two cells. 


FACTS 


F15: The most well-known partition refinement method splits cells until the partition 
becomes equitable. Given two cells C,, C2, the vertices in C) are separated into subcells 
according to their number of neighbors in C2. This is repeated for different pairs of cells 
until no more splitting is possible. Algorithms differ according to the method used to 
choose the pairs. The fastest algorithm is in [Mc80]. 


F16: A generalization of this type of refinement, called the k-th order Weissfeiler— 
Lehman refinement, uses partitions of the set of k-tuples of vertices rather than just 
a partition of the vertices. For some classes of graphs, it is known that there is a fixed k 
for which this refinement provides the automorphism partition of the graph, which can 
be used to build a polynomial-time isomorphism algorithm. However, Cai, Ftiredi, and 
Immerman showed that no such k is sufficient for all graphs [CaFulm92]. 


Search Tree 


A partition refinement method is used to define a search tree that is used to find a 
canonical labeling and the automorphism group. 


DEFINITION 


D10: Consider a graph and an initial partition (perhaps trivial). Define a search tree 
whose nodes are refined partitions. The root of the tree is the refined initial partition. 
Any node which is a discrete partition (one with only singleton cells) is a leaf of the 
tree. Consider any node v which is not discrete. Choose a non-singleton cell C. Then 
vy has one child for each v € C, obtained by splitting C into two cells {v} and C—{v}, 
then refining. 


FACTS 


F17: The leaves of the search tree are discrete partitions, and thus, lists of the vertices 
in a definite order. The orders define a set of numberings of the vertices of the graph.The 
maximum labeled graph, according to lexicographic or other convenient ordering, is a 
canonical form. Moreover, two numberings that define the same labeled graph yield an 
automorphism, and all automorphisms can be found in that way. 
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F18: In practice the search tree may be much too large, so various means are employed 
to reduce it. One way is to employ automorphisms as they are discovered to prune 
branches of the tree thus shown to be equivalent to other branches. Another way is to 
employ invariants computed at the nodes of the tree to perform a type of branch-and- 
bound. A third method is to use a more powerful refinement procedure. 


Software 


The first program that could process structurally highly regular graphs and graphs with 
hundreds of vertices was that of the author, which became known as nauty [Mc78, Mc80] 
and dominated the field from the 1970s until recent years. Now there are several strong 
competitors. 


EXAMPLES 


E3: nauty, by this author, can find automorphisms groups and canonical forms, of 
graphs and digraphs. It comes in two forms, with either dense or sparse data struc- 
tures [McPil2b, McPi13]. 


E4: Saucy, by Darga, Liffiton, Sakallah and Markov, computes automorphism groups 
and is especially efficient for large sparse graphs having many automorphisms that move 
few vertices [DaLiSaMa04, DaSaMa04]. 


E5: Bliss, by Junttila and Kaski, can also perform canonical labeling and has very 
dependable performance for highly regular graphs [JuKa07, JuKal1]. 


E6: Traces, by Piperno, introduced an entirely new way of scanning the search tree, 
using a combination of breadth-first and depth-first search [Pi08, McPil3]. At the time 
of writing, Traces is the most efficient program for processing many classes of very 
difficult graphs, as well as being highly competitive for easy graphs [McPil3]. Since 
January 2013, Traces has been distributed with nauty [McPil2]. 


E7: Other worthy programs are conauto by Lépez-Presa, Fernandes Anta, and Nunez 
Chiroque [LPFe09, JPFel1l], VSEP by Stoichev, and VF by Cordella, Foggia, Sansone, 
and Vento. 


E8: Packages which contain high-quality graph isomorphism facilities (usually via 
nauty) include Magma, GRAPE, LINK, Sage-combinat, and Macaulay2. 


REMARKS 


R4: An experimental comparison of nauty, Traces, saucy, Bliss, and conauto can 
be found in [McPi13]. 


R5: All the named programs have exponential running time in the worst case. However, 
the worst-case graphs are rather difficult to find and most users will see only scaling 
according to a polynomial of low degree. The state of the art is that the easiest graphs 
can be handled if they fit into main memory (tens of millions of vertices). The most 
difficult graphs cause difficulty in the hundreds or thousands of vertices. 
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INTRODUCTION 


In the first volume of the Journal of Graph Theory published in 1977, the journal 
editors wrote, “The foremost currently unsolved problem in Graph Theory is, in our 
considered opinion, the Reconstruction Conjecture.” One might agree or disagree with 
this assessment, but surely one must admit that this is an inviting problem, made all 
the more tantalizing by the fact that, although thirty-five years have passed since the 
editors of the Journal of Graph Theory expressed their views, and several researchers 
have made efforts to attain, at least, some partial reconstruction results, we are still 
nowhere near any complete solution of this problem. In the first section below we 
shall give the two main variants of the Reconstruction Problem, and in the subsequent 
sections we shall expand on these definitions hoping, this way, to provide a panoramic 
view of the present state of knowledge on the Reconstruction Problem. 

In this section all graphs are assumed to be simple. 


2.3.1 Two Reconstruction Conjectures 


Some classical problems in mathematics are of the following type. If the structure S$’ 
is associated with the given structure S, does S’ determine S uniquely? In graph theory 
we ask what knowledge, short of its full incidence relations, is sufficient to determine the 
graph completely. The structure S would be the graph and S’ could be its line-graph, 
or chromatic polynomial, or spectrum, say. The best known problem of this type in 
graph theory is the Reconstruction Problem. 
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Decks and Edge-Decks 


We first need to introduce some definitions and notations connected with families of 
vertex-deleted and edge-deleted subgraphs of a graph. 


DEFINITIONS 


D1: Let G be a graph (assumed here to have no loops or multiple edges) on n vertices. 
For any vertex uv of G, let G — v denote the vertex-deleted subgraph of G that is 
obtained from G by removing v and all edges incident to v. Similarly, let e be an edge 
of G. The edge-deleted subgraph of G, denoted by G — e, is obtained from G by 
deleting the edge e. 


D2: If A is a subset of vertices of G then G — A will denote the graph obtained from 
G by deleting all vertices in A and any edge incident to at least one of them. If B is a 
subset of edges of G then G — B will denote that subgraph of G obtained by deleting 
all the edges in B. 


D3: The collection of all vertex-deleted subgraphs of G is called the deck of G, while 
the collection of all edge-deleted subgraphs of G is called the edge-deck of G. 
NOTATION: The deck of G is denoted by D(G) and the edge-deck of G is denoted by 
ED(G). 

Note that the graphs in the deck are unlabeled and if G contains isomorphic vertex- 
deleted subgraphs, then such subgraphs are repeated in DG according to the number of 
isomorphic subgraphs which G contains. The same holds for the edge-deck. Therefore 
the deck and the edge-deck are multi-sets, rather than sets, of isomorphism types of 
graphs. 


EXAMPLE 


E1: Figure 2.3.1 shows a graph and its deck. 


G: DG): ow} TS 


Figure 2.3.1: A graph and its deck. 


Reconstructibility 


Now suppose that H is another graph with D(H) = D(G). The question we are inter- 
ested in is: must H be isomorphic to G? 


DEFINITION 


D4: Any graph H with the same deck as G is called a reconstruction of G. If every 
reconstruction of G is isomorphic to G, then G is said to be reconstructible. 
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EXAMPLE 


E2: A graph that is not reconstructible is Ka, the graph consisting of just one edge, 
because if H is the graph consisting of two isolated vertices, then clearly H is a recon- 
struction of K2, but not isomorphic to Kz. The Reconstruction Conjecture asserts that 
these are the only non-reconstructible graphs. 


The Reconstruction Conjecture 


C1: [Ke57, Ul160] Every graph with at least three vertices is reconstructible. 


Another way of looking at reconstruction is by saying that a graph G is recon- 
structible if it can be uniquely (up to isomorphism) determined from D(G). Note that 
the problem stated in this way is not about finding an efficient algorithm for recon- 
structing G from D(G). In principle, given the deck, one can consider all graphs on 
n vertices to check which of them have the given deck. The question remains one of 
uniqueness, that is, whether this search will find only one graph with the given deck. 
The deck is a collection of isomorphism types with appropriate multiplicities, and the 
question is whether the isomorphism type of G can be determined uniquely from this 
collection. 


DEFINITIONS 
Closely related to the Reconstruction Problem is the Edge-Reconstruction Problem. 


D5: An edge-reconstruction of G is a graph which has the same edge-deck as G, and 
G is said to be edge-reconstructible if every edge-reconstruction of G is isomorphic 
to it. 


The Edge-Reconstruction Conjecture 


C2: [Ha64] Every graph with at least four edges is edge-reconstructible. 


EXAMPLE 


E38: The graph G = K3 UkK, is not edge-reconstructible because, if H is the graph 
Ky 3 U(k —1)4), then H is an edge-reconstruction of G that is not isomorphic to it. 
Also, G = 2K is not edge-reconstructible because if H = P3U kK, (where P3 is the path 
on three vertices), then ED(G) = ED(H) but G# H. 


The Edge-Reconstruction Conjecture asserts that these are the only graphs that are 
not edge-reconstructible. 


REMARK 


R1: Two important surveys on the Reconstruction Problem were published in 1977 and 
1978 [BoHe77, Na78]. These give the state of knowledge on this problem till that date, 
together with a complete bibliographic list. Since then, a number of survey or expository 
articles have been published [El88, Ma88, La87, Bo91]. The two early surveys and the 
most recent one should be consulted by anyone who intends to do serious work in this 
field. The book [LaSc03] also contains four chapters on the reconstruction problem. 
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Comparing Reconstruction with Edge-Reconstruction 


Intuition seems to suggest that it is easier to reconstruct a graph from its edge-deck 
than from its deck; there are generally more graphs in the edge-deck, and edge-deleted 
subgraphs are generally more nearly like the original graph than vertex-deleted sub- 
graphs. This intuitive notion is borne out by the following theorem of Greenwell which, 
essentially, says that if the Reconstruction Conjecture is true then so is the Edge- 
Reconstruction Conjecture. 


FACTS 


F1: [Gr71] [Greenwell’s Theorem] Let G be a graph without isolated vertices. The 
deck of G is edge-reconstructible, that is, D(G) is uniquely determined from ED(G). 
Therefore, if G is reconstructible, then it is also edge-reconstructible. 


Therefore, for graphs without isolated vertices, if the Reconstruction Conjecture 
is true, then so is the Edge-Reconstruction Conjecture. (We shall henceforth tacitly 
assume, unless otherwise stated, that any graph to be reconstructed has no isolated 
vertices.) So, if we can prove a result for vertex reconstruction, say that a certain class 
of graphs is reconstructible, then this result would automatically hold for edge-recon- 
struction. 

Another result which shows that the problem of edge-reconstruction is a special case 
of reconstruction is the following theorem of Hemminger. 


F2: [He69] A graph is edge-reconstructible if and only if its line graph is reconstructible 
and is not K3. 


However, the Edge-Reconstruction Problem holds its own considerable independent 
interest because several results are known in edge-reconstruction which have not yet 
been proved for vertex-reconstruction, and some elegant proof techniques have been 
developed for edge-reconstruction. We shall consider these techniques in some detail in 
a later section. 


Reconstruction and Graph Symmetries 


It is well to emphasize here that at the heart of the difficulty of reconstructing a graph G 
from its deck is the symmetry of G and of the subgraphs in its deck. As an illustration, 
let us consider an extreme case. Suppose that the vertices of G are labeled 1,...,n 
and that these labels are preserved on every vertex-deleted subgraph. Then, clearly, 
the graph G can be uniquely reconstructed by considering any three subgraphs in its 
deck and ‘superimposing’ them accordingly. This happens because the labelings have 
essentially removed all symmetries of G and its subgraphs, and therefore all ambiguities 
of how these subgraphs are embedded inside G. A later section of this chapter will bring 
out more clearly the role of the automorphism of G in the edge-reconstruction of G. 
But the situation just described does, in fact, occur quite often, and considering how 
this happens should help one understand where the difficulty of reconstruction lies. 


DEFINITION 


D6: Let k be a fixed but arbitrary integer. Then the graph G is said to have property 
A, if, whenever A and B are distinct k-sets of vertices of G, the graphs G — A and 
G — B are not isomorphic. In other words, if G has n vertices, then any two subgraphs 
of G induced by different sets of n — k vertices are not isomorphic. 
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FACTS 


F3: If G has property A,+1 then it has property A, and if it has property A, then its 
automorphism group is trivial. 


F4: [Ko71, Mii76, Bo90] For a fixed k almost every graph has property A,, meaning 
that the proportion of labeled graphs on n vertices which have this property tends to 1 
as n goes to oo. 


F5: [My88, Bo90] If a graph G has property A3 then it can be reconstructed uniquely 
from any three subgraphs in its deck. 


REMARK 


R2: Considering why the last fact holds helps to gain insight into the Reconstruction 
Problem. Let G—u,G— v,G — w be any three subgraphs from the deck of G. Since 
G also has property Az, we can identify v in G— u and u in G — v as the only vertices 
which give G—u—v ~ G—v—u. Also, by property As, there is a unique isomorphism 
from G—u—v to G—v—u. This isomorphism labels the two graphs uniquely, and we 
have the situation of a labeled graph which we described above. By comparing the two 
graphs G — u,G — v we can then clearly put the vertex u back in G — u and join it to 
its neighbors in G. The only uncertainty is whether or not u is adjacent to v. But this 
can be resolved by repeating the above with G — u and G — w instead of G — v. 


2.3.2 Some Reconstructible Parameters and Classes 


Faced with the olympian task of settling either way the Reconstruction Conjectures, 
most mathematicians have settled on attacking the conjectures partially, by showing 
that certain graph parameters or graph classes are reconstructible. We shall here present 
the most important parameters or classes which have been shown to be reconstructible, 
focusing mainly on results which have been obtained since the surveys mentioned in 
Section 1 and on those classes and parameters with which we shall not be dealing in 
the next three sections. 


Reconstructible Parameters 


DEFINITION 


D7: A graph parameter P is said to be reconstructible (or edge-reconstructible) if, 
for any graph G with the value p for the parameter, any reconstruction (or edge- 
reconstruction) of G also has value p for P. Equivalently, P is reconstructible from 
D(G) (or ED(G)) if it is uniquely determined by the deck (or edge-deck). 


FACTS 

Recall that, by Greenwell’s Theorem, for graphs without isolated vertices, a param- 
eter is edge-reconstructible if it is reconstructible. For some of the following facts it is 
easy to show that the respective parameter is reconstructible. 


F6: The number of vertices and edges are both reconstructible and edge-reconstructible. 
(See [LaSc03].) 
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F7: The degree sequence is reconstructible and edge-reconstructible. (See [Bo91, 
LaSc03].) 


F8: Given a G — v from the deck of G, the degree in G of the missing vertex and the 
degrees in G of the neighbors of v are reconstructible. (See [LaSc03].) 


F9: Given a G — e in the edge-deck of G, the degrees in G of the vertices with which 
the missing edge e is incident is edge-reconstructible. 


NOTATION 
Let H and G be two graphs. Then (a denotes the number of subgraphs of G 
isomorphic to H. 


FACTS 
Perhaps the single most useful result in reconstruction has proven to be this very 
simple lemma referred to as Kelly’s Lemma. 


F10: [Ke57] Let G and H be graphs with G having more vertices than H. Then () 
is reconstructible from D(G). Similarly, if G has at least as many vertices as H and 
strictly more edges than H, then (3) is reconstructible from ED(G). 


Reconstructible Classes 


Recall that graphs which do not have isolated vertices and are reconstructible are, by 
Greenwell’s Theorem, also edge-reconstructible. 

Also, one must keep in mind that, when we say that the class of graphs C is recon- 
structible, one is only given the deck or the edge-deck, and not the information that the 
graph to be reconstructed is in C. Reconstruction here usually proceeds in two steps, 
first determining from the deck that the graph is in C, then using this extra piece of 
information to prove that G is reconstructible. The following is a more exact definition 
of these two stages. 


DEFINITIONS 


D8: A class C of graphs is said to be recognizable or edge-recognizable if, for any 
graph G € C, any reconstruction, or edge-reconstruction, of G is also in C. Equivalently, 
C is recognizable or edge-recognizable if it can be determined from D(G) or ED(G) 
whether or not G is in C. 


D9: A graph G € C is said to be weakly reconstructible or weakly edge-reconstruct- 
ible if any reconstruction, or edge-reconstruction, of G which is also in C is isomorphic 
to G. Equivalently, G is weakly reconstructible or weakly edge-reconstructible if it can 
be determined uniquely from the deck, or edge-deck, with the extra information that G 
is inC. 


This two-step process was essential in practically all proofs of reconstructibility of 
the following classes. 


FACTS 


F11: Regular graphs are reconstructible. (See [LaSc03].) 
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F12: Disconnected graphs are reconstructible. (See [LaSc03].) 
F13: [Ke57] Trees are reconstructible. 


F14: [Bo69b] Separable graphs (that is, graphs with connectivity 1) without vertices 
of degree 1 are reconstructible. 


F15: [Zh88] The reconstruction conjecture is true if all 2-connected graphs are recon- 
structible. 


F16: [Mc77] A computer search has shown that all graphs on nine or fewer vertices 
are reconstructible. 


F17: [FiMa78, FiLa81, La81] Maximal planar graphs are reconstructible. 
F18: [Gi76] Outerplanar graphs are reconstructible. 


F19: [GoMc81] If all but at most one eigenvalue of G is simple and the corresponding 
eigenvectors are not orthogonal to the all-1’s vector, then G is reconstructible. In 
particular, if G and its complement share no eigenvalue, then G is reconstructible. 


F20: [Yu82] If there exists a subgraph G — v of G none of whose eigenvectors is 
orthogonal to the all-1’s vector, then G is reconstructible. 


F21: [Fa94] Planar graphs with minimum degree at least 3 are edge-reconstructible. 


F22: [Zh98a, Zh98b] Any graph of minimum degree 4 that triangulates a surface is 
edge-reconstructible. Any graph that triangulates a surface of characteristic at least 0 
is edge-reconstructible. A graph G that triangulates a surface } of characteristic y(X) 
is edge-reconstructible if |V(G)| > —43x(X). 


F23: [FaWuWa01] Series parallel networks (that is, 2-connected graphs without a 
subdivision of 4) are edge-reconstructible. 


F24: [Ch71] If a graph has property Ag then it is reconstructible. 
F25: [ElPyXi88] Claw-free graphs are edge-reconstructible. 


F26: [MyElHo87]| Bidegreed graphs are edge-reconstructible. 


REMARKS 


R3: A claw-free graph is one which has no induced subgraph isomorphic to K1¥3. 
This result made essential use of Nash—Williams’ Lemma, which we shall discuss below. 


R4: A bidegreed graph is a graph whose vertices can have only one of two possible 
degrees (the degrees have to be consecutive numbers, otherwise edge-reconstruction is 
trivial). The next step after this result would be the edge-reconstruction of tridegreed 
graphs (again, if the three degrees are not consecutive, then edge-reconstruction is easy). 
However, even the most elementary instance of this case, that is, degrees equal to 1, 2, 
and 3, seems to be extremely difficult to tackle [Sc85]. 
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R5: Asanext step, after the above series of results, attempting to reconstruct bipartite 
graphs seems worthwhile. However, no progress has been achieved to date either in 
reconstructing or edge-reconstructing bipartite graphs, in spite of the fact that, in Bondy 
and Hemminger’s survey, this was even then suggested as a worthwhile problem. 


R6: Several of the proofs of the above results involved long arguments very specific 
to the class of graphs under consideration, although in some cases common techniques 
began to emerge. But if these results are viewed as a step-wise attempt at solving the 
Reconstruction Problem for all graphs, then these results would simply be nibbles at a 
big mountain. What makes these results interesting, really, is the fact that use is made 
of the properties of such classes, and often new properties have to be unearthed. In 
Sections 4 and 5 we shall see sets of results which are more general and less specific to 
particular classes of graphs. 


2.3.3. Reconstructing from Less than the Full Deck 


The proofs of most of the reconstructibility results given above use much less infor- 
mation than is given by the full deck or edge-deck. This situation is best epitomized 
by the reconstruction of trees. Trees have been shown to be reconstructible by deleting 
only their endvertices (vertices of degree 1) [HaPa66], or only their peripheral vertices 
(vertices at maximum distance from the center of the tree) [Bo69a]. This immediately 
suggests, for graphs with many endvertices, reconstructibility from only the endvertex- 
deleted subgraphs. 


Endvertex-Reconstruction 
DEFINITION 
D10: The endvertez-deck of a graph G is the collection of graphs G — v for all 


vertices v with degree 1 in G. A graph G is endvertex-reconstructible if it is uniquely 
determined by its endvertex-deck. 


FACTS 


F27: [HaPa66] Trees are endvertex-reconstructible. 


One natural question which arises is therefore whether a graph with a sufficiently 
large number of endvertices is necessarily endvertex-reconstructible. A negative result 
of Bryant, however, puts paid to any such hopes. 


F28: [Br71] For any integer & there is a graph with k endvertices which is not endvertex- 
reconstructible. 


However, this is not the end of the story for endvertex-reconstructibility. Toward 
the end of Section 5 we shall present a result which indicates that it is the proportion 
of endvertices in a graph that determines its endvertex-reconstructibility. 
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Reconstruction Numbers 


Again noting that not all graphs in the deck are usually needed for reconstruction, 
Harary and Plantholt [HaP185] introduced the definition of reconstruction numbers. 


DEFINITIONS 


D11: The reconstruction number of a graph G, denoted by rn(G), is the least 
number of subgraphs in the deck of G which guarantees that G is uniquely determined. 
The edge-reconstruction number, denoted by ern(G), is analogously defined. 


D12: Let C be a class of graphs. The class reconstruction number of a graph G 
in C, denoted by Crn(G), is the minimum number of subgraphs in the deck of G which, 
together with the information that G is in C, guarantees that G is uniquely determined. 
The class edge-reconstruction number, denoted by Cern(G), is analogously defined. 


Since almost every graph has property A3, when discussing above the relationship 
between reconstruction and symmetries we have already met the first of the following 
results that answers in the positive a question raised by Harary and Plantholt in their 
paper. 


FACTS 
F29: [My88, Bo90] Almost every graph has reconstruction number equal to 3. 


The next two results imply that there is no disconnected graph with c vertices in 
each component and reconstruction number equal to c+ 1. They also raise the natural 
question of investigating the gap between 3 and c+ 1 for the reconstruction number of 
disconnected graphs. Thus, let G be a disconnected graph consisting of a number of 
copies of H with |V(H)| = c. Determine the number g = g(c) such that if rn(G) > g 
then H = K, but there is a G, with H # K-, such that rn(G) = g — 1. The last result 
above shows that g < c. Also, is there a constant go such that if G is a disconnected 
graph with rn(G) > go then G must be a union of complete graphs? 


F30: [My90] A disconnected graph with components not all isomorphic has recon- 
struction number 3. If all components are isomorphic and have c vertices each, then the 


reconstruction number can be equal to c+ 2. 


F31: [AsLa02] If the reconstruction number of a disconnected graph is at least c + 1 
then G must consist of copies of Ke. 


F32: [BaBaHo87] If C is the class of total graphs and G is in C, then Crn(G) equals 1. 
F33: [My90] The reconstruction number of trees is 3. 


Harary and Lauri [HaLa88] have conjectured that if C is the class of trees and T is 
a tree then Crn(T) is at most 2. 


F34: [HaLa87] If C is the class of maximal planar graphs and G is maximal planar then 
Crn(G) is at most 2. Those maximal planar graphs with class reconstruction number 


equal to 1 are characterized. 


F35: Almost every graph has edge-reconstruction number equal to 2. (See [LaSc03].) 
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F36: [Mo95] Let G be a disconnected graph. If G contains a pair of nontrivial, non- 
isomorphic components, then ern(G) is at most 3. If, furthermore, G is not a forest 
and contains a component other than A3 and Kj 3, then ern(G) is at most 2. If the 
components of G are all isomorphic and contain k edges, then the edge-reconstruction 
number can be as high as k + 2. 


Remarks analogous to those made above concerning the reconstruction number of 
disconnected graphs also can be made here. 


F37: [Mo93] Every tree with at least 4 edges has edge-reconstruction number at most 3. 


FURTHER REMARKS 


R7: An intriguing question which has hardly been given any attention is the rela- 
tionship between rn(G) and ern(G). While we have seen results which say that edge- 
reconstruction is implied by reconstruction, no such relationship seems to exist between 
these two parameters. In fact, the edge-reconstruction number for a graph could be 
greater than its reconstruction number. 


R8: Reconstruction numbers are also interesting from another point of view. We have 
seen that lack of symmetry favors reconstruction, but whereas this should imply that 
high symmetry makes reconstruction more difficult, highly symmetric graphs are regu- 
lar, and these are trivially reconstructible. Reconstruction numbers seem to put this in a 
better perspective because, while graphs with property A3 have reconstruction number 
3, it seems [My88] that regular graphs are the candidates for being the graphs with the 
largest reconstruction number. It should also be pointed out that the reconstruction 
number of regular graphs is not yet known. 


R9: In her thesis [My88], Myrvold calls the reconstruction number the ally recon- 
struction number and she also defines another parameter which she calls the adver- 
sary reconstruction number. If A and B are two players, the reconstruction number 
can be seen as the smallest number of graphs from the deck which A can give to B such 
that the latter can determine the graph uniquely; here A and B are allies. However, we 
can also ask for the largest number of subgraphs which A can give B such that B cannot 
determine the graph uniquely; here A and B are adversaries. The adversary reconstruc- 
tion number of a graph is equal to 1 plus this last number. In other words, a graph G 
has adversary reconstruction number k if and only if any k subgraphs from the deck of 
G determine it uniquely; equivalently, no other graph has these same k subgraphs in its 
deck. Only partial results have been obtained on the adversary reconstruction number, 
which seems even more difficult to tackle than the (ally) reconstruction number, and 
this might perhaps be an area for further interesting research. 


Set Reconstruction 


In 1964, Harary [Ha64] suggested another way of reconstructing by not using the full 
deck when he made the following conjecture. 


C3: [Set Reconstruction Conjecture] Any graph G with n > 4 vertices can be recon- 
structed uniquely from its set of nonisomorphic subgraphs G — v. 
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In other words, one is now only given one graph from each isomorphism class in the 
deck, and one does not know how many times each given graph appears in the deck. 


DEFINITION 


D13: A graph or a parameter which can be determined from the respective set of 
non-isomorphic subgraphs is said to be set reconstructible. 


FACTS 
F38: [Ma76] The number of edges and the set of degrees of a graph is set reconstructible. 


F39: [Ma76] For every graph in which no vertex of minimum degree lies on a triangle, 
the degree sequence is set reconstructible. 


F40: [Ma76] The degree sequence of any graph with minimum degree at most 3 is set 
reconstructible. 


F41: [Ma76] The connectivity of any graph is set reconstructible. 


F42: |Ma76] Disconnected graphs are set reconstructible. 


F43: Separable graphs (that is, graph of connectivity 1) without vertices of degree 1 
are set reconstructible. 


F44: [Ma70] Trees are set reconstructible. 
F45: [Gi76] Outerplanar graphs are set reconstructible. 


F46: [ArCo74] Unicyclic graphs (that is, graphs having only one cycle) are set recon- 
structible. 


REMARK 


R10: The idea of set reconstruction can also be applied to edge-reconstruction, that is, 
only one copy of each isomorphism type in the edge-deck is given. When a parameter 
or a class of graph is so reconstructible we say that it is set edge-reconstructible. We 
highlight a few results in set edge-reconstructibility. 


FACTS 


F47: [Ma76] The degree sequence of a graph is set edge-reconstructible. 


Delorme, Favaron, and Rautenbach have improved this result of Manvel. 


F48: [DeFaRa02] The degree sequence of a graph with at least four edges is uniquely 
determined by the set of degree sequences of its edge-deleted subgraphs with one well- 
described class of exceptions. Moreover, the multiset of the degree sequences of the 
edge-deleted subgraphs determines the degree sequence of the graph. 


F49: [AnDiVe96] If a graph G with at least four edges has at most two non-isomorphic 
edge-deleted subgraphs, then G' is set edge-reconstructible. 
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Reconstruction from the Characteristic Polynomial Deck 


In [Sc79], Schwenk proposed the problem of reconstructing a graph from the charac- 
teristic polynomial of each subgraph in the deck, which we shall call the polynomial 
deck. He also showed that the answer to this general problem is in the negative, that is, 
a graph is not necessarily reconstructible from the polynomial deck. But he suggested 
a weakening of the problem so that what is required is the reconstruction of the char- 
acteristic polynomial of G from its polynomial deck. This problem is still open, and we 
here limit ourselves to presenting four results from the few which have been obtained. 


FACTS 


F50: [Sc79] The characteristic polynomial of any graph is reconstructible up to a 
constant from the polynomial deck. 


F51: If a subgraph in the deck of G has a characteristic polynomial with repeated 
roots, then the characteristic polynomial of G is reconstructible from its polynomial 
deck. (See [LaSc03].) 


F52: [CvLe98] The characteristic polynomial of a tree is reconstructible from its poly- 
nomial deck. 


F53: [Sc02] If a graph of order n has at least n/3 vertices of degree 1 then its charac- 
teristic polynomial is reconstructible from its polynomial deck. 


Reconstructing from k-Vertex-Deleted Subgraphs 


A k-vertex-deleted subgraph of G is a subgraph obtained from G by deleting k of its 
vertices and all edges incident to them. We shall have more to say about this mode of 
reconstruction in Section 5 when we consider k-edge-deleted subgraphs. We here limit 
ourselves to one result. 


F54: [Ta89] Let k > 3 be an integer. Then the degree sequences of all sufficiently large 
graphs are determined by their k-vertex-deleted subgraphs. In particular, this result is 
true for all graphs on at least f(k) vertices, where f(k) is a certain function which is 
asymptotic to ke. 


2.3.4 Tutte’s and Kocay’s Results 


If Kelly’s Lemma were true for all spanning subgraphs of G (that is, subgraphs with 
the same number of vertices of G) then this would solve the reconstruction problem. 
In [Tu79], however, Tutte managed to show that Kelly’s Lemma can be extended to 
certain classes of spanning subgraphs, and this has had very important consequences. 
In [Ko81], Kocay managed to obtain Tutte’s results with proofs that were much easier. 
We shall here present a sketch of Kocay’s method. A fuller treatment is given in [Bo91] 
and [LaSc03]. 
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Kocay’s Parameter 


DEFINITION 


D14: Let G be a graph and F = (Fj, Fo,..., F,) a sequence of graphs (we do not 
exclude the possibility that different F; could be isomorphic). A cover of G by F isa 
sequence G = (G1, G2,...,G,) of subgraphs of G (not necessarily distinct) such that: 
(i) Gj ~ Fi, i=1,...,k and (ii) G = U;G;. The number of covers of G by F is denoted 
by c(F,G). 


FACTS 
[Tu79, Ko81] (See also [Bo91, LaSc03].) 


F55: Let G be a graph and let F = (Fi, Fo,..., Fx) be a sequence of graphs with each 
|V(F;| < |V(G)|. Let «(F,G) be the parameter defined by 


SFX) @ 


x 


where the summation is taken over all isomorphism types X of graphs such that 
|V(X)| =|V(G)|. Then «(F,G) is reconstructible. 


The following results are then obtained by defining a suitable choice for the Fj. 
F56: The number of 1-factors of G is reconstructible. 
F57: The number of spanning trees of G is reconstructible. 
F58: The number of Hamiltonian cycles of G is reconstructible. 


F59: The number of 2-connected spanning subgraphs of G with a specified number of 
edges is reconstructible. 


Characteristic and Chromatic Polynomials 


DEFINITIONS 


D15: An elementary graph is a graph in which every component is either an edge 
or a cycle. For any graph X, c(X) denotes the number of components of X and s(X) 
the number of cycles. 


FACTS 
The proofs of the first two of the following results, due to Sachs and Whitney, 
respectively, can be found in [Bi93]. 


F60: Let the characteristic polynomial of G be 
NO Ba aaa Bie ey, 
Then each coefficient a; is given by 


06 = D-}1209(9) 


x 


where the summation extends over all isomorphism types X of elementary graphs on 7 
vertices. 
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F61: Let the chromatic polynomial of G be 
byx t+ bor? +... + dn x” 


Then each coefficient b; is given by 


b = (21 je 


x 


where the summation extends over all isomorphism types X of graphs on n vertices and 
% components. 


From these characterizations of the characteristic and chromatic polynomials to- 
gether with the previous reconstruction results, the next important results follow. [Tu79, 
Ko81] (See also [Bo91, LaSc03].) 


F62: The characteristic polynomial is reconstructible. 


F63: The chromatic polynomial is reconstructible. 


2.3.5 Lovasz’s Method and Nash—Williams’s Lemma 


It is quite arguable that the deepest and most general results obtained in recon- 
struction are those which we shall be presenting in this section. In 1972, Lovasz [Lo72] 
published a beautiful two-page paper in which he showed that if a graph has one half 
more than the largest possible number of edges, then it is edge-reconstructible. This 
paper made a surprising and elementary use of the inclusion-exclusion principle. In 
1977, Miiller [Mii77], using the same method as Lovdsz, obtained a stronger conclu- 
sion for edge-reconstructibility. In his survey of 1978 [Na78], Nash—Williams proved a 
lemma from which Lovadsz’s and Miiller’s results follow (but still applying the method 
introduced by Lovadsz). We shall here give these results. A more extended treatment 
can be found in [E188] or [Bo91] or [LaSc03}. 


The Nash—Williams Lemma 


DEFINITIONS 
In the following, let G and H be graphs which are assumed to share the same 
vertex-set V, and let X be a subset of the edge-set of G. 


D16: A homomorphism from G to H is a permutation of V such that any edge of G 
is mapped into an edge of H. The number of such homomorphisms is denoted by [H]a. 


D17: A homomorphism with forbidden X is a permutation of V such that all 
edges in E(G) — X are mapped into edges of H but all edges in X are mapped into 
non-edges in H. The number of such homomorphisms is denoted by [H]q\ x- 


REMARK 


R11: Note that [H]q\x is quite different from [H]¢_x; the latter counts homomor- 
phisms from G—X to H, that is, where all edges in E(G) — X are mapped into edges of 
HAT, but all edges in X can be mapped either into edges or non-edges of H; this is unlike 
the case of [H]q\ x where all the edges of X must be mapped into non-edges of H. 
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FACTS 


F64: [Lo72] Let G, H, and X be as above. Then 


[Hla\x = >5 (-1)""[H]e-x+y 
YCX 


F65: Nash—-Williams’ Lemma Let G, H and X be as above, and suppose that G and 
HT have the same edge-deck. Then 


[H]q = |Aut(@)| + (-1)*|([H]ayx — [Glayx) 
(See [Bo91,LaSc03].) 


The following is an important corollary to Nash—Williams’s Lemma. (See [Bo91, 
LaSc03].) 


F66: Suppose G, H and X are as in Nash—Williams’s Lemma, and assume that G % H. 
Then, 


(i) if |X] is odd, then [H]q\x > 0; 
(ii) if |X| is even, then [G]q\x > 0. 


From this, Lovasz and Miller’s results follow. 
F67: [{Lo72] Let G be a graph such that |E(G)| > ()/2. Then G is edge-reconstructible. 
F68: [Mii77] Let G be a graph such that 2!"(@!-! > n!. Then G is edge-reconstructible. 


Perhaps the most striking result in edge-reconstruction obtained by these methods 
is the following. 


F69: [Py90] A Hamiltonian graph with a sufficiently large number of vertices is edge- 
reconstructible. 


Structures Other than Graphs 


The real power and generality of the above methods appear with the realization that 
edge-reconstruction can be generalized to the reconstruction, up to some group of iso- 
morphisms, of a combinatorial object or structure from its subobjects, again given up 
to isomorphism. 


DEFINITION 


D18: More exactly, define a structure to be a triple (D,T, FE) where D is a finite set, T 
is a group of permutations acting on D, and E is a subset of D. By edge-reconstruction 
we mean here that the subsets & — x are given, up to ‘translation’ by the group I, 
and the question is whether FE can be reconstructed uniquely, again up to action by the 
group T. 


Therefore in edge-reconstruction for graphs, D would be the set of all possible a) 
edges on n vertices, & would be the edges which define the graph to be reconstructed, 
and I’ would be the full symmetric group with its induced action on the unordered, 
distinct pairs of vertices. 
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An extended treatment of edge-reconstruction seen in this light can be found in 
[Bo91, LaSc03}. 


FACTS 

We shall here present some results obtained by viewing edge-reconstruction in this 
more general setting. The first result, although a straightforward application of Nash— 
Williams’ Lemma to structures, gives some life to the problem of reconstructing from 
the endvertex-deck. 


F70: [LaSc03] Let H be a graph with minimum degree 2, and let G be obtained from 
HT by adding k endvertices such that no two have a common neighbor. Then G is 
endvertex-reconstructible if either k > V|H|/2 or 2*-! > Aut(H). 


REMARK 


R12: This result in conjunction with Bryant’s negative result leads to the natural 
question asking what is the minimum proportion of endvertices required to guarantee 
endvertex-reconstructibility. 


DEFINITION 


D19: The following generalizes Miiller’s result not only to structures, but also in this 
fashion: instead of removing one edge at a time, k edges at a time are removed. Let us 
call this k-edge-reconstruction. 


FACTS 


F71: [AlCaKrRo89] Let (D,I', £) be a structure such that 2'7!-* > |T|. Then the 
structure is k-edge-reconstructible. 


The paper [AlCaKrRo89] contains several other results of this type and should be 
studied carefully by anyone who is interested in extending the reconstruction of struc- 
tures in the direction of k-edge-reconstruction. Many of the results in this paper have 
been extended more recently by Radcliffe and Scott. They consider the reconstruction 
of a subset of the integers modulo n, Z,, or of the reals, R, up to translation from 
the collection of its subsets of a given size, also given up to translation. The following 
summarizes their important results. 


F72: [RaSc98] Suppose p is prime. The every subset of Z, is reconstructible from the 
collection of its 3-subsets. 


F73: [RaSc98] For arbitrary n, almost all subsets of Z,, are reconstructible from the 
collections of their 3-subsets. 


F74: [RaSc98] For any n, every subset of Z,, is reconstructible from its 9a(n)-subsets, 
where a(n) is the number of distinct prime factors of n. 


F75: [RaSc99] A locally finite subset of R (that is, a subset which contains only 
finitely many translates of any given finite set of size at least 2) is reconstructible from 
its 3-subsets. 
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Another recent result of this type is the following. Here subsets of R? are considered, 
and any two such subsets are considered isomorphic if one can be transformed into the 
other by a translation or a rotation by a multiple of 90 degrees. 


F76: [Ra02] Any finite subset A of the plane R? is uniquely determined by at most 5 
of its subsets of cardinality |A|—1, given up to isomorphism; that is, in the terminology 
of graph reconstruction, A has reconstruction number 5. 


The Reconstruction Index of Groups 


Looking at edge reconstruction in this guise, that is, as reconstruction of structures 
(D,T,F), has led some authors to focus attention more directly on the permutation 
group [. 


DEFINITION 


D20: The reconstruction index p(T, D) of the permutation group I acting on D 
is the smallest t such that for any E C D with |E| > t, the structure (D,T, E) is 
edge-reconstructible. 


The Edge-Reconstruction Conjecture therefore states that, if Y = {1,2,...,n} and 
D is the set of unordered, distinct pairs of Y, and if Ss?) is the symmetric group of Y 


acting on these pairs, then (82), D)=4. 


FACT 
The following is one result obtained on the reconstruction index of permutation 
groups. 


F77: [Mn98] The reconstruction index of an abelian group is 4 and the reconstruction 
index of Hamiltonian groups is 5. 


Other works which deal with the reconstruction index of groups are [Ca96, Ma96, 
Mn87, Mn92, Mn95] 


2.3.6 Digraphs 


The situation with the reconstruction of digraphs is quite different from that of 
graphs, for here it has been shown that the conjecture is false. 


FACTS 


F78: [St77, Ko85] There exists an infinite family of tournaments which are not recon- 
structible. 


A positive partial result has been obtained by Harary and Palmer. 


F79: [HaPa67| Tournaments on at least five vertices which are not strongly connected 
are reconstructible. (See also [BoHe77].) 


Some more positive results on the reconstructibility of tournaments can be found in 
[DeGu90, Gu96, Vi99]. 
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Therefore the problem here should be to investigate which digraphs are recon- 
structible, or to determine what reconstruction question one should ask for digraphs. In 
fact, Ramachandran has noted that all non-reconstructible digraphs which have been 
discovered to date have the property that they would be reconstructible if, with every 
D —v, one is also given the in-degree and the out-degree in D of the missing vertex v. 


DEFINITION 


D21: [Ra97] A digraph D is said to be N-reconstructible if it is uniquely determined 
by the triples (D — u;, deg,,, (vi), degout(vi)), for all vertices v; of D. 


And Ramachandran goes on to make the following conjecture. 
C4: [The N-Reconstruction Conjecture for Digraphs] Every digraph is N-reconstructible. 


No counterexamples to this conjecture are known. 


2.3.7 Illegitimate Decks 


As we have already said, the reconstruction problem is not about finding an efficient 
algorithm to determine G from its deck, but the question is one of uniqueness: is there 
only one graph with the given deck? 

However, there is one problem in graph reconstruction which falls naturally within 
the setting of computational complexity. 


DEFINITION 


D22: A collection of graphs G,,G2,...,G, each on n — 1 vertices is said to be an 
illegitimate deck if there is no graph G having the given collection as deck. The 
illegitimate deck problem is to determine whether or not such a given collection of 
graphs is indeed the deck of some graph. 


FACTS 


F80: [{Ma82] Determining whether a given collection of graphs is an illegitimate deck 
is at least as hard as the isomorphism problem. 


F81: {HaP1St82] The graph isomorphism problem is polynomially equivalent to the 
illegitimate deck problem for regular graphs. 


More information about the relationship between the computational complexities 
of the legitimate deck problem and the graph isomorphism problem can be found in 
[K6ScT093, KrHe94]. 
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2.3.8 Recent Results 


We briefly outline the most important results in reconstruction obtained since the 
publication of the Handbook of Graph Theory, First Edition. Notably, Asciak, Fran- 
calanza, Lauri and Myrvold [AsFrLaMy10] have given a survey of open questions re- 
garding reconstruction numbers. At about the same time, Bowler, Brown, and Fenner 
have commenced a systematic attack on the adversary reconstruction number. 


FACTS 


F82: [BikKwYu07] The class of planar graphs is recognizable. 


F83: [BoBrFel0] There exist pairs of graphs on n vertices with 2|4(n — 1)| common 
cards for every n > 10. 


F84: There exist pairs of trees on n vertices with 2|3(n —5)| cards in common for 
n> 8. 


F85: [BoBrFeMyl11] The maximum number of cards in common between a connected 
and a disconnected graph on n vertices is |} |+1. Graphs that attain this upper bound 
are characterized. 


F86: ([AsLal0] If G is a connected graph all of whose components are isomorphic 
and contain k edges, and if G has edge-reconstruction number at least k +1, then the 
components of G' are isomorphic to the star Ky,,. 


CONJECTURES 


C5: [BoBrFel0] The largest possible number of cards that two graphs on n vertices 
can have in common is 2|$(n —1)|. Affirming this would imply that any such graph 
can be reconstructed from any 2|4(n — 1)| +1 of its cards. 


C6: [BaWel0] They also conjecture that drn(T) < 2 for all but finitely many trees T. 


C7: [AsLal0] For a disconnected graph all of whose components are isomorphic to H, 
suppose that ern(G) > 3. Then H is isomorphic to the star K1,,. 


REMARK 


R13: The reconstruction number and the edge-reconstruction number of all graphs 
with n < 11 vertices are given by [RiRal1], along with computational results on recon- 
struction numbers associated with the removal of more than one vertex at a time. 


DEFINITION 


D23: The degree associated reconstruction number drn(G) of a graph G is the 
minimum number of cards required to reconstruct G if, along with any card G — v, one 
is also given the degree of the missing vertex v. Note that drn(G) is equivalent to the 
class reconstruction number of graphs with a given value of m as the number of edges. 


FACTS 


F87: [Ra06] If G is a connected non-regular graph and kG is the disconnected graph 
made up of & copies of G then drn(kG) < 1+darn(G). If G is r-regular of order n > 2 
then drn(kG) <n+2-—r. 
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F88: [BaWel0] The degree associated reconstruction number equals 2 for all cater- 
pillars, except stars and one 6-vertex example. 
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INTRODUCTION 


The core idea of recursively constructed graphs is captured in Definition 1, but the 
substantial literature on the subject has motivated a considerable breadth and variety 
of notational distinctions. 


NOTATION: All graphs in this section are simple, and an edge with endpoints x and y is 
denoted (a, y). 


DEFINITIONS 


D1: A recursively constructed graph class is defined by a set (usually finite) of 
primitive or base graphs, in addition to one or more operations that compose larger 
graphs from smaller subgraphs. Each operation involves either fusing specific vertices 
from each subgraph or adding new edges between specific vertices from each subgraph. 


D2: Each graph in a recursive class has a corresponding decomposition tree that 
shows how to build it from base graphs. 


REMARK 


R1: Graphs in these classes possess a modular structure, so fast algorithms can often 
be designed to solve hard problems restricted to these classes. The algorithms typically 
proceed by solving the desired problem on the base graphs, then employ dynamic pro- 
gramming to combine solutions for small subgraphs into a solution for a larger graph. 
The construction of these algorithms is the subject of Section 10.4. 


102 Chapter 2. Graph Representation 
2.4.1 Some Parameterized Families of Graph Classes 


Trees 


DEFINITION 


D3: The graph with a single vertex r (and no edges) is a tree with root r (the sole 
base graph). Let (G,r) denote a tree with root r. Then (Gi,11) ® (G2,r2) is a tree 
formed by taking the disjoint union of G; and G2 and adding an edge (r1,r2). The root 
of this new tree is r =r}. 


TERMINOLOGY NOTE: Technically, the pairs (Gr) in Definition D3 denote rooted trees. 


However, the specification of distinguished vertices r; and rz (and hence r) is relevant 
here only as a vehicle in the recursive construction. 


EXAMPLE 


E1: Figure 2.4.1 illustrates the recursive construction of trees. 


Figure 2.4.1: Recursive construction of a tree. 


Series-Parallel Graphs 


From a non-recursive perspective, a graph is series-parallel if it has no subgraph home- 
omorphic to K4 [Du65]. The graph on the left of Figure 2.4.2 is not series-parallel; 
the offending subgraph is identified by bold edges. Removal of two edges, as indicated, 
yields the graph to the right which is series-parallel. 


Figure 2.4.2: Non-series-parallel and series-parallel graphs. 


Following, we give a recursive definition of this class. 
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DEFINITION 


D4: A series-parallel graph with distinguished terminals | and r is denoted (G,1,r) 
and is defined recursively as follows: 
e The graph consisting of a single edge (v1, v2) is a series-parallel graph (G,1,r) 
with 1 = v, and r = vo. 
e The series operation (Gi,11,71) ©s (Go, l2,r2) forms a series-parallel graph by 
identifying r, with Jj. The terminals of the new graph are J, and ro. 
e The parallel operation (G1, 1),11) ©p (G2, 12,72) forms a series-parallel graph by 
identifying 1; with lz and r; with rg. The terminals of the new graph are J, and 
T1- 
e The jackknife operation (G1,l1,171) ©; (Go, l2,r2) forms a series-parallel graph 
by identifying r, with l2; the new terminals are /; and rj. 


COMPUTATIONAL NOTE: The jackknife operation can also be specified where the new 
terminals, after composition, are defined to be 1; and lo. 


EXAMPLE 


E2: The three operations defining series-parallel graphs are demonstrated in Figure 
2.4.3. The pair-specific composition is on the left; the result is shown to the right. 
Terminal vertices are circled and labeled. 


fy r1 lo oro 
+) ,@—® 
fy [5 
“Jp 
PY Py 
fy ry lg 9g 
),e—_® 


Figure 2.4.3: Composition operations for series-parallel graphs. 


k-Trees and Partial k-Trees 


DEFINITIONS 


D5: The k-vertex complete graph, K,, isa k-tree. A k-tree with n+1 vertices (n > k) 
is constructed from a k-tree on n vertices by adding a vertex adjacent to all vertices of 
one of its Ky, subgraphs, and only to those vertices. 


D6: A partial k-tree is a subgraph of a k-tree. 


TERMINOLOGY NOTE: In a given construction of a k-tree, the original K;, subgraph is 
referred to as its basis. 


104 Chapter 2. Graph Representation 


D7: A graph is chordal (or triangulated) if it contains no induced cycles of length 
greater than 3. 


Ds: A graph is perfect if every induced subgraph has chromatic number equal to the 
size of its maximum clique. 

FACTS 

F1: ‘Trees are 1-trees, and forests are partial 1-trees. 

F2: Series-parallel graphs are partial 2-trees. 

F3: Any Kk; subgraph of a k-tree can act as its basis. 

F4: All k-trees are chordal graphs and, hence, perfect (because every chordal graph is 
perfect). 

EXAMPLES 


E3: A 3-tree is shown on the left in Figure 2.4.4, and a partial 3-tree is shown to the 
right. 


Figure 2.4.4: Construction of a 3-tree and a partial 3-tree. 


Demonstrated by the graph to the left in Figure 2.4.4 is the “creation” of a 3-tree 
following a small number of composition operations starting from the basis given by an 
initial K3 identified by vertex labels of 0. At each step, a new (consecutively labeled) 
vertex is added. Observe that if edge e is eliminated from the graph on the right in 
Figure 2.4.4, a partial 2-tree is created. 


E4: The graph on the left in Figure 2.4.5 is series-parallel; it is a subgraph (and hence 
a partial 2-tree) of the 2-tree on the right. The dotted edges complete the 2-tree where 
the construction is verified by the labels on the vertices that are interpreted similarly 
as for Figure 2.4.4. 


Figure 2.4.5: A series-parallel graph and a 2-tree. 
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Halin Graphs 


DEFINITION 

D9: A Halin graph is a planar graph having the property that its edge set FE can be 
partitioned as E = (T,C), where T is a tree with no vertex of degree 2 and C is a cycle 
including only and all leaves of T. 

FACTS 

F5: Halin graphs are contained in the class of partial 3-trees. 

F6: The set of Halin graphs is not closed under the taking of subgraphs, i.e., some 
subgraphs of Halin graphs are not Halin graphs. 

EXAMPLES 


E5: A Halin graph is given in Figure 2.4.6, with the cycle edges drawn on the outer 
face; their removal yields a tree satisfying the stated degree stipulation. 


Figure 2.4.6: A Halin graph. 


E6: The graph G on the left in Figure 2.4.7 is a 3-tree; vertex labels guide the con- 
struction as before. But this graph G' is not a Halin graph. However, by removing one 
edge, we obtain the subgraph G’ on the right, which is both a partial 3-tree and a Halin 
graph. The edges shown in bold form a tree of the Halin graph, and the cycle edges can 
be easily traced through the leaves of this tree. 


0 0 
Figure 2.4.7: Non-Halin graph and Halin graph. 
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Bandwidth-k Graphs 


DEFINITION 


D10: A graph G(V,E) is a bandwidth-k graph if there exists a vertex labeling 
h:V — {1,2,...|V|} such that {u,v} € EB => |h(u) — h(v)| < &. (Bandwidth is 
discussed in §9.4.) 


EXAMPLE 


E7: A bandwidth-3 graph is shown to the left in Figure 2.4.8; displayed to the right is 
a bandwidth-2 graph. 


2 4 
Figure 2.4.8: Bandwidth-3 and bandwidth-2 graphs. 


Treewidth-k Graphs 


The seminal works by Robertson and Seymour (cf. [RoSe86-a], [RoSe86-b], [RoSe91-a]) 
are commonly identified as being responsible for motivating the creation of the graph 
classes in this section. Most notable is the concept of treewidth, which played a key role 
in the authors’ work on graph minors culminating, ultimately, in the proof of Wagner’s 
conjecture, a topic addressed in §2.4.2. 


DEFINITIONS 


D11: A tree-decomposition of a graph G = (V, E) is a pair ({X; | i € I},T), where 
{X; | 7 © I} is a family of subsets of V, and T is a tree with vertex set I such that: 

° Vier Xi as 

e for all edges (x,y) € E there is an element 7 € J with z,y € X;, and 

e for all triples i,j,k € I, if j is on the path from 7 to k in T, then X; (| Xp C Xj. 


D12: The width of a given tree-decomposition is measured as max je7{|X;| — 1}. 


D13: The treewidth of a graph G is the minimum width taken over all tree- 
decompositions of G. 


D114: A graph G is a treewidth-k graph if it has treewidth no greater than k. 
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REMARK 


R2: Trivially, every graph, G, has a tree-decomposition that is defined by a single vertex 
(representing G itself). On the other hand, we are interested in tree-decompositions and, 
hence, their graphs, in which the X; are small (i.e., graphs with small treewidth). 


EXAMPLE 


E8: A sample tree-decomposition is shown in Figure 2.4.9. For the stated graph, G, 
one family of suitable vertex sets can be given by: X1 = {v1, v2, u3}, Xo = {v2, v7, us}, 
X3 = {vo,v3,07},X4 = {v3,05, 07}, X5 = {v3, v4, U5}, and X_ = {vs,ve,v7}. An ap- 
propriate tree T’ is shown next and then on the right side of Figure 2.4.9, the relevant 
subgraphs of G induced by the stated pair ({X;},7) are displayed. Moreover, the graph 
G has treewidth 2; in fact, the graph is series-parallel. 


Us 
cal v2 
u 
8 ve 
xX) X2 
UT 
v3, v2 aa 
bi i X3 
1 2 vy 
UT U3 
| ge 
v 
U3 3 
X4 


UB XxX 5 Xx 6 U5 \ 
U4 U5 
i lg " 


Figure 2.4.9: A sample tree-decomposition. 


Pathwidth-k Graphs 


DEFINITIONS 


D15: A path-decomposition is a tree-decomposition whose tree is a path. 


NOTATION: A path-decomposition is often denoted simply by a sequence of vertex sub- 
sets of V, say {X1, Xo,..., Xz}, listed in order defined by their position on the path. 


D16: The width of a path-decomposition is max 1<;<:{|Xi| — 1}. 


D17: The pathwidth of a graph G is the smallest width taken over all path-decompositions 
of G. 


D18: A pathwidth-k graph is a graph that has pathwidth no greater than k. 
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EXAMPLE 


E9: A sample path-decomposition is shown in Figure 2.4.10. The vertex-sets X; and 
the first edge occurrences are displayed below the corresponding vertices of T. 


X, Xp X3 Xq X5 Xe X7 Ag XQ Ayo Ary Ap 
o_o —_e_eo_0_0__e__0_0__0—_-® 


Can ve] Ul 4] U4 Ug US Ug U3 UF U4 UR US Lg Ug u vg 


2 v9 
® ! e oes ios igs Lo] les lee S . S 
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fea elo a3 e'’4 e'%5 eS @l7 UT® 


Figure 2.4.10: A path-decomposition. 


Branchwidth-k Graphs 


DEFINITIONS 


D19: A branch-decomposition of a graph G = (V,£) is a pair (T, f), where T is 
a tree in which every non-leaf vertex has exactly three neighbors and f is a bijection 
from the leaves of T to E. 


D20: If the degree of every non-leaf vertex in T is at least 3, the pair (T, f) is called a 
partial branch-decomposition. 


D21: Let (T, f) be a branch decomposition of a graph G = (V, F). The order of an 
edge e of T is the number of vertices v in V such that there exist leaves J; and lg of T 
residing in different components of T — e, where f(i,) and f(lz) are both incident on v. 


D22: The width of a branch decomposition (T, f) is the maximum order of the edges 
of T. 


D23: The branchwidth of G is the minimum width taken over all branch-decomposi- 
tions of G. 


D24: A graph G is a branchwidth-k graph if it has branchwidth no greater than k. 


FACTS 


F7: [RoSe91-a] A graph G is branchwidth-0 if and only if every component of G has 
at most one edge. 


F8: [RoSe91-a] A graph G is branchwidth-1 if and only if every component of G has 
no more than one vertex with degree greater than or equal to 2. 


F9: [RoSe91-a] A graph G is branchwidth-2 if and only if G has treewidth no greater 
than 2. 
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EXAMPLE 


E10: A branchwidth-2 graph is shown on the left in Figure 2.4.11 (edges are numbered); 
its branch-decomposition is given to the right. 


Figure 2.4.11: Branchwidth-2 graph and its branch-decomposition. 


k-Terminal Graphs 


DEFINITIONS 


D25: A k-terminal graph G = (V,T, E) has a vertex set V, an edge set F, and a set 
of distinguished terminals T = {t1,t2,...,tyrj} GC V, where |T| < k. 


D26: A k-terminal recursively structured class C(B, R) is specified by a set B 
of base graphs and a finite rule set R = {f1, fo,..., fn}, where each f; is a recursive 
composition operation. 


EXAMPLE 


E11: A construction for a 2-terminal graph is shown in Figure 2.4.12. Vertices are 
labeled in order to clarify how constituent subgraphs compose; terminals are denoted 
by doubly circled vertices. 


Figure 2.4.12: Recursive construction of a 2-terminal graph. 
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REMARKS 


R3: Typically, for some k, B is the set of connected k-terminal graphs (V,T, F) with 
V =T. But each such base graph is trivially composed of individual edges, so it is 
reasonable and hence convenient to simply use C(R) to denote C(B, R), where B is a 
singleton consisting only of edges (i.e., 2). 


R4: The notion of composition typically permitted in the context of k-terminal 
graphs can be described in a more formal way. For 1 < i < m, let G; = (Vi,T;, Ei), 
such that Vi,...,Vin are mutually disjoint vertex sets. Let G = (V,T, E) as well. Then 
a valid vertex mapping is a function f : Uj<i<mV; — V such that: 


e Vertices from the same G; remain distinct: 


v1 € Vi, v2 € Vi, f(v1) = f(v2) > v1 = v2 


e Only (not necessarily all) terminals map to terminals: 


veVi fweTsveT; 


e Only terminals can merge: 


v1 € Vi, , V2 € Vig t1 F ta, f(vr) = f(v2) > v1 € Ty, ve © Ti, 


e Edges are preserved: 


(si) ({u1, va} € Ei) = {f(v1), f(va)} € EB 
NOTATION: If f is a valid vertex mapping, then the corresponding m-ary composition 


operation (denoted by f) is generally written f(G1,...,Gm) = G. 


Cographs 


DEFINITION 


D27: A cograph is defined recursively as follows: 
e A graph with a single vertex is a cograph. 
e If G; and Gp are cographs, then the disjoint union G'; U G2 is a cograph. 


e If G, and G2 are cographs, then the cross-product G, x G2 is a cograph, which is 
formed by taking the union of G and G2 and adding all edges (v1, v2) where v1 
is in G, and v2 is in Go. 


TERMINOLOGY NOTE: Cographs are also referred to as complement reducible graphs. 
FACTS 


F10: [CoLeBu81] The complement of any cograph is also a cograph. 


F11: [CoLeBu81] All cographs are perfect. 


Section 2.4. Recursively Constructed Graphs 111 


EXAMPLE 


E12: A cograph construction is demonstrated in Figure 2.4.13. The relevant operations 
are signified at each node of the decomposition tree (left) for the graph G shown on the 
right. 


a d 
b e 
f 

G 


Figure 2.4.13: Cograph construction. 


Cliquewidth-k Graphs 


The graph parameter cliquewidth was introduced in [CoEnRo93] and formed a seminal 
concept in linking research in graph theory and logic. 


DEFINITION 


D28: Let [k] denote the set of integers {1,2,...,k}. A cliquewidth-k graph is defined 
recursively as follows: 


e Any graph G with V(G) = {v} and I(v) € [k] is a cliquewidth-k graph. 
e If G; and G are cliquewidth-k graphs and i, 7 € [k] with i 4 7, then: 


— The disjoint union G, U G2 is a cliquewidth-k graph. 


— The graph (G1);xj is a cliquewidth-k graph, where (G1); is formed from 
G, by adding all edges (v1, v2) such that [(v,) = 7% and I(v2) = 7. 


— The graph (G);-,; is a cliquewidth-k graph, where (G'1),;_,; is formed from 
G1 by switching all vertices with label 7 to label 7. 


REMARK 


R5: Definition D28 defines the class of cliquewidth-k graphs. The cliquewidth of a 
graph G is the smallest value of k such that G is a cliquewidth-k graph. A cliquewidth 
decomposition for a graph is a rooted tree such that the root corresponds to G, each 
leaf corresponds to a labeled, one-vertex graph, and each non-leaf node of the tree is 
obtained by applying one of the operations U, 7 x 7, or i > 7 to its child or children. 


TERMINOLOGY NOTE: In this section, the term clique refers to any complete subgraph 
of the graph. In some other sections of this handbook, clique is defined to be a maximal 
subset of pairwise adjacent vertices of the graph. 
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TERMINOLOGY NOTE: Every tree is a treewidth-1 graph, so treewidth is a measure of 
how much a graph varies from a tree. Similarly, every clique is a cliquewidth-2 graph, 
so cliquewidth is a measure of how much a graph varies from a clique. This analogy 
forms the basis for coining the term cliquewidth (cf. [CoO100]). 


EXAMPLE 


E13: A cliquewidth-3 construction is given in Figure 2.4.14. As in Example E12, the 
relevant operations are identified at each node of the decomposition tree (left) for the 
graph G shown on the right. 


o 


G 


Figure 2.4.14: A cliquewidth-3 graph construction. 


k-NLC Graphs 


DEFINITION 


D29: Let [k] denote the set of integers {1,2,...,4} and let B denote a bipartite 
graph on [k] x [k]. A k-NEIC (node-label-controlled) graph is defined recursively 
as follows: 


e Any graph G with V(G) = {v} and I(v) € [k] is a K-NLC graph. 
e If G; and Gg are k-NLC graphs and i, 7 € [k], then: 


— The join G, x p G2 is a k-NLC graph, where G, x pg G2 is formed from G1UG2 
by adding all edges (v1, v2) where v1 € Vi, (v1) = 7, ve € Va, I(ve) = j, and 
(i,j) is an edge in Eg. 

— The graph (G1);-,; is a k-NLC graph, which is formed from G, by switching 
all vertices with label 7 to label 7. 
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EXAMPLE 


E14: The same graph G previously shown in Figure 2.4.14 is a 2-NLC graph. In Figure 
2.4.15, its decomposition tree has leaves corresponding to the vertices a, b, c, d, e, f, 
and g with starting labels drawn from the set & = {1,2} as shown. Relevant operations 
are identified with the internal nodes of the tree, i.e., (i + 7) for label switching and 
(i, 7) indicating the specific edge from Eg inducing the stated composition. 


(1,1) 


a(1) b(2) e(1) (2) 


Figure 2.4.15: A 2-NLC graph construction. 


k-HB Graphs 


The homogeneous balanced graphs produce a modular decomposition upon application 
of a certain decomposition algorithm. 


DEFINITION 


D30: k-HB (homogeneous balanced) graphs are graphs for which there is a 
particular O(n**?)-time top-down decomposition algorithm that constructs a pseudo- 
cliquewidth-(k + 2") balanced decomposition. 


REMARKS 


R6: Top-down decomposition refers to a recognition algorithm that places a candidate 
graph at the root of a tree and then decomposes this graph into smaller subgraphs that 
become its children in the tree, and so on, recursively, until reaching the leaves of the 
tree. 


R7: A pseudo-cliquewidth decomposition is similar to a k-NLC decomposition, except 
that the vertex labels used at one node in the tree are not enforced at other nodes. 


R8: A balanced decomposition of an n-vertex graph is a decomposition tree that has 
height O(log n). 


R9: The requirement that the decomposition must be balanced is more restrictive, 
while simultaneously the pseudo-cliquewidth condition is less restrictive. This trade- 
off yields the class of k-HB graphs. For more details on these matters, see [Jo03}, 
[BoJoRaSp04]. 


114 Chapter 2. Graph Representation 


R10: However, k-HB graphs are an ambiguously defined class due to the nondeter- 
ministic nature of this decomposition algorithm. On the other hand, the decomposition 
is guaranteed to succeed for every cliquewidth-k graph despite this nondeterminism, so 
every cliquewidth-& graph is a k-HB graph. 


2.4.2 Equivalences and Characterizations 


Relationships between Recursive Classes 


A number of equivalences serve to relate many of the recursive graph classes defined 
in the previous subsection. Several of these are listed below. Unless a specific source 
is cited, a good general and fairly comprehensive reference for Facts F12 through F19 
(and others) is [BrLeSp99]. 


FACTS 
F12: A graph has treewidth at most & if and only if it is a partial k-tree. 
F13: Every bandwidth-k graph is a pathwidth-k and thus a treewidth-k graph. 


F14: The class of partial k-trees can be defined as a (k + 1)-terminal recursive graph 
class (cf. [WiHe88], [Wi87]). 


F15: 1-trees are trees in the usual sense and have treewidth 1. 
F16: ‘Trees are series-parallel graphs where only the jackknife operation is used. 


F17: Series-parallel graphs in which only the series and parallel operations are used 
are precisely the 2-terminal series-parallel graphs. 


F18: Series-parallel and outerplanar graphs are partial 2-trees and have treewidth 2. 


F19: Halin graphs are contained in the class of partial 3-trees; they are also defined 
as a class of 3-terminal graphs by an appropriate choice of composition operations. 


F20: [CoEnRo93] Cographs are precisely the cliquewidth-2 graphs. 

F21: [CoRo05] Every treewidth-k graph has cliquewidth at most 3-2*7!. 

F22: [RoSe91-a] Every graph of branchwidth at most k has treewidth at most 3k/2. 
F23: [RoSe91-a] Every graph of treewidth at most k has branchwidth at most k + 1. 
F24: [Wa94] Cographs are exactly the 1-NLC graphs. 

F25: [Wa94] Every treewidth-k graph has NLC width at most 2*+! — 1. 

F26: [Jo98] Every cliquewidth-k graph is a k-NLC graph. 


F27: [Jo98] Every k-NLC graph is a cliquewidth-2k graph. 


F28: [Jo03], [BoJoRaSp04] Every cliquewidth-k graph is a k-HB graph. 
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Characterizations 


Structural characterizations of recursive graph classes are generally stated in terms of 
forbidden subgraph minors. 


DEFINITIONS 


D31: An edge-extraction operation on a graph G = (V, F) removes an edge e leaving 
a graph, G—e, with V(G-—e) = V and E(G—e) = E — {e}. 


D32: The operation of edge-contraction produces a graph with edge-set E — {e} but 
with a vertex-set obtained by replacing (“merging”) the vertices defining e in G, thus 
creating a new single vertex where the latter inherits all of the adjacencies of the pair 
of replaced vertices, without introducing loops or multiple edges. 


D33: A graph H is a minor of a graph G if and only if it can be obtained from G by 
a finite sequence of edge-extraction and edge-contraction operations. 


REMARKS 


R11: A result apparently first conjectured (but unpublished) by K. Wagner asserts 
the following: Suppose ¥ is a graph class with the property that if G is in F and H 
is contained as a minor in G, then H is in fF, i.e., the class F is closed under minors. 
Then there exists a finite set {H1, H2,...,H,} of graphs, the forbidden minors such 
that G is in F if and only if it contains no minor isomorphic to any member H; for 
1<i<k. 


R12: Robertson and Seymour ([RoSe88-b]) confirmed Wagner’s conjecture and with 
their proof established that any graph class F closed under minors can be recognized 
in polynomial time. Unfortunately, this outcome, although deep, is an existential one; 


we do not know the number of forbidden minors or their sizes in an arbitrary case. 


R13: The class of partial k-trees is closed under minors and thus, by the Robertson— 
Seymour results is completely characterized by a finite set of forbidden minors. 


R14: The forbidden minors for partial 3-trees are known (see Fact F33 below), but 
complete lists of explicit minors for partial k-trees are not known for values of k > 4. 


FACTS 
F29: [CoLeBu81] Cographs have no induced paths P,. 
F30: Trees are graphs having no K3 minor. 


F31: The class of partial 2-trees is characterized by a single forbidden minor: the 
complete graph Ky. 


F32: The forbidden minors of outerplanar graphs are K, and K2.3. 
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F33: The class of partial 3-trees has four forbidden minors: Ks and the three graphs 
shown in Figure 2.4.16. 


Figure 2.4.16: Forbidden minors of partial 3-trees. 


2.4.3. Recognition 


In order to solve graph problems on recursive classes and particularly, to do so 
efficiently, it is necessary that membership in the classes be quickly recognized. 


REMARKS 


R15: Some recognition algorithms are direct and essentially ad hoc. For example, 
Halin graphs can be recognized by first testing for 3-connectivity and for planarity. Then 
simply embed the graph in the plane, select a largest cycle of edges that corresponds to 
a face on the plane embedding, remove these edges and test if the remaining graph is a 
tree of the stated form (see [CoNaPu83}). 


R16: Partial 2-trees or series-parallel graphs are recognizable, unambiguously, by 
successive application of the following reduction operations (cf. [Du65]): replace any 
vertex of degree 2, say v;, and its incident edges (v;,v;) and (v;,ug) by a new edge 
(v;, Uz); replace any pair of multiple edges by a single edge; and eliminate any edges 
incident to a vertex of degree 1 unless only one edge remains. Then a single edge 
remains, upon an admissible application of these reduction operations, if and only if the 
original graph is a partial 2-tree; otherwise, the process will stop with either Ky, or a 
graph with a K4 minor. 


R17: Similar reduction operations have also been described in the case of partial 
3-trees (cf. [ArPr86]) as well as for partial 4-trees ([Sa96]). 


EXAMPLE 


E15: An illustration of a successful reduction sequence is shown in Figure 2.4.17. 


Pa ee 


Figure 2.4.17: Reduction operations for a partial 2-tree. 
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Recognition of Recursive Classes 


FACTS 
F34: Trees can be recognized and their decomposition constructed in linear time. 


F35: Series-parallel graphs can be recognized and their decomposition constructed in 
linear time. 


F36: Treewidth-k, pathwidth-k, branchwidth-k, and bandwidth-k graphs can be rec- 
ognized and their decompositions constructed in O(n**?) time. 


COMPUTATIONAL NOTE: For fixed k the polynomial-time algorithms of Fact F36 are 
practical. 


F37: The graph classes of Fact F36 can be recognized in linear time for fixed k. 


COMPUTATIONAL NOTE: ‘The corresponding algorithms referred to by Fact F37 are 
not practical because their running times possess enormous hidden constants. 


F38: When k < 4, more practical linear-time recognition algorithms have been found 
for the graph classes in Fact F36 (cf. [MaTh91] for k = 3; [Sa96] when k = 4). 


F39: When k is part of the problem instance, the recognition problems associated with 
the graphs of Fact F36 are ’P-complete. 


F40: Branchwidth can be determined in polynomial time for planar graphs [SeTh94]. 


F41: Since partial k-trees are characterizable by a finite set of forbidden minors, they 
are polynomially recognizable (cf. [RoSe88-b}). 


COMPUTATIONAL NOTE: Fact F41 was established in the graph minors results of 
Robertson and Seymour. However, the result is existential rather than constructive 
and so the actual exhibition of the implied algorithms remains elusive. 


F42: [Wi87] Every k-terminal graph is a treewidth-k’ graph for some k’ that depends 
upon k and the particular set of recursive composition operations. For example, if m 
denotes the maximum arity of any operation, then k’ < km. 


TERMINOLOGY NOTE: The term “arity” refers to the number of operands. For example, 
a binary operation has arity 2. 


F43: [CoPeSt85] Cographs can be recognized and their decomposition constructed in 
linear time. 


F44: [CoHaLaReRo00] Recognition of cliquewidth-k graphs is solvable in polynomial 
time for k < 3, but this problem remains open for fixed k > 4. 


F45: [FeRoRoSz09] Recognition of cliquewidth-k graphs is an M’P-complete problem 
for arbitrary k. 


F46: [Jo00] Recognition of k-NLC graphs is solvable in polynomial time for k < 2, but 
this problem remains open for fixed k > 3. 


118 Chapter 2. Graph Representation 


F47: [GuWa05] Recognition of k-NLC graphs is ’P-complete for arbitrary k. 


F48: [Jo03], [BoJoRaSp04] Algorithms for problems defined on k-HB graphs are robust 
with respect to cliquewidth-k graphs. That is, such an algorithm either determines the 
correct answer or reports that the decomposition was unsuccessful and hence the input 
graph is not a cliquewidth-k graph. The O(n**?)-time decomposition algorithm for k- 
HB graphs is guaranteed to succeed for all cliquewidth-k graphs as well as some others. 
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INTRODUCTION 


The goal of this section is to survey some recent results in structural graph the- 
ory. One of the greatest achievements of structural graph theory to this day has been 
the Robertson—Seymour Graph Minor Project, which, in addition to answering a long 
standing open conjecture of Wagner, revolutionized the field and laid foundation to a 
large body of research that is being conducted today. For this very reason, numer- 
ous excellent surveys on the topic of minors have been written over the years (see, for 
example, [Di05, Re97, Th99]), and so in this section we concentrate on other aspects 
of structural graph theory. The bulk of the section is devoted to the area of induced 
subgraphs, which received a fair amount of attention in the past 10 years, due to the 
proof of the Strong Perfect Graph Conjecture, a famous open question, posed by Claude 
Berge in 1961 [Be61], which was finally solved in the early 2000s [ChRoSeTh06]. We 
conclude the section with a survey of new results on tournament immersion. 


DEFINITIONS 

All our graphs are finite and simple (except when we explicitly say otherwise). Let 
us start with some definitions. We omit some very basic and standard definitions here; 
the reader is referred to [Di05] or [We01] for those. Let G be a graph. We denote its 
vertex set by V(G), and its edge set by E(G). 


D1: The complement of G, denoted by G°, is a graph with vertex setV(G), such 
that two vertices are adjacent in G° if and only if they are non-adjacent in G. 
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D2: A clique in G is a set of vertices of G, all pairwise adjacent. 


D3: A stable set in G is a set of vertices, all pairwise non-adjacent. Thus, S C V(G) 
is a stable set in G if and only if it is a clique in G®. 


D4: The largest size of a clique in G, the clique number of G, is denoted by w(G), 
and the largest size of a stable set, the stability number of G, by a(G). 


D5: The chromatic number of G, denoted by x(G), is the smallest number k for 
which the vertices of G can be colored with & colors, so that no two adjacent vertices 
receive the same color. 


D6: Given a graph H, we say that H is an induced subgraph of G if V(H) C V(G), 
and uv € E(#) if and only if wv € E(G) for every u,v € V(A). 


D7: Given a graph H, we say that the graph G is an H-free graph if no induced 
subgraph of G is isomorphic to H. For X C V(G), the subgraph of G induced by X is 
denoted G|X. 


D8: A component of G is a maximal connected subgraph of G. 
D9: We say that the graph G is anticonnected if its complement G° is connected. 


D10: An anticomponent of G is a maximal anticonnected induced subgraph of G. 


2.5.1 Perfect Graphs 


Let G be a graph. It follows immediately from the definitions of the clique number 
and the chromatic number that y(G) > w(G). It is then natural to ask: for what 
graphs G does equality hold, that is, when is y(G) = w(G)? It turns out that the 
two parameters are equal in many natural classes of graphs: bipartite graphs (these 
are graphs whose vertex set is the union of two stable sets), complements of bipartite 
graphs [Kol16], comparability graphs [Di50] (these are graphs whose vertices are elements 
of a given partially ordered set P, and two elements are adjacent if and only if they are 
comparable in P), and many others. However, here is an unfortunate example: take 
a graph G’, and let G be the union of G’ with a complete graph on |V(G’)| vertices. 
Clearly y(G) = w(G) = |V(G’)|, and yet we have not learned anything about the 
structure of the subgraph G’ of G. 

Thus, to get a nice answer, we need to modify the question a little. In 1961 Claude 
Berge came up with what seems to be right modification, the notion of a “perfect graph” 
[Be61]. 


DEFINITIONS 


D11: A graph G is a perfect graph if y(H) = w(H) for every induced subgraph H 
of G. A graph that is not perfect is called imperfect. 


D12: A cycle of length n (where n > 3 is an integer) is the graph with vertex set 
{U1,---,Un}, such that v; is adjacent to v; if and only if |i — 7] = 1 mod n. We denote 
this graph by C),. 
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EXAMPLES 


E1: Bipartite graphs, their complements, comparability graphs, and many other natu- 
ral classes of graphs are all perfect, but the pathological example we constructed at the 
end of the last paragraph may not be. 


These two examples of imperfect graphs will be important in what follows. 


E2: It is easy to see that if n > 5 is odd, then w(C,,) = 2 and x(C,,) = 3, and therefore 
C,, is imperfect. 


E3: Let us next consider C’,, again with n > 5 odd. In this graph, the largest clique 
has size ||, and the chromatic number is | 5] (since the size of the largest stable set 
is two, and so at most two vertices can be colored with a given color), so CS is also 


imperfect. 


Fact F1 arose as a conjecture of Berge. It is now known as the Weak Perfect 
Graph Theorem. 


FACT, FIRST CONJECTURED BY BERGE 


F1: (Lovadsz [Lo72]). A graph G is perfect if and only if G° is perfect. 


DEFINITIONS 


D13: Let G be a graph, and H be an induced subgraph of G. We say that H is a hole 
if H is isomorphic to C, for some integer n > 4; moreover, H is an odd hole if n is 
odd, and an even hole if n is even. 


D14: Similarly, H is an antthole if H is isomorphic to C% for some integer n > 4; 
also H is an odd antthole if n is odd, and an even antthole if n is even. 


D15: Let us say that a graph is a Berge graph if G has no odd holes and no odd 
antiholes (this terminology is due to Chvatal). 


Fact F2 also arose as a conjecture of Berge. Since every induced subgraph of a 
perfect graph is perfect, the “only if” direction of Fact F2 follows immediately. The 
“if” direction remained open for more than 40 years, until it was proved in the early 
2000s. This fact is commonly known as the Strong Perfect Graph Theorem. 


ANOTHER FACT, ALSO FIRST CONJECTURED BY BERGE 


F2: (Chudnovsky, Robertson, Seymour, and Thomas [ChRoSeTh06]). A graph G is 
perfect if and only if it has no odd holes and no odd antiholes. 
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Outline of the Proof of the Strong Perfect Graph Theorem 


Since its introduction by Berge, the theory of perfect graphs became a very active area 
of graph theory (largely motivated by the attempts to prove Fact F2), and so we will 
spend some time here discussing the ideas of that proof. The key idea is to “describe 
all Berge graphs”. 

The main part of the proof of Fact F2 is devoted to proving a theorem that says 
that every Berge graph is either “basic” (meaning that it belongs to a well-understood 
class of graphs, all of whose members are perfect), or admits a “useful decomposition” 
(this is a decomposition that cannot occur in an imperfect Berge graph with a minimum 
number of vertices). Let us call this a “decomposition theorem” . 

Any theorem of this form would imply Fact F2; let us quickly run through the 
implication. Assume that Fact F2 is false; then there exists an imperfect Berge graph G 
with |V(G)| minimum. Apply the theorem to G. Then G is either basic (and therefore 
perfect, which is a contradiction), or admits a useful decomposition, which is again a 
contradiction by the definition of usefulness. The fact that a decomposition theorem of 
this form should exist had been a growing belief in the field for a number of years prior 
to the proof of Fact F2, and was finally formulated, but not published, as a conjecture 
by Cornuéjols, Conforti, and Vuskovié. 

Let us now make this more precise. 


DEFINITIONS 


D16: Given a graph H, its line graph L(#) is the graph with vertex set E(H), and 
ef € E(L(#)) if and only if the edges e and f share an endpoint in H. 


D17: We say that G is a double-split graph if V(G) can be partitioned into four sets 
{a1,...,Qm}, {b1,.--, 0m}, {a1,---; en}, {di,..., dn} for some m,n > 2, such that: 
e a; is adjacent to bj for 1 <i <™m, and c; is nonadjacent to d; for 1 <j <n. 
e there are no edges between {a;,b;} and {a;,b;} for 1 <i < i’ < m, and all four 
edges between {c;,d;} and {cj-,dj'} for 1 <j <j’ <n. 
e there are exactly two edges between {a;,b;} and {c;,d;} for 1 < i < m and 
1<¥j <n, and these two edges have no common end. 


D18: Let us say that a graph G is a Berge-basic graph if either G or G* is bipartite, 
the line graph of a bipartite graph, or a double-split graph (“Berge-basic” is an ad hoc 
definition; we use it to avoid confusion with graphs that we would like to consider basic 
in other settings). It is not difficult to see (using theorems of Kénig [Kol16]) that all 
Berge-basic graphs are perfect. 


Next let us define the useful decompositions used in [ChRoSeTh06]. 


D19: A skew-partition in a graph G is a partition (A, B) of V(G) such that A is not 
connected and B is not anticonnected. Skew-partitions were first introduced by Chvatal 
[Ch85]. 


D20: A proper 2-join (a special case of a decomposition defined by Cornuéjols and 
Cunningham [CoCu85]) in G is a partition (X,,X2) of V(G) such that there exist 
disjoint nonempty A;, B; C X; (¢ = 1,2) satisfying: 
e every vertex of A; is adjacent to every vertex of Az, and every vertex of By is 
adjacent to every vertex of Bo, 
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e there are no other edges between X, and Xo, 

e for i =1,2, every component of G|X; meets both A; and B;, and 

e for i = 1,2, if |A;| = |B;| = 1 and G|X; is a path joining the members of A; and 
B;, then it has odd length > 3. 


D21: If X C V(G) and v € V(G) \ X, we say that v is X-complete if v is adjacent 
to every vertex in X, and that v is X-anticomplete if v has no neighbors in X. 


D22: If X,Y C V(G) are disjoint, we say that X is complete to Y (or the pair 
(X,Y) is complete) if every vertex in X is Y-complete; and being anticomplete to Y 
is defined similarly. 


D23: Finally, a proper homogeneous pair in G (a slight variation of a decomposition 
by Chvatal and Sbihi [ChSb87]) is a pair of disjoint nonempty subsets (A, B) of V(G), 
such that, if A,,A2, respectively, denote the sets of all A-complete vertices and all 
A-anticomplete vertices in V(G), and if By, Bz are defined similarly, then: 
e A, U Ap = By U By = V(G) \ (AU B) (and in particular, every vertex in A has a 
neighbor in B and a non-neighbor in B, and vice versa), and 
e the four sets Ai M By, A1 MN Bo, A2M Bi, Ag M Be are all nonempty. 


REMARK 


R1: Note that if G admits a skew-partition then so does G°, and the same holds for 
homogeneous pairs. However, a 2-join in G is substantially different from a 2-join in G°. 
In fact, Fact F2 uses a slight variant of the skew-partition decomposition (also invariant 
under taking complements), called a balanced skew-partition, but the definition of that 
is somewhat technical, and we omit it here. 


FACT 
We can now state the decomposition theorem of [ChRoSeTh06]. 


F3: [ChRoSeTh06] Let G be a Berge graph. Then either 
e G is Berge-basic, or 
e G admits a balanced skew-partition, or 
e G admits a proper homogeneous pair, or 
e one of G,G* admits a proper 2-join. 


REMARKS 


R2: In [Ch03, Ch06], the author showed that the proper homogeneous pair decompo- 
sition is in fact unnecessary, and Fact F3 remains true if we simply omit it. We will 
come back to this result later, in §2.5.4, since it contains a tool that has since been 
very useful in the study of induced subgraphs. The proof of Fact F3 occupies most of 
[ChRoSeTh06], which is over 150 pages long. Recently, Seymour and the author were 
able to shorten the proof of Fact F2 by about a third, proving a theorem that is similar 
to Fact F3 but uses more kinds of decompositions [ChSe09]. 
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R3: Even though Fact F3 provides enough structural information about Berge graphs 
to prove Fact F2, it is not what we would call a “structure theorem” for Berge (or perfect) 
graphs. The problem is that while Fact F3 gives a way to break (or decompose) a big 
Berge graph into smaller pieces, this decomposition cannot be “reversed”. We cannot 
use Fact F3 to prove a theorem that says “every perfect graph can be built from basic 
pieces by gluing them together via certain operations; and every graph built in this way 
is perfect”. Thus, while we have a decomposition theorem for perfect graphs, we do not 
have a structure theorem. The question of explicitly describing the structure of perfect 
graphs is currently wide open, and any progress on it would be a breakthrough in the 
area. 


2.5.2 Other Decomposition Theorems 


This subsection is devoted to more decomposition results for graphs with certain 
induced subgraphs forbidden. The theorems in this subsection have not, so far, been 
turned into explicit structure theorems. Of course, a carefully crafted decomposition 
theorem is often enough to answer a specific question about a certain class of graphs 
(just as Fact F3 was strong enough to prove Fact F2); decomposition theorems also 
often have strong algorithmic consequences. 


DEFINITION 


D24: Let F be a family of graphs. We say that a graph G is F-free if G is F-free for 
every F € F. 


Defining C to be the family of all cycles of odd length at least five, and their com- 
plements, Fact F3, with the strengthening of [Ch03, Ch06], can be restated as follows: 


FACT 


F4: Let G be a C-free graph. Then either 


e G is Berge-basic, or 
e G admits a balanced skew-partition, or 
e one of G,G* admits a proper 2-join. 


DEFINITION 


D25: A k-star cutset in a graph G is a partition (A, B) of V(G) such that A is not 
connected, and there is a clique K C B, such that |K| = k, and every vertex of B\ K 
has a neighbor in kK. A 1-star cutset is usually called a star cutset. 


We observe that every star cutset is a skew-partition, and every skew-partition is a 
2-star cutset (but the converse implications are false). In this subsection we also refer 
to a 2-join decomposition; it is a slight variant of the proper 2-joins that we have used 
so far, but we will not define it exactly. 

Next, we discuss a number of decomposition theorems for F-free graphs for various 
families F. Here we use the word “basic” loosely, to mean that a graph belongs to some 
well-understood explicitly constructed class of graphs, all of which are F-free for the 
family F in question. 
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DEFINITION 


D26: Let 6B be the family of all cycles C, where n is not divisible by 4; the B-free 
graphs are called balanced graphs. 


FACT 


F5: Conforti and Rao [CoRa92] proved a decomposition theorem for B-free graphs, 
where they showed that every such graph is either basic (“strongly balanced” ) or admits 
a 2-join, or an “extended star cutset” (this is a variation on the star cutset theme). 


This was a ground-breaking result, the first theorem of the kind. In particular it 
lead to a polynomial-time recognition algorithm for the class of balanced graphs. 
Here are more results of a similar flavor. 


DEFINITIONS 


D27: Let Foaq be the family of cycles C,, with odd n > 3, and Feven the family of 
cycles C, with even n. An Foaq-free graph is called odd-hole-free, and an Feyen-free 
graph is called even-hole-free. 


FACTS 


F6: [CoCoVu04]. Every odd-hole-free graph is either basic, or admits a 2-join or a 
k-star cutset, for some k < 2. 


In [CoCoKaVu02] a similar result was proved for Feyen-free graphs. More precisely: 


F7: [CoCoKaVu02] Every even-hole-free graph is either basic, or admits a 2-join or 
admits a k-star cutset, for some k < 3. 


REMARK 


R4: Fact F7 was later strengthened in [daSiVu13], where only 2-joins and star cutsets 
are used (though the list of basic classes had to be extended). Each of these two theorems 
can be used to design a polynomial-time recognition algorithm for even-hole-free graphs; 
we will discuss those later, in §2.5.5. 


Incidentally, here is another structural property of even-hole-free graphs (which is 
neither a structure theorem nor a decomposition theorem). 


DEFINITION 


D28: A vertex v in a graph G is a bisimplicial vertex if the set of neighbors of v 
in G is the union of two cliques. 


FACT 


F8: {AdChHaReSe08] Every non-null even-hole-free graph has a bisimplicial vertex. 


Star cutsets (along with other “tightly structured” cutsets) are used in the proof of 
Fact F8 as a way to reduce the problem to a smaller graph and use inductive arguments. 
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2.5.3 Structure Theorems 


In contrast with the previous subsection, here we survey some results that provide 
explicit structural descriptions of F-free graphs for some families F. In each of the 
classes of graphs now considered, the proof of the structure theorem follows a certain 
outline: first a decomposition theorem is proved, and then it is shown that the de- 
compositions can be reversed and turned into “compositions” (that is, ways to glue two 
smaller graphs in a class in such a way that the resulting graph is also in the class). One 
then analyzes the effect of repeatedly performing the composition operations, starting 
from basic graphs, and an explicit structural description emerges. Theorems 6.2 and 7.3 
of [Vul3] provide further examples of decomposition theorems that can be turned into 
compositions. 


Claw-Free Graphs 


DEFINITION 


D29: A claw is the complete bipartite graph Ky3, and a graph is claw-free if it is 
K1.3-free (in other words, no vertex has three pairwise non-adjacent neighbors). 


EXAMPLES 


E4: Claw-free graphs are a generalization of line graphs (it is not difficult to see that if 
G = L(#) for some graph H, then G is claw-free). But there are others. For example, 
the skeleton of the icosahedron (this is the unique 5-regular planar graph on 12 vertices) 
and the Schlafli graph (a highly symmetric 27-vertex graph that comes up naturally in 
the geometry of polytopes) are examples of claw-free graphs that are far from being line 
graphs. 


DEFINITIONS 
Another interesting subclass of claw-free graphs is called circular interval graphs. 


D30: Let © be a circle, and let F\,...,#, GC 4 be homeomorphic to the interval [0, 1], 
such that no two of F,,...,F, share an end-point. Now let V C © be finite, and let G 
be a graph with vertex set V in which for distinct u,v € V, u is adjacent to v if and 
only if u,v € F; for some 7. Such a graph G is called a circular interval graph. If 
in addition no three of F\,..., Ff, have union %, then G is a long circular interval 
graph. 


D31: A composition of strips is a generalization of a line-graph: given a graph H, 
in L(H) every edge of H is replaced by a vertex, and vertices that correspond to edges 
that share an end are made adjacent; in a composition of strips that corresponds to 
HT, every edge of H is replaced by a member of 1 of 15 prescribed families of graphs 
(a “strip”), and then certain edges are added between the subgraphs corresponding to 
edges of H that share an end. 


FACT 

The main theorem of [ChSe08] gives an explicit description of all claw-free graphs. 
To state this theorem precisely would take several pages, so let us instead describe it 
roughly. 
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F9: [ChSe08] Every connected claw-free graph G with a(G) > 4 is either a a thick- 
ening of a long circular interval graph (this is a slight generalization of a long 
circular interval graph), or a composition of strips. 


Obviously, all graphs G with a(G) = 2 are claw-free, so it remains to construct those 
claw-free graphs G for which a(G) = 3. This turns out to be the most complex part of 
both the proof and the statement of the main theorem of [ChSe08]; graphs obtained in 
certain ways from the skeleton of the icosahedron are one class of claw-free graphs with 
stability number three, but there are many others that we will not describe here. 
Quasi-Line Graphs 
DEFINITION 


D32: A graph is a quasi-line graph if each of its vertices is bisimplicial. 


EXAMPLES 
E5: Every line graph is quasi-line. 
E6: Every quasi-line graph is claw-free. 


E7: The 5-wheel (Cs, together with a vertex complete to its vertex set) is claw-free 
and not quasi-line. 


E8: Long circular interval graphs are quasi-line and not line graphs. 


FACT 


The structure of quasi-line graphs is described in [ChSe12], and it is much simpler 
than that of general claw-free graphs. The main result of [ChSe12] states that: 


F10: [ChSel2] There are only two kinds of connected quasi-line graphs: thickening of 
circular interval graphs (again, this is a slight generalization of circular interval graphs), 
and compositions of strips, with only two kinds of strips permitted. 


REMARK 


R5: In [ChP113] an explicit structural description of perfect claw-free graphs is 
given (these were originally studies in [ChSb88] and [MaRe99]). 
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Bull-Free Graphs 


DEFINITIONS 


D33: The bull is the graph with vertex set {v1, v2, v3, V4, U5} where {v2, v3, U4} is a 
clique, v; is adjacent to v2, v4 is adjacent to vs, and there are no other edges. 


M3 
Figure 2.5.1: The graph known as the bull. 


D34: A graph G is bull-free if no induced subgraph of G is isomorphic to the bull. 


REMARK 


R6: The structure of bull-free graphs was originally studied in connection with perfect 
graphs in [ChSb87], but that paper contains only decomposition theorems, and no 
structure theorem. The structure of general bull-free graphs was described in [Ch12, 
Ch12a], and we discuss it next. 


FACTS 


F11: Since the bull is isomorphic to its complement, the class of bull-free graphs is 
closed under complementation. 


F12: [Ch12, Ch12a] There are basically two types of bull-free graphs, 7, and 72, which 
are described in what follows. 


DEFINITIONS 


D35: Let us first define the substitution operation. Given disjoint graphs H; and 
HA, each with at least two vertices, and v € V(A1), we say that H is obtained from 
HI, by substituting H2 for v, or obtained from H, and Hp by substitution (when 
the details are not important) if: 


e VA) = (V(Ai) UV (Aa)) \ {0}, 

e H|V(H2) = Hp, 

e A\(V(HA;) \ {v}) = Hi \v, and 

e u € V(A)}) is adjacent in H to w € V(Az2) if and only if u is adjacent in H to v. 


D36:_ A graph is prime if it is not obtained from smaller graphs by substitution. 


Section 2.5. Structural Graph Theory 133 


FACTS 
F13: The bull is a prime graph. 


F14: Substituting one bull-free graph for a vertex of another bull-free graph results in 
yet another bull-free graph. 


If follows from Facts F13 and F14 that to understand bull-free graphs, it is enough 
to construct all prime bull-free graphs. 


EXAMPLES 
E9: Graphs with no clique of size three are bull-free. 
E10: Graphs with no stable set of size three are bull-free. 


E11: However, there are prime bull-free graphs that contain both a big clique and 
a big stable set: let nm > 3 be an integer, and let G be a graph with vertex set 
ky,...,kn,$1,---,;5n Where 


e {ki,..., kn} is a clique, 
e {51,...,8n} is a stable set, 
e for i,j € {1,...,n}, 5; is adjacent to k; if and only ifi+j > n. 


This richness of examples suggests that the structure theorem, if it exists, should be 
quite complex; and indeed it is. Once again, it would take several pages to state the 
construction explicitly, so instead we will try to give the reader a flavor of the result. 


DEFINITION 


D37: A graph F is triangle-free if w(F) < 2. 


INFORMAL DEFINITIONS 


D38: Graphs in 7; consist of a triangle-free induced subgraph F’, together with a 
disjoint union of cliquesk,,..., Ky, and the edges between V(F’) and UES KK; are care- 
fully controlled. Thus, even though graphs in 7; may have both a large clique (one of 
Ky,..., K+) and a large stable set (in F’), these two structures tend to “live” in different 
parts of the graph. 


On the other hand, in the class 73, large cliques and large stable sets happily inter- 
twine. 


D39: Let G bea graph. Let a,b € V(G) be distinct vertices, and let A = {a1,...,an} 
and B = {bi,...,6m} be disjoint subsets of V(G) such that AU B = V(G) \ {a,b}. Let 
us now describe the adjacency in G. 


e ais complete to A and anticomplete to B. 
e bis complete to B and anticomplete to A. 


e the adjacency between a and b is not specified. 
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e Ifi,j € {1,...,n}, andi < 7, and a, is adjacent to a;, then a; is complete to 
{ai4i, sane ,a;-1}, and aj is complete to {ai, ae , ait}. 


e Ifi,j € {1,...,m}, andi < j, and }, is adjacent to 6;, then b; is complete to 
{bi41,---,6;-1}, and 6; is complete to {b1,...,b;-1}. 


e Ifpe {1,...,n} and qe {1,...,m}, and a, is adjacent to bg, then a, is complete 
to {bg+1,---,bm}, and bg is complete to {ap4i,...,@n}. 


Under these circumstances we say that G is a 1-thin graph. 
D40: A variant of this is called a 2-thin graph, but we omit the definition here. 


D41: Graphs in 72 are built from 1-thin and 2-thin graphs by gluing them together in 
prescribed ways. 


D42: Another class of bull-free graphs, called 7p, consists of a few sporadic bull-free 
graphs. We omit the precise definition of 7p in order to avoid getting too technical. 


FACT 


To state Fact F15 precisely we would need to define “expansions”, but let us regard 
them in their non-technical sense, as “slight generalizations”. The main result of the 
series [Ch12, Ch12a] is the following: 


F15: [Ch12, Ch12a] Let G be a bull-free graph. Then either 
e the graph G is obtained from smaller bull-free graphs by substitution, or 
e one of the graphs G, G° is an expansion of a member of 7p U Ji U 7. 


Moreover, every graph obtained this way is bull-free. 


REMARK 


R7: In [ChPel3], a similar description of perfect bull-free graphs is obtained. 


2.5.4 Trigraphs 


In this subsection we describe an object slightly more general than a graph, called 
a trigraph. Trigraphs have proved to be a useful tool in describing structural properties 
of graphs with certain induced subgraphs excluded. Roughly, trigraphs are used to 
record the fact that the adjacency between a certain two vertices, say u and v, cannot 
be determined from the condition that the graph is F-free for the family F in question. 
In most cases that means that each of u and v can be replaced by a set of vertices 
(with certain restrictions on it), say U and V, where the adjacencies between members 
of U and the members of V are arbitrary. This in turn gives rise to notions such as 
“thickening” or “expansion” that were briefly mentioned in §2.5.4. 


Let us now be more formal. 
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DEFINITION 


D43: A trigraph G consists of a finite set V(G), called the vertex set of G, anda 
map 6: V(G)? + {—1,0,1}, called the adjacency function, satisfying: 


e for all v € V(G), Og(v,v) = 0 
e for all distinct u,v € V(G), 0¢(u,v) = Og(v, u) 
e for all u € V(G), there exists at most one v € V(G) \ {u} such that @¢(u,v) =0 


The idea is that, while a graph has two kinds of vertex pairs uv with u 4 v, adjacent 
and non-adjacent ones, a trigraph has three kinds: adjacent (for which 6(u,v) = 1), 
non-adjacent (for which 0(u,v) = —1), and semi-adjacent (for which 0(u,v) = 0). A 
good way to think of semi-adjacent vertex pairs is as vertex pairs whose adjacency is 
“undecided”. “Deciding” the adjacency of the undecided pairs results in a graph. 

A version of trigraphs was first used in [Ch03, Ch06], where the last condition of 
the present definition was omitted. However, it seems that in order to study families 
of graphs, the more restricted definition that we use here is both sufficient and much 
nicer to work with; see [Ch03, Ch06, ChSe08, ChSe12, ChP113, Ch12, Ch12a, ChPe13, 
ChkKil4, Pel3]. 


Using Trigraphs: Structure Theorems 


Let us now explain the use of trigraphs in a little more detail. Suppose that we are trying 
to understand the structure of the class of F-free graphs for some family F (denote this 
class by Gr), and we can prove a theorem of the following form: we describe a few 
classes of F-free graphs, and then say that every graph in Gr is obtained from members 
of these classes by “expanding” certain vertex pairs uv. This means that u and v are 
replaced by two disjoint sets of new vertices, U and V, respectively, with arbitrary 
adjacencies between members of U and members of V. In order for this construction to 
be explicit, we need to provide a description of all pairs (G,P), where G is a graph in 
Gy, and where P is the set of vertex pairs of G that can be expanded. To accomplish 
that, instead of working with Gr, we consider the class 7+ of F-free trigraphs; these 
are trigraphs that have the property that however the adjacency of the undecided pairs 
is decided, the resulting graph is F-free. Now we prove a similar theorem for trigraphs 
in Jz, where the vertex pairs that can be expanded are precisely the semi-adjacent 
pairs of the trigraph. This summarizes the way in which trigraphs are used in a purely 
structural setting. 


Using Trigraphs: Algorithms 


More recently, trigraphs have been used in the setting of algorithms. Algorithms that 
are based on decomposition theorems usually work as follows: take a graph G, “break 
it apart” via a decomposition given by the theorem, construct two new graphs Gj, Go, 
where each G; consist of a “piece” of G, together with a few more vertices recording 
the information about the remainder of G (let’s call this part of G; the marker for 
G;); process each of G1,G2 separately, and then put the results together to get an 
answer for G. It turns out that semi-adjacent pairs are a good way to keep track of the 
information recorded in the markers for G; and G2. Trigraphs were used in this way in 
[(ChTrTrVul4]. 
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2.5.5 Recognition Algorithms 


In this subsection, we discuss the question of testing if a given graph G is F-free for 
a given family F. Obviously this can be done in polynomial time if F is finite (just run 
through all subsets X of V(G) of size at most maxrer|V(F)| and check if the subgraph 
G|X is isomorphic to a member of F), so this question is only of interest when F is 
infinite. For brevity, let us say “testing for F” when we mean “testing if a graph is 
F-free”. 

After Fact F2 was proved, the following major question in the theory of perfect 
graphs remained open: given a graph G, test (in polynomial time) whether G is perfect. 
With Fact F2 in our ammunition bag, this turns into a question of testing if a given 
graph is Berge (we remind the reader that a graph is Berge if it is C-free, where C is the 
family consisting of all odd cycles of length at least five, and their complements). This 
was done in [ChCoLiSeVu05], and we will describe the algorithm briefly (the remainder 
of this subsection is closely based on [ChSe07]). 


DEFINITION 


D44: A pyramid is a graph consisting of a triangle {b1, bz, bs}, called the base, a vertex 
a €¢ {b1, bz, b3}, called the apex, and three paths P,, P, P3, such that for 7,7 € 1,2,3 


e the ends of P; are a and 0;, 


e ift Aj then V(P,) \ {a} is disjoint from V(P;) \ {a} and the only edge between 
them is b;b;, and 


e at most one of P,, P2, P3 has length one. 


In this case we say that the pyramid is formed by the paths P,, P2, P3. 
Let P be the family of all pyramids. 


FACTS 


F16: It is easy to see that every pyramid contains C,, for some odd n > 5, and therefore 
every Berge graph is P-free. 


It turns out that: 


F17: [ChCoLiSeVu05]. Testing for P is relatively easy and can be done in time 
O(\V(G)|?). 


This is the first step of the algorithm of [ChCoLiSeVu05]. 


Testing for Pyramids: the Shortest Paths Detector 


The idea is as follows. If G contains a pyramid, then it contains a pyramid P with the 
number of vertices smallest. We are going to “guess” (by trying all possibilities) some 
of the vertices of P in G, then find shortest paths in G between pairs of vertices that 
we guessed that were joined by a path in P, and then test whether the subgraph of G 
formed by the union of these shortest paths is a pyramid. If the answer is “yes”, then G 
contains a pyramid, and we stop. Surprisingly, it turns out, that choosing the shortest 
paths with a little bit of care, we can guarantee that if the answer is “no”, then there is 
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no pyramid in P. We call this general strategy of testing for a family F a shortest-paths 
detector for F. Let us now be more precise. 


DEFINITIONS 


D45: For u,v € V(G) we denote by dg(u,v) the length of the shortest path of G 
between wu and v. 


D46: If P is a pyramid, formed by three paths P,, P2, P3, with apex a and base 
{b1, bo, bs}, we say its frame is the 10-tuple 


a, by, bz, b3, $1, $2, $3, M1, M2, Ms 
where 
e for 7 =1,2,3, s; is the neighbor of a in P; 
e fori =1,2,3, m; € V(P;) satisfies dp,(a,m;) — dp,(mi, bi) € {0, 1} 
D47: A pyramid P in G is optimal if there is no pyramid P’ with |V(P’)| < |V(P)]. 


FACT 
F18: [ChCoLiSeVu05] Let P be an optimal pyramid, with frame 
a, by, be, 63, $1, $2, $3,7™M1, M2, M3 


Let 5), 7), be the subpaths of P; from m, to s1,b;, respectively. Let F' be the set of all 
vertices non-adjacent to each of s9, $3, bg, bs. 


1. Let Q be a path between s; and m, with interior in F’, and with minimum length 
over all such paths. Then a-s1-Q-m -T\-b, is a path (say P{), and Pj, Pz, P3 form 
an optimal pyramid. 


2. Let Q be a path between m, and 6; with interior in F’, and with minimum length 
over all such paths. Then a-s,-S1-m -Q-b, is a path (say Pj), and P;, P2, P3 form 
an optimal pyramid. 


Analogous statements hold for P3, P3. 
F19: Fact F18 can be used to design an algorithm to test for P: 


e guess the frame a, by, ba, b3, 51, $2, $3,7™1,m2,ms3 of an optimal pyramid P of G, 
by trying all 10-tuples of vertices; 


e find shortest paths between m, and 6,, and between mj, and s,, not containing 
any neighbors of s2, 83, bg, and b3; do the same for mg, bg, s2 and mz, b3, $3; 


test if the union of the six shortest paths, together with the vertex a, forms a 
pyramid. 


Now, by Fact F18, the answer is “yes” if and only if G contains a pyramid. 


REMARK 


R8: The algorithm in [ChCoLiSeVu05] is similar; it was modified a little to bring the 
running time down to O(|V(G)|’). 
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Easily Detectable Configurations; Cleaning; Finding Odd Holes 


The next idea in [ChCoLiSeVu05] is to use the shortest-path detector for odd holes. 
Unfortunately, there does not seem to be a theorem similar to Fact F18 for odd holes, 
and so, first, the graph needs to be “prepared” for using a shortest-paths detector. The 
first step is to test for P, and a few other families F that are easy to test for, and such 
that every Berge graph is F-free. Now we can assume that the graph in question is 
F-free for all these F. The next step is applying cleaning, a technique first proposed in 
[CoRa93]. The idea of cleaning is to find, algorithmically, polynomially many subsets 
X1,...,X, of V(G), such that if G contains an odd hole, then for at least one value 
of 7 € {1,...,k} the graph G; = G \ X; contains an odd hole that can be found using 
a shortest-paths detector. Finally, applying a shortest-paths detector for odd holes to 
each of Gj,...,G x, we detect an odd hole if and only if G contains one. 


REMARKS 


R9: In addition to the algorithm just described, [ChCoLiSeVu05] contains another 
algorithm to test for Bergeness, which instead of a shortest-paths detector for odd 
holes, uses a decomposition theorem for odd-hole-free graphs from [CoCoVu04], but we 
will not describe this algorithm here. 


R10: Both algorithms in [ChCoLiSeVu05] test for Bergeness, and not for the existence 
of an odd hole in a graph. The complexity of testing if a graph contains an odd hole is 
still unknown. 


Testing for Even Holes 


On the other hand, the problem of testing if a graph contains an even hole can be solved 
in polynomial time. There are three (!) known algorithms. One is due to Conforti, 
Cornuéjols, Kapoor, and Vuskovié [CoCoKaVu02a], another to Kawarabayashi, Sey- 
mour and the author [ChKaSe05], and a third one to da Silva and Vuskovié [daSiVu13]. 
All three algorithms use cleaning; the first algorithm uses a decomposition theorem of 
[CoCoKaVu02] for even-hole-free graphs, the second one is based on the shortest-paths 
detector, and the last one is again decomposition based [daSiVu13]. 


More Algorithms 


There are two other kinds of graphs that are somewhat similar to the pyramid, called 
a theta and a prism. 


DEFINITIONS 


D48: A theta is a graph consisting of two non-adjacent vertices s,t and three paths 
P,, Po, P3, each between s and t, such that the sets V(P,) \ {s,t}, V(P2) \ {s,t}, and 
V(P3) \ {s,t} are pairwise disjoint, and the union of every pair of P,, P2, P3 is a hole. 


D49: A prism is a graph consisting of two disjoint triangles {a1, a2, a3} and {b1, be, bs} 
and three paths P,, P2, P3, with the following properties: 


e for 7 =1,2,3, the ends of P; are a; and b,, 


e P,, Py, Ps are pairwise disjoint, and 


Section 2.5. Structural Graph Theory 139 


e forl<i<j <3, there are precisely two edges between V(P;) and V(P;), namely, 
aja; and b;b;. 


FACT 


F20: Let 7 be the family of all thetas, and Pr the family of all prisms. Then every 
even-hole-free graph is T UPr-free. (This is easy to check.) 


In view of Fact F20 prisms and thetas play a similar role for even-hole-free graphs 
to the one that pyramids play for odd-hole-free graphs. It turns out, however, that: 


FACTS 
F21: [MaTr05] Unlike with P, the problem of testing for Pr is NP-complete. 


F22: [LeLiMaTr09] The problem of testing for Pr is NP-complete even in graphs with 
clique number two. 


On the other hand, 
F23: [ChSel0a] Testing for J can be done in polynomial time. 
F24: [MaTr05] Testing for P UPr can be done in polynomial time. 


F25: [ChKa08] Testing for 7 UPr can be done in polynomial time. 


The Three-in-a-Tree Problem 


All the algorithms mentioned so far, except one, use variations on the ideas of cleaning 
and shortest paths detectors (or decomposition theorems), and that one exception is 
the algorithm for testing for 7. There the approach is different. In order to be able 
to test for 7, a slightly more general problem is studied: given a graph G, and three 
vertices U1, v2,v3 of G, does there exist an induced subgraph T of G, such that T is a 
tree and v1,v2,v3 € V(T)? This is the three-in-a-tree problem. 

It turns out that the answer to this question is “no” if and only if the graph admits 
a certain structure. This fact is then used to design a polynomial time algorithm for the 
three-in-a-tree problem. Now, if {v1, v2, v3} is a stable set of size three with a common 
neighbor w in G, the degree of each of v1, v2, v3 in G'\ {w} is one, and the degree of w in 
G is three, then the answer to the three-in-a-tree problem with input (G\{w}, v1, v2, v3) 
is “yes” if and only if G contains a theta using v1, v2, v3, w. 

On the other hand, if {v,, v2, v3} is a clique of size three, and no vertex of G has two 
neighbors in it, then the answer to the three-in-a-tree problem with input (G, v1, v2, v3) 
is “yes” if and only if G contains a pyramid with base {v1, v2, v3}. Thus, the algorithm 
to solve the three-in-a-tree problem can be used, after some pre-processing, to test both 
for P and for T (and this is the only algorithm known to test for 7). This result is 
particularly pleasing from the point of view of a structural graph theorist, because this 
is one of the few times that a structure (and not just a decomposition) theorem and an 
algorithm appear together in the study of graphs with forbidden induced subgraphs. 

As we have seen, the complexity of testing for F varies with F: for some families 
polynomial-time algorithms are known, while for others the problem can be shown to 
be NP-complete. An interesting open question is: what causes this difference? Can one 
characterize the families for which testing can be done efficiently? 
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2.5.6 Erdos—Hajnal Conjecture and \-Boundedness 


As the results surveyed in Subsection 2.5.5 illustrate, structure theorems for F- 
free graphs tend to be complicated to state, difficult to prove, and hard to use. At 
the moment, we are nowhere near having a structural conjecture for excluding a general 
induced subgraph. But what if we lower our sights, and ask whether excluding a general 
induced subgraph guarantees that the graph has certain special properties that a general 
graph does not possess? In 1989, Erd6s and Hajnal made a beautiful conjecture of this 
kind; it is now known as the Erdés—Hajnal Conjecture: 


CONJECTURE 


C1: [ErHa89] For every graph H, there exists a constant d(H) > 0, such that every 
H-free graph G has either a clique or a stable set of size at least |V(G)|°(”). 


FACTS 

In the same paper a partial result in this direction is proved, showing that for every 
H, H-free graphs behave differently from general graphs. It is a well-known theorem of 
Erdés that 


F26: [Er47] There exist graphs on n vertices, with no clique or stable set of size larger 
than O(log n). 


However, 


F27:  [ErHa89] For every graph H, there exists a constant c(H) > 0, such that every 
H-free graph G has either a clique or a stable set of size at least er aflegl AG). 


DEFINITION 


D50: Let us say that a graph H has the Erdés—Hajnal property if there exists a 
constant 6(H) > 0, such that every H-free graph G has either a clique or a stable set 
of size at least |V(G)|°(”). 


FACTS 
F28: Clearly, H has the Erdés—Hajnal property if and only if H° does. 
Very few graphs have been shown to have the Erdds-Hajnal property. 


F29: It is not difficult to show that all graphs on at most four vertices have the 
Erd6s-Hajnal property. 


A much more complicated argument is needed to show that: 
F30: [ChSa08] The bull has the Erdés—Hajnal property. 
In [AlPaSo01] it was shown that: 


F31: [AlPaSo01] If H1, Hz have the property, then so does every graph obtained from 
A, and Ho by substitution. 


Thus in order to prove Conjecture C1, it is enough to show that every prime graph 
has the Erdés—Hajnal property. However, this question is still open for C5, and for the 
five-vertex path. No prime graphs on at least six vertices have been shown to have the 
Erd6s—Hajnal property. 
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Tournaments 
Next we introduce another version of Conjecture Cl. 


DEFINITION 


D51: A tournament is a directed graph G where for every distinct u,v € V(G), 
exactly one of the (ordered) pairs uv and vu belongs to E(G). If ww € E(G), we say 
that u is adjacent to v. 


D52: A tournament is transitive if it has no directed cycles (or, equivalently, no 
directed cycles of length three). 


D53: For a tournament T, we denote by a(T) the largest number of vertices in a 
transitive subtournament of T. 


D54: For tournaments S$ and T, we say that T is S-free if no subtournament of T is 
isomorphic to S. 


CONJECTURE 


C2: [AlPaSo01] For every tournament S, there exists a constant 6(S) > 0, such that 
every S-free tournament T satisfies a(T) > |V(T)|°. 


FACT 


It is also shown that: 


F32: [AlPaSo01] Conjectures Cl and C2 are equivalent. 


DEFINITION 


D55: As with graphs, let us say that a tournament S has the Erdés—Hajnal prop- 
erty if there exists 6(S) > 0, such that every S-free tournament T satisfies a(T) > 
VCE) 


D56: Similarly to graphs, a tournament T is prime if there is no X C V(T) with 
1 < |X| < |V(Z)| such that for every v € V(T) \ X, either v is adjacent to every vertex 
of X, or v is adjacent from every vertex of X. 


For some reason, Conjecture C2 seems to be a little more approachable than Con- 
jecture C1: 


FACT 


F33: [BeChCh14] Unlike in the case of graphs, there is a known infinite family of 
prime tournaments, all of which have the Erdés—Hajnal property. 


We refer the reader to [Ch13] for more information about recent progress on Con- 
jecture Cl and Conjecture C2. 
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y-Boundedness 


Let us now consider another notion, related to Conjecture Cl. 


DEFINITIONS 


D57: A class of graphs G is hereditary if H € G for every G € G and every induced 
subgraph H of G. 


D58: We say that a hereditary graph G is y-bounded if there exists a function 
f :N—-N such that y(G) < f(w(G)) for every G € G. In this situation we call f a 
x-bounding function for G. 


EXAMPLES 
E12: The class of ¥-free graphs is hereditary for every family F. 
E13: The class of perfect graphs is y-bounded by the identity function. 


E14: Suppose that for some graph H the class of H-free graphs has a y-bounding 
function that is a polynomial. Then there exists t > 1 such that y(G) < w(G)* for every 
H-free graph G. Since in every coloring of G, each color class has size at most a(G), it 
follows that 

w(G)'a(G) = |V(G)| 


and so G has either a clique or a stable set of size at least |V(G)|1/(+)), and H has the 
Erdés—Hajnal property. 


FACT 


It is tempting to conjecture that the class of H-free graphs is y-bounded for every 
H. However, this is false, as shown by the following theorem of Erdés [Er59]: 


F34: [Er59] For every pair of integers k,g > 0 there exists a graph G with x(G) >k 
and no cycle of length less than g. 


Thus in order for the class of H-free graphs to be y-bounded, H must contain no 
cycles (otherwise, every graph G with no cycle of length at most |V(H)| is H-free, and 
has w(G) < 2; and by Fact F34 there exist such graphs with arbitrarily large chromatic 
numbers). A famous conjecture of Gydérfds and Sumner [Gy75, Su81] states that this 
necessary condition is in fact sufficient: 


CONJECTURE 
C3: [Gy75, Su81] For every forest F’, the class of F-free graphs is y-bounded. 


This conjecture is still open. Gydrfds [Gy75] proved that it holds when F is a path. 
Kierstead and Penrice [KiPe90, KiPe94] and Scott [Sc97] made further progress. Some 
of the theorems in Subsection 2.5.2 and Subsection 2.5.3 also imply y-boundedness 
results for certain classes of F-free graphs. We list some of them here. 
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FACTS 

F35: [AdChHaReSe08] For every even-hole-free graph G, y(G) < 2w(G) —1. This 
follows from Fact F8. Incidentally, the question of whether the class of odd-hole-free 
graphs is y-bounded is still open. 


F36: [ChSe10] If G is an induced subgraph of a connected claw-free graph G’ with 
a(G’) > 3, then y(G) < 2w(G). This follows from the main result of [ChSe08]. 


F37: [ChFr07] For every quasi-line graph G, y(G) < $w(G). This follows from the 
main result of [ChSe12]. 


For more results of this type, see [Vul3]. 


2.5.7 Well-Quasi-Ordering and Rao’s Conjecture 
DEFINITION 
D59: <A quasi-order Q consists of a class E(Q) and a transitive reflexive relation 


which we denote by < or <q; and it is a well-quasi-order or wgo if for every infinite 
sequence q; (i = 1,2...) of elements of E(Q) there exist 7 > i > 1 such that q <q qj. 


FACTS 
One of the consequences of the Robertson—Seymour graph minor project is that: 


F38: [RoSe04] The class of all graphs forms a well-quasi-order under minor contain- 
ment. 


F39: The same is not true for induced subgraphs: the sequence C3, C4, ... is an infinite 
sequence of graphs, none of which is an induced subgraph of another. 


This is disappointing, but $.B. Rao proposed the following “fix”: 


DEFINITIONS 


D60: Let us say two graphs G,G’ are degree-equivalent if they have the same vertex 
set, and for every vertex, its degrees in G and in G’ are equal. 


D61: A graph H is Rao-contained in a graph G if H is isomorphic to an induced 
subgraph of some graph that is degree-equivalent to G. 
FACT 


In the early 1980s Rao [Ra81] conjectured the following , which was proved in [ChSe13]: 


F40: [ChSe13] In any infinite set of graphs, there exist two, say G and H, such that 
HA is Rao-contained in G. 
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Outline of the Proof of Fact F40 


DEFINITION 


D62: A graph is called split if its vertex set can be partitioned into a clique and a 
stable set. 


The first part of the proof is a structural result. It states, roughly, that 
FACT 


F41: For every graph H there exists a constant cz, such that if G is a graph that does 
not Rao-contain H, then V(G) = AU BUC where 


e A,B,C are pairwise disjoint, 
e G|A is a split graph, 


e either every vertex of B has at most cy neighbors in G|B, or every vertex of B 
has at most cy non-neighbors in G|B, 


e |C| < cu, 
e the edges between A and B are tightly controlled. 


Now suppose that the class of all graphs is not a wqo under Rao-containment. Then 
there exists an infinite sequence G1, G2,... such that G; does not Rao-contain G; for 
1<i<j. In particular, for all 1 < 7, G; does not Rao-contain G;. By the structural 
result that we just mentioned, that means that all of Go,G3,... have the structure 
described in the previous paragraph. It is therefore enough to prove that the class of 
graphs with that structure, where H = G, is a wqo. Because the size of C' is bounded 
by a constant that depends only of G,, and the edges between A and B are tightly 
controlled, standard techniques allow us to reduce the problem to proving that the class 
of pairs (F, J) where F is a split graph and J is a graph with all degrees at most cq, is 
a wqo under Rao-containment. This, in turn (using Higman’s theorem [Hi52]), reduces 
to proving that 


FACTS 


F42: In any infinite set of split graphs, there exist two, say G and H, such that H is 
Rao-contained in G. 


F43: For every c > 0, in any infinite set of graphs where every vertex has at most c 
neighbors, there exist two, say G and H, such that H is Rao-contained in G. 


We will not dwell on the proof of Fact F43; let us just mention that it has recently 
been proven again using different and interesting methods in [A112]. The proof Fact F42 
though developed in a somewhat unexpected direction. It was reduced to proving that a 
certain family of directed graphs (called contests) is a wqo under a certain containment 
relation, which is very closely related to the well-known concept of immersion. This 
led to a number of new results regarding immersion of directed graphs; we will discuss 
some of them in the next subsection. 
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2.5.8 Tournament Immersion 


A directed graph H is immersed in a directed graph G if the vertices of H are mapped 
to (distinct) vertices of G, and the (directed) edges of H are mapped to directed paths 
joining the corresponding pairs of vertices of G, in such a way that the paths are pairwise 
edge-disjoint. 

More precisely, 


DEFINITION 


D63: Let G,H be directed graphs. A weak immersion of H in G is a map 7 such 
that 


e n(v) € V(G) for each v € V(H) 
e (wu) £ n(v) for distinct u,v € V(A) 


for each edge e = uv of H (this notation means that e is directed from u to v), 7(e) 
is a directed path of G from n(u) to n(v) (paths do not have “repeated” vertices) 


if e, f € E(#) are distinct, then n(e),7(f) have no edges in common, although 
they may share vertices 


If in addition we add the condition 


e ifv € V(A) and e € E(#), and e is not incident with v in H, then 7(v) is not a 
vertex of the path 7(e) 


we call the relation strong immersion. (For undirected graphs the definitions are the 
same except we use paths instead of directed paths.) 


The following is a theorem from [RoSe10], conjectured by Nash-Williams: 


FACT, CONJECTURED BY NASH-WILLIAMS 


F44: [RoSel0] The class of all graphs is a wqo under weak immersion. 


REMARK 


R11: It remains open whether the class of all graphs is a wqo under strong immersion 
(this is another conjecture of Nash-Williams). Robertson and Seymour believe that at 
one time they had a proof, but they have never written it down [RoSe11]. 


FACTS 


F45: Unfortunately, weak immersion does not provide a wqo of the class of directed 
graphs. To see this, let D, be a cycle of length 2n and direct its edges alternately 
clockwise and counterclockwise; then no member of the set {D; : i > 2} is weakly 
immersed in another. 


But what about tournaments? The main result of [ChSe11] is that: 


F46: [ChSel1] The class of all tournaments is a wqo under strong immersion. 
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The proof of Fact F46 relies on a new graph parameter, defined in [ChSe11]. 


DEFINITION 

D64: If k > 0 is an integer, an enumeration (v1,..., Un) of the vertex set of a directed 
graph has cut-width at most & if for all 7 € {1,...,n —1}, there are at most k edges 
uv such that u € {v1,...,u;} and v € {vj41,..., Un}; and a digraph has cut-width at 


most k if there is an enumeration of its vertex set with cut-width at most k. 


So if a directed graph has low cut-width, it means that it is in some sense degenerate. 


On the Proof of Fact F46 


The proof of Fact F46 consists of two ingredients. First it is shown that: 


FACT 


F47: ([ChSel1] For every tournament S there exists an integer cg, such that if a 
tournament T does not strongly immerse S, then T has cut-width at most cg. 


Similarly to the proof of Fact F40, the proof of Fact F46 now reduces to proving that 
for every c > 0, the class of tournaments of cut-width at most c is a wqo under strong 
immersion. This is a much more manageable task than proving Fact F46 directly; the 
proof can be found in [ChSel11]. 


2.5.9 Topological Containment in Tournaments 


In this subsection we discuss another containment relation on tournaments. Let 
FH be directed graphs. Then F is a subdivision of H if it can be obtained from H by 
repeatedly deleting an edge uv, adding a new vertex w, and adding two new edges uw 
and wv. We say that a directed graph G topologically contains a graph H if G has a 
subgraph isomorphic to a subdivision of H. 

It turns out that a theorem similar to Fact F47 exists for topological containment, 
except “cut-width” needs to be replaced by a more complicated parameter. The follow- 
ing was defined in [FrSe13]. 


DEFINITIONS 


D65: Given a directed graph D, a sequence W = [W,,...,W,] of subsets of V(D) is 
a path-decomposition of D if the following conditions are satisfied: 


1. Uj Wi = V(D) 

2. Wi0W, CW; forl<i<g<k<r 

3. for each edge uv € E(D), ue Wi,v € W; for some i > j 
The width of a path-decomposition is max;<;<;(|Wi| — 1). 


D66: The path-width of a directed graph D is the minimum width over all path- 
decompositions of D. 
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REMARK 


R12: Some readers may be familiar with the concept of path-decomposition and path- 
width for undirected graphs; this is, of course, the directed analogue. 


FACTS 


F48: [FrSe13] For every tournament S there exists an integer cg, such that if a tour- 
nament T does not topologically contain S, then T has path-width at most cg. 


F49:  [FrSe13] Fact F48 is then used to obtain a polynomial-time algorithm to test if 
a given tournament topologically contains a fixed tournament S. 


REMARKS 


R13: Unlike in the case of immersion, topological containment turns out not to be a 
wqo. 


R14: Some of the results mentioned here and in Subsection 2.5.10 are in fact proved for 
a wider class of directed graphs, called semi-complete digraphs, but we will not discuss 
it here. The reader is referred to [ChSe11] and [FrSe13]. 


2.5.10 Disjoint Paths Problems in Tournaments 


DEFINITIONS 


D67: Let s1,t1,...,5x,tk be vertices of a graph or directed graph G. The k edge- 
disjoint paths problem is the problem of determining whether there exist edge-disjoint 
paths P,,...,P, (directed paths, in the case of a directed graph) such that P; is from 
Sj to tj for 1 <i<k. 


D68: The vertex-disjoint paths problem is defined similarly, except that the paths 
are required to be vertex disjoint. 


FACTS 


F50: [RoSe95] For undirected graphs, both problems are solvable in polynomial time 
for all fixed k; this was one of the highlights of the Graph Minors project of Robertson 
and Seymour. 


The directed version is therefore a natural and important question, but it was shown 
by Fortune, Hopcroft, and Wyllie [FoHoWy80] that 


F51: [FoHoWy80] Without further restrictions on the input G,, both the edge-disjoint 
paths problem and the vertex-disjoint paths problem are NP-complete for directed 
graphs, even for k = 2. 


So, it becomes significant to study subclasses of directed graphs for which the prob- 
lems can be solved in polynomial time. 
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The Edge-Disjoint Paths Problem 


Let us first address the edge-disjoint version of the problem. 


FACT 


F52: [FrSel3a] The edge-disjoint paths problem in a tournament can be solved in 
polynomial time for any fixed k. 


The idea of the algorithm is as follows. If the tournament in question has high 
cut-width, then a certain structure is found that can be excised from the graph without 
changing the answer, and so the problem is reduced to a smaller graph; if the cut-width 
is low, the problem is solved using dynamic programming. Also in [FrSe13a], this result 
is extended to directed graphs for which the underlying undirected graph has bounded 
stability number. 


The Vertex-Disjoint Paths Problem 


One might hope that a similar approach would work when considering the vertex-disjoint 
paths problem in tournaments, with path-width replacing cut-width. However, this does 
not seem to be the case. The main result of [ChScSe13] is that: 


FACT 


F53: [ChScSe13] The vertex-disjoint paths problem can be solved in polynomial time 
for any fixed k in tournaments (in fact, in semi-complete digraphs). 


But the only tool used there is a version of dynamic programming. 
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Glossary for Chapter 2 


adjacency list representation — for a graph or digraph G = (V, £): an array L of |V| 
lists, one for each vertex in V; for each vertex 7, there is a pointer LD; to a linked list 
containing all vertices 7 adjacent to 7. 
adjacency matrix representation — of a simple graph or digraph G = (V, F): a|V|x|V| 
matrix A, where Ali, j] = 1 if there is an edge from vertex 7 to vertex j, and Ali, 7] = 0 
otherwise. 
adversary reconstruction number — of a graph G: the minimum number & such 
that every choice of & subgraphs from the deck of G determines G' uniquely. 
all-pairs shortest-paths problem: determining the shortest path between every pair 
of vertices in a graph. 
ally reconstruction number — of a graph G: same as the reconstruction number. 
X-anticomplete vertex in a graph G, where X C V(G): a vertex v € V(G) \ X such 
that v has no neighbors in X. For disjoint subsets X,Y C V(G), we say that Y 
is complete to X (or that the pair (X,Y) is anticomplete) if every vertex in Y is 
X-anticomplete. 
anticonnected graph: a graph G whose complement G° is connected. 
anticomponent of a graph G: a maximal anticonnected induced subgraph of G. 
antihole in a graph G: an induced subgraph H that is isomorphic to the complement 
C® of a cycle graph C,, for some integer n > 4. 
___, odd: an antihole with an odd number of vertices. 
__, even: an antihole with an even number of vertices. 
automorphism of a graph: an isomorphism from the graph to itself. 
automorphism group — of a graph: the group of automorphisms of the graph under 
the operation of functional composition. 
back edge — for a spanning tree in a directed graph: a nontree edge that joins a vertex 
to a proper ancestor. 
balanced graph: a graph all of whose induced cycles are of length divisible by 4. 
bandwidth-k graph: a graph for which there exists a vertex labeling h : V > 
{1,2,...|V|} such that {u,v} © BE => |h(u) — h(v)| < k. 
Berge graph: a graph with no odd holes and no odd antiholes (terminology due to 
Chvatal). 
Berge-basic graph: a graph G such that either G or G° is bipartite, the line graph of 
a bipartite graph, or a double-split graph. 
bidegreed graph: a graph whose vertices have only two possible degrees. 
bisimplicial vertex in a graph G: a vertex v whose set of neighbors is the union of 
two cliques. 
branch-decomposition — of a graph G = (V,£): a pair (T, f), where T is a tree in 
which every non-leaf vertex has exactly three neighbors and f is a bijection from the 
leaves of T to E. 
___, partial: a branch-decomposition in which the degree of every non-leaf vertex in 
T is at least 3. 
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branchwidth-k graph: a graph whose branchwidth is no greater than k. 
branchwidth — of a graph G: the minimum width taken over all branch-decompositions 
of G. 
breadth-first search: a systematic method for finding all vertices of a graph that 
are reachable from a given start vertex, by beginning at the start vertex and then 
visiting the unvisited vertices in a shortest-distance-from-the-start-vertex order. 
breadth-first tree: tree of all vertices reachable from a given start vertex of a graph 
during a breadth-first search. 
bull: the graph with vertex set {v1, v2, v3, v4, U5} where {v2, v3, v4} is a clique, v; is 
adjacent to v2, v4 is adjacent to vs, and there are no other edges. 
__,-free graph: a graph with no induced subgraph isomorphic to the bull. 
canonical form — of a simple graph (for isomorphism testing): an arbitrary labeling 
of the vertex set of a representative of each isomorphism type of graph. Thus, two 
graphs are isomorphic if and only if their canonical forms are identical. 
CAP: see color automorphism problem. 
certificate for isomorphism: synonym for a complete invariant. 
chordal graph: a graph that contains no induced cycles of length greater than 3. 
chromatic number \(G) of a graph G: the smallest number k for which the vertices 
of G can be colored with k colors, so that no two adjacent vertices receive the same 
color. 
circular interval graph: the intersection graph of a set of arcs on the circle. 
___, long: a circular interval graph for which no three of the arcs cover the circle. 
class edge-reconstruction number — of a graph G in a class C: the least number of 
subgraphs in the edge-deck of G which, together with the information that G is in 
the class C, guarantees that G is uniquely determined. 
class reconstruction number — of a graph G in a class C: the least number of 
subgraphs in the deck of G which, together with the information that G is in the 
class C, guarantees that G is uniquely determined. 
claw: the complete bipartite graph 1,3. 
clique in a graph G: a set of vertices of G, all pairwise adjacent. 
clique number w(G) of a graph G: the largest size of a clique in G. 
cliquewidth — of a graph: the minimum number of labels that are sufficient to construct 
a graph from isolated vertices, while using only the union, module join, and relabeling 
operations. 
cliquewidth-k graph: defined recursively as follows ([k] denotes the set of integers 
{1,25 aaghh ys 
e Any graph G with V(G) = {v} and I(v) € [k] is a cliquewidth-k graph. 
e If G; and G» are cliquewidth-k graphs and i,j € [k], then 
1. the disjoint union Gj U G2 is a cliquewidth-k graph. 
2. the graph (G);x; is a cliquewidth-k graph, where (Gj);x; is formed from 
G by adding all edges (v1, v2) such that I(v1) = 7 and I(v2) = 7. 
3. the graph (Gj);-,; is a cliquewidth-k graph, where (G',);-,; is formed from 
G by switching all vertices with label 7 to label 7. 
cograph: defined recursively as 
e A graph with a single vertex is a cograph. 
e If G; and Gp are cographs, then the disjoint union G'; U G2 is a cograph. 
e If G, and G2 are cographs, then the cross-product G, x G2 is a cograph, which is 
formed by taking the union of G; and G2 and adding all edges (v;, v2) where v1 
is in G, and vo is in Go. 


Glossary for Chapter 2 155 


color automorphism problem (CAP): the problem of finding a set of generators for 
the subgroup of color-preserving permutations, within a given permutation group 
acting on a given colored set. 

color class — for a graph: the set of all vertices that are assigned the same color. 

coloring — of a graph G: a mapping o : Ve > C from its vertex set to a set C (often 
a set of integers); alternatively, a partition o = [Ci,...,Cm] of the vertex set into 
color classes. 


__, trivial — for a graph: a coloring that assigns the same color to every vertex. 
color-preserving mapping: a graph mapping such that any two like-colored vertices 
of the domain are mapped to like-colored vertices in the codomain. 

X-complete vertex in a graph G, where X C V(G): a vertex v € V(G) \ X such 
that v is adjacent to every vertex in X. For disjoint subsets X,Y C V(G), we say 
that Y is complete to X (or that the pair (X,Y) is complete) if every vertex in Y is 
X-complete. 

complement or edge-complement G° of a simple graph G: a graph with vertex set 
V(G), such that two vertices are adjacent in G° if and only if they are non-adjacent 
in G. 

component of a graph G: a maximal connected subgraph of G. 

composition of strips — a generalization of a line-graph of H: every edge of H is 
replaced by a member of 1 of 15 prescribed families of graphs (a “strip”), and then 
certain edges are added between the subgraphs corresponding to edges of H that 
share an end. 

cover of a graph G by Ff — for a sequence F = (F}, Fo,..., F) of graphs (in which 
different F; could be isomorphic): a sequence G = (Gi, Go,...,Gz) of subgraphs of 
G (not necessarily distinct) such that (i) G; ~ Fj, 1=1,...,k and (ii) G = U;G;.; 
the number of covers of G by F is denoted by c(F, G). 

cross edge — for a spanning forest in a directed graph: a nontree edge that joins two 
vertices that are neither ancestors nor descendants of each other. 

cutwidth of a graph G: the smallest integer k such that the vertices of G can be 
arranged in a sequence v1,...,Un so that, for every 7 = 1,...,2—1, there are at most 
k edges with one endpoint in {v;,...,v;} and the other in {vj41,..., Un}. 

cycle (graph) C,, of length n (where n > 3 is an integer: the graph with vertex set 
{U1,---,Un}, such that v; is adjacent to v; if and only if |¢— j| = 1 mod n. 

deck — of a graph G: the collection D(G) of all vertex-deleted subgraphs of the graph G. 

degree of a vertex v: the number of vertices adjacent to v. 

degree sequence — of a graph G: the sequence of degrees of the vertices of G, written 
in non-descending order. 

degree-equivalent pair of graphs: two graphs with the same degree sequence. 

degree vector — of a graph coloring o = [C),...,Cm]: the vector assignment 


vrs deg,(v) = [|N(v) NC4I,...,|N(v) A Cal] 


dense graph G = (V, £): one in which the order of magnitude of |E| is close to |V|?. 
depth-first forest: set of depth-first trees formed in a depth-first search of a graph. 


depth-first search: a systematic method for visiting all vertices of a graph by begin- 
ning at a vertex, picking an unvisited adjacent vertex, and recursively continuing 
the search from that vertex. 
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depth-first tree: tree formed by tree edges discovered in a depth-first search of a 
graph. 

disjoint paths problem, edge: the problem of determining for a graph or for a 
directed graph G with vertices s,,t1,...,5,%,tx, whether there exist edge-disjoint 
paths P,,...,P, (directed paths, in the case of a directed graph) such that P; is 
from s; tot; for 1 <i<k. 

disjoint paths problem, vertex: the problem of determining for a graph or for a 
directed graph G with vertices s1,t1,...,5x,t,, whether there exist vertex-disjoint 
paths P,,...,P, (directed paths, in the case of a directed graph) such that P; is 
from s; tot; for 1 <i<k. 

double-split graph: a graph G, whose vertex-set V(G) can be partitioned into four 
sets {a1,...,@m}, {b1,---,0m}, {ar,---, en}, {di,...,dn} for some m,n > 2, such 
that 
e a; is adjacent to b; for 1 <i <™m, and c; is non-adjacent to d; for 1 <j <n. 
e there are no edges between {a;,b;} and {a;,b;} for 1 <i < «’ <™m, and all four 

edges between {c;,d;} and {c,,dj} for l<j<ji <n. 
e there are exactly two edges between {aji,bi} and {cj,d;} for 1 < i < m and 
1 <j <n, and these two edges have no common end. 

edge-contraction — of an edge e in a graph G = (V, £): an operation that results in a 
graph with edge-set F — {e} but with a vertex-set obtained by replacing (“merging” ) 
the endpoints of e in G, thus creating a new single vertex where the latter inherits 
all of the adjacencies of the pair of replaced vertices, without introducing loops or 
multiple edges. 

edge-deck — of a graph G: the collection ED(G) of all edge-deleted subgraphs of G. 

edge-deleted subgraph — of a graph G: a graph G — e obtained from G by deleting 
an edge e; also called edge-deletion subgraph. 

k-edge-deleted subgraph — of a graph G: a subgraph obtained from G by deleting k 
of its edges. 

edge-extraction — on a graph G = (V, E£): an operation that removes an edge e leaving 
the edge-deletion graph G — e. 

edge-recognizable class: a class C of graphs such that, for any graph G € C, every 
edge-reconstruction of G is also in C. 

edge-reconstructible graph: a graph G whose every edge-reconstruction is isomor- 
phic to G. 

edge-reconstructible parameter: a graph parameter P such that, for any graph G 
with parameter value p, every edge-reconstruction of G also has value p for that 
parameter. 

Edge-Reconstruction Conjecture: the conjecture that every graph on at least four 
edges is edge-reconstructible. 

edge-reconstruction number — of a graph G: the least number of subgraphs in the 
edge-deck of G which guarantees that G is uniquely determined. 

edge-reconstruction of a graph G: a graph H with the same edge-deck as G. 

edge-reconstruction problem for a structure (D,I, £) — where all the subsets 
FE — 2 are given, up to action by the group I: the question of whether E can be 
reconstructed from these subsets uniquely, again up to action by the group [. 

k-edge-reconstruction problem: the problem of determining uniquely, up to isomor- 
phism, a graph or a structure from its k-edge-deleted subgraphs or substructures. 

elementary graph: a graph in which any component is either an edge or a cycle. 

endvertex — of a graph G: a vertex whose degree is 1. 
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endvertex-deck — of a graph G: the collection of graphs G — v for all endvertices v 
of G. 

endvertex-reconstructible graph: a graph that is uniquely determined by its end- 
vertex deck. 

equitable partition: a partition of the vertices of a graph into cells such that, for any 
two vertices v,w in the same cell, and for any cell C of the partition, the vertices uv 
and w are adjacent to the same number of vertices in C. 

Erd6s—Hajnal property of a graph H: the property that there exists a constant 
6(H) > 0, such that every H-free graph G has either a clique or a stable set of size 
at least |V(G)|°). 

Floyd—Warshall algorithm: an algorithm to compute the shortest length path (or 
least cost) between vertex i and vertex j, for all vertices 7 and j. 

forward edge — for a spanning tree in a directed graph: a nontree edge that joins a 
vertex to a proper descendant. 

H-free graph: a graph G such that no induced subgraph of G is isomorphic to H. For 
a family F of graphs, a graph G is F-free if G is H-free for every H € F. 

GI: an abbreviation for the general decision problem of determining whether two graphs 
are isomorphic. 

graph invariant: a property of graphs that has the same value for any two isomorphic 
graphs. 

___, complete: an invariant that assigns different values to any two non-isomorphic 
graphs. 

p-group — for a prime p: a group whose order is a power of the prime p. 

Halin graph: a planar graph whose edge set can be partitioned into a spanning tree, 
with no vertices of degree 2, and a cycle through the leaves of this tree. 

k-HB graph: graph that yields a balanced modular decomposition when a certain 
decomposition algorithm is applied; see Definition D30 in §2.4. 

hereditary class of graphs: a class G such that every induced subgraph of a graph 
in G is also in G. 

hole in a graph G: an induced subgraph H that is isomorphic to a cycle graph C’, for 
some integer n > 4. 

__, odd: a hole with an odd number of vertices. 
__, even: a hole with an even number of vertices. 

illegitimate deck: a collection of graphs G1,G2,...,Gn, each on n — 1 vertices such 
that there is no graph G having the given collection as its deck. 

illegitimate deck problem: the problem to determine whether or not a given collec- 
tion of graphs is indeed the deck of some graph. 

immersion of a (directed) graph H in a (directed) graph G: a one-to-one mapping 
n: V(H) > V(G) along with a mapping of the (directed) edges of H to (directed) 
paths joining the corresponding pairs of vertices of G, in such a way that the paths 
are pairwise edge-disjoint. 

__, strong: an immersion such that if v € V(H) and e € E(#), and if e is not 
incident with v in H, then n(v) is not a vertex of the path n(e). 

imperfect graph: a graph that is not perfect. 

incidence matrix, representation — of a simple graph G = (V, E): a |V| x |E| matrix 
I, where I[v, e] = 1 if e is incident on v and 0 otherwise. 

incidence matrix representation — of a simple digraph G = (V, E): a |V| x |E| matrix 
I, where 
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—1_ if edge e is directed to vertex v 
I[v,e]= 41 _ if edge e is directed from vertex v 
0 otherwise 


induced subgraph of a graph G: a graph H such that V(H) C V(G), and uv € E(#) 
if and only if ww € E(G) for every u,v € V(H). For X C V(G), the subgraph of G 
induced by X is denoted G|X. 

ISO: another abbreviation for the general decision problem of determining whether two 
graphs are isomorphic. 

isomorphic graphs: two graphs G and H, such that there is an isomorphism G > H. 

isomorphism of labeled graphs G and H: an isomorphism ¢: G —> H, such that 
for each v € Va, the vertices v and ¢(v) have the same label. 

isomorphism of simple graphs: a vertex bijection that preserves adjacency relation- 
ships. 

isomorphism-complete problem: a decision problem that is polynomial-time equiv- 
alent to GI. 

Kleene closure of a set of strings S: the set S* = U%)S". 

Kleene’s algorithm: an algorithm for constructing a regular expression that describes 
all paths between every pair of vertices in a labeled graph. 

labeled graph: a graph whose vertices and/or edges are labeled, possibly with repeti- 
tions, using symbols from a finite alphabet. 

line graph L(H) of a graph H: the graph with vertex set E(H), such that ef € 
E(L(#)) if and only if the edges e and f share an endpoint in H. 

linear-time algorithm: algorithm that runs in O(V + EF) time for input graph G = 
(V, EB). 

minor — of a graph G: a graph that can be obtained from G by a finite sequence of 
edge-extraction and edge-contraction operations. 

module: with respect to a subgraph, a set of vertices that share exactly the same 
neighbors outside this subgraph. 

monomorphism with forbidden X — of simple graphs G and H, where X is a subset 
of the edges of G: a bijection of V such that if {u,v} is an edge in E(G) — X then 
{f(u), f(v)} is also an edge in H, but if {u,v} is an edge in X then {f(u), f(v)} is 
not an edge in H. The number of monomorphisms from G to H with forbidden X 
is denoted by [H]q\ x- 

monomorphism — of simple graphs G and H: a one-to-one function f : Ve — Vy 
such that if {u,v} is an edge of G, then {f(u), f(v)} is an edge of H. The number 
of monomorphisms from G to H is denoted by [H]a. 

nauty: the name of a practical computer program for use in graph isomorphism testing. 
(The name is a quasi-acronym for “no automorphisms, yes” .) 

neighborhood — of a vertex v of a graph: the set of all vertices adjacent to v. It is 
denoted by N(v). 

k-NLC (node-label-controlled) graph: defined recursively as follows ([&] denotes 
the set of integers {1,2,...,4}, and B denotes a bipartite graph on [k] x [k]): 

e Any graph G with V(G) = {v} and I(v) € [k] is a K-NLC graph. 

e If G, and G2 are k-NLC graphs and i, 7 € [k], then the join G; x g G2 is a k-NLC 
graph, where G' x p G2 is formed from GU G2 by adding all edges (v1, v2) where 
vy € Vy, U(vy) = 4; ve € Vo, lve) = 7, and (i, 7) is an edge in Ep. 

e The graph (G1);,; is a k-NLC graph, which is formed from G by switching all 
vertices with label 7 to label 7. 
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nondeterministic finite automaton: a directed graph (possibly with multiple edges) 
between the same pair of vertices, having a distinguished start state, a set of final 
states, and labels on the edges. 

N-reconstructible digraph: a digraph D such that the set of triples (D—v;, deg;,,(v;), 
deg,.4(v:)), for all vertices v; of D, is sufficient information to determine D uniquely. 

order — of an edge e in T in a branch-decomposition (T, f) of a graph G = (V,E): 
the number of vertices v € V such that there exist leaves /; and lz of T residing in 
different components of T — e, where f(J1) and f(Iz) are both incident on v. 

partial k-tree: subgraph of a k-tree. 

partition refinement (for isomorphism testing): any process of making a partition of 
the vertex sets finer (i.e., breaking cells of partitions into smaller cells) by detecting 
combinatorial differences between the vertices. 

path in a graph: a sequence of edges (v1, v2), (V2, U3),---; (Un—1; Un)- 

path-decomposition: a tree-decomposition whose tree is a path. 

pathwidth-k graph: a graph that has pathwidth no greater than k. 

pathwidth — of a graph G: the smallest width taken over all path-decompositions of G; 
measures how closely the graph resembles a path. 

perfect graph: a graph G such that x(H) = w(H), for every induced subgraph H of 
G. 

peripheral vertex of a tree: a vertex that has maximum distance from the center of 
the tree. 

polynomial deck: the collection (multi-set) of the characteristic polynomials of all 
subgraphs in the deck. 

polynomial-time algorithm: an algorithm that runs in O((V + E)*) time for input 
graph G = (V, £) for some constant k. 

prime graph: a graph that cannot be obtained from smaller graphs by substitution. 

prism (graph): a graph consisting of two disjoint triangles {a1, a2, a3} and {b1, be, bs} 
and three paths P;, Pj, P3, with the following properties: 
e for 7 =1,2,3, the ends of P; are a; and b;, 
e P,, Po, P3 are pairwise disjoint, and 
e forl<i<jyj <3, there are precisely two edges between V(P;) and V(P;), namely, 

aja; and b4b;. 

property A; — of a graph G: the property that whenever A and B are distinct k-sets 
of vertices of G, the graphs G — A and G — B are not isomorphic. 

pyramid: a graph consisting of a triangle {b1,b2,b3}, called the base, a vertex a ¢ 
{b1, b2, bs}, called the apex, and three paths P,, P2, P3, such that for i,7 € 1,2,3 
e the ends of P; are a and 0;, 
e ifi Aj then V(P,) \ {a} is disjoint from V(P;) \ {a} and the only edge between 

them is b;b;, and 

e at most one of P,, P2, P3 has length one. 

In this case we say that the pyramid is formed by the paths P,, P2, P3. 


__, frame of: the 10-tuple 


a, b1, be, b3, 81, 82, $3, M1, M2,mM 
where ? 1; 25 35 1; 25 35 1; 25 35 


e for i =1,2,3, s; is the neighbor of a in P; 


e fori =1,2,3, m; € V(P;) satisfies dp,(a,m;) — dp,(mi, b;) € {0,1}. 
__, optimal: a pyramid P such that there is no pyramid P’ with iV(P’)| < |V(P)|. 
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quasi-line graph: a graph in which every vertex is bisimplicial. 
quasi-order Q consists of a class E(Q) and a transitive, reflexive relation which we 
denote by < or <g. 
___, well or wqo: a quasi-order in which for every infinite sequence q; (i = 1,2...) of 
elements of E(Q), there exist 7 > i > 1 such that q <g q;. 
Rao-contained graph in a graph G: a graph H that is isomorphic to an induced 
subgraph of some graph that is degree-equivalent to G. 
recognizable class of graphs: a class C of graphs such that, for any G € C, every 
reconstruction of G is also in C. 
reconstructible graph: a graph G whose every reconstruction is isomorphic to G. 
reconstructible parameter: a graph parameter P such that, for any graph G with 
the value p for that parameter, every reconstruction of G also has parameter value p. 
Reconstruction Conjecture: the conjecture that every graph with at least three 
vertices is reconstructible. 
reconstruction index — of a group [: the smallest number t such that for any E Cc D 
with |E| >t, the structure (D,T, £) is edge-reconstructible. 
reconstruction number — of a graph G: the least number of subgraphs in the deck 
of G which guarantees that G is uniquely determined. 
reconstruction of a graph G: a graph H with the same deck as G. 
recursively constructed graph class: defined by a set (usually finite) of primitive 
or base graphs, in addition to one or more operations that compose larger graphs 
from smaller subgraphs; each operation involves either fusing specific vertices from 
each subgraph or adding new edges between specific vertices from each subgraph. 
refinement of a graph coloring — an operation that yields a new coloring of the 
graph: two vertices with the same old color get the same new color if and only if 
they have the same numbers of neighbors of every old color. 
regular expression: a notation for describing a regular set by using the operators 
union, concatenation, and Kleene closure. 
series-parallel graph with distinguished terminals | and r, denoted (G,1,r) — defined 
recursively: 
e The graph consisting of a single edge (v1, v2) is a series-parallel graph (G,1,r) 
with 1 = v, and r = vo. 
e A series operation (G1,11,,71) ©s (Go,l2,r2) forms a series-parallel graph by 
identifying r, with Jj. The terminals of the new graph are |, and ro. 
e A parallel operation (G1,11,71) Op (Ge, l2,r2) forms a series-parallel graph by 
identifying 1; with lj and r; with rg. The terminals of the new graph are /; and ry. 
e A jackknife operation (G1, 1,71) ©; (Ge, l2,r2) forms a series-parallel graph by 
identifying r, with J2; the new terminals are /; and rj . 
set edge-reconstructible — graph or a parameter: a graph or a parameter that can 
be determined from the set of non-isomorphic subgraphs in the edge-deck. 
set reconstructible — graph or a parameter: a graph or a parameter that can be 
determined from the set of non-isomorphic subgraphs in the deck. 
skew-partition in a graph G: a partition (A, B) of V(G) such that A is not connected 
and B is not anticonnected. 
sparse graph G = (V, £): one in which the order of magnitude of |E| is |V| or less. 
split graph: a graph whose vertex set can be partitioned into a clique and a stable set. 
stabilization of a coloring o: the coloring that results from iterating the refinement 
process until a stable coloring is obtained. It is denoted o*. 
stable coloring: a graph coloring that is unchanged by the refinement operation. 


Glossary for Chapter 2 161 


stable set or independent set in a graph G: a set of vertices, all pairwise non- 
adjacent. 
stability number or independence number a(G) of a graph G: the largest size of 
a stable set. 
k-star cutset in a graph G: a partition (A, B) of V(G) such that A is not connected, 
such that there is a clique K C B, with |K| = k, and such that every vertex of B\ K 
has a neighbor in kK. A 1-star cutset is usually called a star cutset. 
structure: a triple (D,T, E) where D is a finite set, Tis a group of permutations acting 
on D, and E is a subset of D. 
substitution, result of of Hz for v € V(H;): the graph H such that 
e V(H) = (V(A1) U V(Aa)) \ {vr}, 
e A\V (A) — Ao, 
e A\(V(A,) \ {v}) = A, \ v, and 
e u € V(HA}) is adjacent in H to w € V(Hz) if and only if u is adjacent in H; to v. 
k-terminal recursive graph: graph that has at most k& special vertices called termi- 
nals, and that can be obtained by operations that fuse some of the terminals in its 
constituent k-terminal subgraphs. (See Definition D25 in §2.4.) 
theta (graph): a graph consisting of two non-adjacent vertices s,t and three paths 
P,, Po, P3, each between s and t, such that the sets V(P,) \ {s,t}, V(P2) \ {s, ¢}, 
and V(P3) \ {s,t} are pairwise disjoint, and the union of every pair of P;, P2, P3 is 
a hole. 
transitive closure of a graph G: a graph G* that has an edge (i, 7) if and only if 
there is a path of length 1 or more in G from 7 to j. 
tree: a connected graph with no cycles, and sometimes with a designated root. 
__, recursively defined: a graph with a single vertex r as its root r; or, a graph 
formed by joining the roots of two trees. 
k-tree (recursively defined): the complete graph K,; or, a graph constructed from a 
k-tree on n vertices by adding a vertex adjacent to all vertices of one of its Ky 
subgraphs, and only to those vertices. 
__, partial: a subgraph of a k-tree. 
tree-decomposition — of a graph G = (V,E): a pair ({X; | i € I},T), such that 
{X; | 7 € I} is a family of subsets of V and T is a tree with vertex set J such that 
© Uier Xi =V, 
e for all edges (x,y) € E there is an element 7 € I with 2, y € Xj, 
e for all triples i,j,k € I, if 7 is on the path from 7 to k in T, then X;(| Xz © Xj. 
treewidth — of a graph G: the minimum width taken over all tree-decompositions of 
G; measures how closely the graph resembles a tree. 
treewidth-k graph: a graph whose treewidth is no greater than k. 
trigraph: a graph G with a map 0: V(G)? > {—1,0, 1}, called the adjacency function, 
satisfying: 
e for all v € V(G), O¢(v,v) =9, 
e for all distinct u,v € V(G), 0e(u,v) = OG(v, 4), 
e for all u € V(G), there exists at most one v € V(G) \ {u} such that 0¢(u, v) = 0. 
tournament: a directed graph G where for every distinct u,v € V(G), exactly one of 
the (ordered) pairs uv and vu belongs to E(G). 
__, transitive: a tournament in which there are no directed cycles. 
vertex-deleted subgraph — of a graph G: a graph G — v obtained from G by deleting 
a vertex v and all the edges incident to it; also called a vertez-deletion subgraph. 
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k-vertex-deleted subgraph — of a graph G: a subgraph obtained from G' by deleting 
k; of its vertices and all the edges incident to them. 

weakly edge-reconstructible graph — relative to a class C: a graph G € C such that 
every edge-reconstruction of G which is also in the class C is isomorphic to G. 

weakly reconstructible graph — relative to a class C: a graph G € C such that every 
reconstruction of G which is also in the class C is isomorphic to G. 

width, — of a branch decomposition (T, f): the maximum order of the edges of T. 

widthz — of a tree-decomposition ({X; | 7 € I},T): maxjer{|X;| — 1}. 
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INTRODUCTION 


This section extends the basic terminology and properties begun in Chapter 1, and it 
describes several classical digraph models that preview later sections of the Handbook. 
Many of the basic methods and algorithms for digraphs closely resemble their coun- 
terparts for undirected graphs. Some general references for digraphs are [ChLeZh10], 
[GrYe06], and [We01]. A comprehensive and in-depth reference for digraphs is [BaGu01]. 


3.1.1 Terminology and Basic Facts 


TERMINOLOGY NOTE: The term arc is used throughout this section instead of its syn- 
onym directed edge. 


NOTATION: Often, when the digraphs under consideration do not have multi-arcs, an 
arc that is directed from vertex u to v is represented by the ordered pair (u,v) or by 
the juxtaposition wv. 


TERMINOLOGY: An arc that is directed from vertex u to v is said to have tail u and 
head v. 


Reachability and Connectivity 


DEFINITIONS 


D1: Ina digraph, a directed walk from vo to vy is an alternating sequence 
W = (v0, €1, U1, €25 +65 Un—1; En; Un) 


of vertices and arcs, such that tail(e;) = v;_1 and head(e;) = v;, for i = 1,...,n. 
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TERMINOLOGY: A directed walk from a vertex x to a vertex y is also called an x-y 
directed walk. 


D2: The length of a directed walk is the number of arc-steps in the walk sequence. 


D3: A connected digraph is a digraph whose underlying graph is connected. Else- 
where, the term weakly connected is often used to describe such digraphs. 


D4: Let u and v be vertices in a digraph G. Then wu and v are said to be mutually 
reachable in G if G contains both a directed u-v walk and a directed v-u walk. Every 
vertex is regarded as reachable from itself (by the trivial walk). 


D5: A digraph is strongly connected if every two vertices are mutually reachable. 


D6: A strong component of a digraph G is a maximal strongly connected subdigraph 
of G. Equivalently, a strong component is a subdigraph induced on a maximal set of 
mutually reachable vertices. 


D7: Let S1,S9,...,S;, be the strong components of a digraph G. The condensation 
of G is the simple digraph G* with vertex-set Ve~ = {s1, $2,..., 8}, such that there is 
an arc in digraph G* from vertex s; to vertex s; if and only if there is an arc in digraph 
G from a vertex in component 5; to a vertex in component S;. 


EXAMPLE 


E1: Figure 3.1.1 shows a digraph G, its four strong components, $1, $2, 53,54, and its 
condensation G*. Notice that the vertex-sets of the strong components of G partition 
the vertex-set of G and that the edge-sets of the strong components do not include all 
the edges of G. This is in sharp contrast to the situation for an undirected graph G, in 
which the edge-sets of the components of G partition Eg. 


b ¥ ‘ c 
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Figure 3.1.1: A digraph, its four strong components, and its condensation. 


FACT 


F1: Let G be a digraph. Then the mutual-reachability relation is an equivalence 
relation on Vg, and the strong components of digraph G are the subdigraphs induced 
on the equivalence classes of this relation. 
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Measures of Digraph Connectivity 


We introduce a few basic measures of the connectedness of a digraph. Connectivity 
of graphs and digraphs is discussed extensively in $4.1 and §4.7. The concept of an 
edge-cut plays an important role in the study of flows in networks (§11.1) and in certain 
algebraic properties of a graph or digraph (86.4). 


DEFINITIONS 


D8: <A complete digraph is a simple digraph such that between each pair of its 
vertices, both (oppositely directed) arcs exist. 


D9: A vertex-cut in a strongly connected digraph G = (V,F) is a vertex subset 
S CV such that the vertex-deletion subdigraph G — S$ is not strongly connected, and 
an edge-cut (arc-cut) is an arc subset F' C E such that the arc-deletion subdigraph 
G — F is not strongly connected. 


D10: The (vertex-) connectivity of an n-vertex non-complete digraph G = (V, £), 
denoted k,(G), is the minimum size of a vertex subset S such that G—S is neither 
strongly connected nor the trivial digraph. (The connectivity of a complete n-vertex 
digraph is n — 1.) 


D11: The edge-connectivity of a non-trivial digraph, denoted «.(G), is the minimum 
size of an edge subset F' such that G — F is not strongly connected. 


NOTATION: When the context is clear, the vertex- and edge-connectivity are denoted Kk, 
and ke, respectively. Some other sections of the Handbook use the “traditional” « and 
A instead of Ky and Ke, respectively. 


TERMINOLOGY NOTE: Synonyms for vertex-cut are cut and disconnecting set. Syn- 
onyms for edge-cut are edge-disconnecting set (or arc-disconnecting set) and cut- 
set. 


Directed Trees 


DEFINITIONS 
D12: A directed tree is a digraph whose underlying graph is a tree. 


D13: A rooted tree is a directed tree having a distinguished vertex r, called the root, 
such that for every other vertex v, there is a directed r-v path. 


TERMINOLOGY NOTE: Occasionally encountered synonyms for rooted tree are out-tree, 
branching, and arborescence. 


REMARKS 


R1: Since the underlying graph of a rooted tree is acyclic, the directed r-v path is 
unique. 


R2: Designating a root in a directed tree does not necessarily make it a rooted tree. 
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Tree-Growing in a Digraph 


Algorithm 3.1.1, shown below, is simply the basic tree-growing algorithm of §1.1 (Al- 
gorithm 1.1.1), recast for digraphs. Its output, as in Algorithm 1.1.1, is a rooted tree 
whose vertices are reachable from the starting vertex. But because the paths to these 
vertices are directed (i.e., one-way), the vertices in this output tree need not be mutually 
reachable from one another. 


DEFINITION 


D14: A frontier arc for a rooted tree T in a digraph is an arc whose tail is in T and 
whose head is not in T. 


Algorithm 3.1.1: Basic Tree-Growing in a Digraph 


Input: a digraph G and a starting vertex uv € Va. 
Output: a rooted tree T’ with root v and a standard vertex-labeling of T. 


Initialize tree T as vertex v. 

Write label 0 on vertex v. 

Initialize label counter i := 1 

While there is at least one frontier arc for tree T 
Choose a frontier arc e for tree T. 
Let w be head(e) (which lies outside of T). 
Add arc e and vertex w to tree T. 
Write label 7 on vertex w. 
t:=i4+1 

Return tree T and vertex-labeling of T’. 


COMPUTATIONAL NOTE: We assume that there is some implicit default priority for 
choosing vertices or edges, which is invoked whenever there is more than one frontier 
arc from which to choose. 


EXAMPLE 


E2: Figure 3.1.2 shows a digraph and all possible output trees that could result for 
each of the different starting vertices and each possible default priority. Two opposite 
extremes for possible output trees are represented here. When the algorithm starts 
at vertex u, the output tree spans the digraph. The other extreme occurs when the 
algorithm starts at vertex x (because x has outdegree 0). Notice that any two output 
trees in Figure 3.1.2 with the same vertex-set have roots that are mutually reachable. 


u 
¥ “ ul u 2 y 
x v 
Vv Vv 7 PS y x Vv 
Z Z y x ‘ Z 
y y x 
y 2 


Figure 3.1.2: A digraph and all possible output trees. 
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FACTS 


F2: Let u and v be two vertices of a digraph G. Then u and v are in the same strong 
component of G if and only if the output trees that result from starting Algorithm 3.1.1 
at vertex u and at vertex v have the same vertex-set. 


F3: Ifthe digraph G is strongly connected, then the output tree is a spanning rooted 
tree of G, regardless of the starting vertex. 


REMARK 


R3: Example E2 above illustrates an important distinction between undirected and 
directed graphs: whereas tree-growing in an undirected graph provides a simple algo- 
rithm to determine the components of the graph, in a digraph this is not the case. Other 
differences were suggested earlier in Example El. The use of tree-growing, specifically 
depth-first search (§10.1), in finding the strong components of a digraph is considerably 
more intricate than its undirected counterpart. For discussions of strong-component- 
finding algorithms, see, e.g., [BaGe99], [GrYe06, §11.4], and [St93]. 


Oriented Graphs 


DEFINITIONS 


D15: An oriented graph is a digraph obtained by choosing an orientation for each 
edge of an undirected simple graph. Thus, an oriented graph does not have both oppo- 
sitely directed arcs between any pair of vertices, which means that an oriented tree is 
the same as a directed tree. 


D16: A tournament is an oriented complete graph. That is, it has no self-loops, and 
between every pair of vertices, there is exactly one arc. See §3.3 for extensive coverage 
of tournaments. 


D17: A graph G is strongly orientable if there exists an assignment of directions to 
the edge-set of G such that the resulting digraph is strongly connected. 
EXAMPLE 


E3: Of the three graphs shown in Figure 3.1.3, only the graph G2 is strongly orientable. 


G, Gy G, 


Figure 3.1.3: Only the graph G2 is strongly orientable. 
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Notice that G2 is the only graph in the example that does not have a cut-edge. In 
fact, the absence of cut-edges is a necessary and sufficient condition for a graph to be 
strongly orientable. This characterization of strongly orientable graphs was proved by 
H. E. Robbins in 1939. 


FACT 


F4: Robbins’s Theorem [Ro39] A connected graph G is strongly orientable if and 
only if G has no cut-edges. 


Adjacency Matrix of a Digraph 


DEFINITION 
D18: The adjacency matrix of a digraph G = (V, FE), denoted Ag, is given by 


i= fe number of arcs from utov ifuA#v 


the number of self-loops ifu=v 


FACTS 


F5: A row-sum in a directed adjacency matrix equals the outdegree of the correspond- 
ing vertex, and a column-sum equals the indegree. 


F6: Let G be digraph with adjacency matrix Ag. Then the value of the entry AG[u, v] 
of the rt power of matrix Ag equals the number of directed u-v walks of length r. 


EXAMPLE 


E4: The adjacency matrix of the digraph in Figure 3.1.4 uses the vertex ordering 
u,v, w,x. As an illustration of Fact F6, observe that the number of directed w-v walks 
equals 3, which is the (w,v)-entry of A2,. 


U vVvWwW 

ufO0101 

Vv 0 1 0 0 

G AG, 111 £0 
x\0 0 0 0 


Figure 3.1.4: A digraph and its adjacency matrix. 


REMARK 


R4: Another matrix representation of a digraph is the node-arc incidence matrix, where 
the columns are labeled by the arcs, and the rows are labeled by the vertices (nodes). 
Connections between the incidence matrix of a graph or digraph and the structural 
properties of the graph are explored in §6.4. 
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3.1.2 A Sampler of Digraph Models 


In this subsection, we sample a few of the digraph models. Acyclic digraph models 
are the focus of §3.2. 


Markov Chains and Markov Digraphs 


The topic of Markov processes is part of a more general area known as stochastic pro- 
cesses, a branch of mathematics and operations research with far-ranging applications 
and theoretical challenges. The reader may consult any of the standard texts in this 
subject for a formal presentation of these concepts (e.g., [Ci75], [Wi94]). 


DEFINITIONS 


D19: A sequence of random variables {X,}, t = 0,1,2,..., isa (finite) discrete-time 
Markov chain (DTMC) on a state-space S = {1,2,...,n} if X, € S for all times 
t = 0,1,2,..., and the probability distribution of X;4; depends only on the value of 
X,. In particular, 


prov( Xt41 = g|Xt = a, Xt-1 = lt-1, eae , Xo => io) = prov( Xt41 = g\|Xt = i) 


D20: A stationary DTMC satisfies the additional condition that for all states 
i,j € S and all times t, the transition probability prob(Xi41 = j|Xt = 1) = pi; is 
independent of t. 


D21: A Markov digraph G = (V, E) of a stationary DTMC with state-space S and 
transition probabilities p;; is a digraph with vertex-set V = S, arc-set E = {ij|p;; > O}, 
and to each arc ij € FE is assigned the probability p;;. 


D22: The transition matrix of a Markov chain is the matrix whose ij*" entry is the 
transition probability p;;. 


EXAMPLE 


E5: A Gambler’s Problem: A gambler starts with $3 and plays the following game. 
Two coins are tossed. If both come up heads, then he wins $3; otherwise, he loses 
$1. He plays until either he loses all his money or he reaches a total of at least $5. 
Let X; be the amount of money he has after t plays, with Xo = 3. The state space 
is S = {0,1,2,3,4,5}, and the sequence {X;} is a discrete-time Markov chain. The 
transition matrix and Markov digraph for this Markov chain are shown in Figure 3.1.5. 


0 1 2 38 4 35 
0 1 0 O O 0 0 
1 7 O OO O 25 O 
2 0 7 O O O. .25 
3 0 O 75 O O- .25 
4 0 O O 75 O- .25 
>5\0 0 0 0 O 1 


Figure 3.1.5: Gambler’s transition matrix and Markov digraph. 
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Equipment-Replacement Policy 


We present a digraph model that can be used to determine a replacement policy that 
minimizes the net cost of owning and operating a car for a pre-specified number of years. 


EXAMPLE 


E6: Suppose that today’s price for a new car is $16,000, and that the price will increase 
by $500 for each of the next four years. The projected annual operating cost and resale 
value of this kind of car are shown in the table below. To simplify the setting, assume 
that these data do not change for the next five years. 


Annual Operating Cost Resale Value 


$600 (for 1st year of car) $13,000 (for a 1-year-old car) 
$900 (for 2nd year of car) $11,000 (for a 2-year-old car) 
$1200 (for 3rd year of car) $9,000 (for a 3-year-old car) 
$1600 (for 4th year of car) $8,000 (for a 4-year-old car) 
$2100 (for 5th year of car) $6,000 (for a 5-year-old car) 


Digraph Model: The digraph has six vertices, labeled 1 through 6, representing the 
beginning of years 1 through 6. The beginning of year 6 signifies the end of the planning 
period. For each 7 and j with i < j, an arc is drawn from vertex 7 to vertex j and 
is assigned a weight c;;, where c,; is the total net cost of purchasing a new car at the 
beginning of year 7 and keeping it until the beginning of year 7. Thus, 


price of new car at beginning of year 7 


Cij = + sum of operating costs for years 1,2 +1,...,7-—1 
— resale value at beginning of year 7 


Figure 3.1.6 shows the resulting digraph with 7 of its 15 arcs drawn. The arc-weights 
are in units of $100. 


© © 


Figure 3.1.6: Part of the digraph model for a car-replacement problem. 


The problem of determining the optimal replacement policy is reduced to finding the 
shortest (least-cost) path from vertex 1 to vertex 6. This is a simple task for Dijkstra’s 
algorithm, even for much larger instances of this kind of problem. Dijkstra’s algorithm 
is discussed in §10.1. 
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The Digraph of a Relation and the Transitive Closure 


Our focus here is on general relations and their transitive closure. Digraphs of posets 
(partially ordered sets) are discussed in §3.2. 


DEFINITIONS 
D23: A relation R on a finite set S is a subset of the cartesian product S x S. 


D24: The digraph representation of a relation R on a finite set S is the digraph 
whose vertices correspond to the elements of S, and whose arcs correspond to the ordered 
pairs in the relation; that is, an arc is drawn from vertex x to vertex y if (x,y) € R. 


Conversely, a digraph G induces a relation R on Vp in a natural way, namely, (x,y) € R 
if and only if there is an arc in digraph G from vertex x to vertex y. 


D25: A transitive digraph is a digraph whose corresponding relation is transitive. 
That is, if there is an arc from vertex x to vertex y and an arc from y to z, then there 
is an arc from z to z. 


D26: The transitive closure R* of a binary relation R is the relation R* defined by 
(x,y) € R* if and only if there exists a sequence x = U0, V1, V2,.--,Ur = y such that 
k > 1 and (v;, vi41) € R, fori =0,1,...,4 — 1. Equivalently, the transitive closure R* 
of the relation R is the smallest transitive relation that contains R. 


D27: Let G be the digraph representing a relation R. Then the digraph G* representing 
the transitive closure R* of R is called the transitive closure of the digraph G. Thus, 
an arc (a,y), « # y, is in the transitive closure G* if and only if there is a directed x-y 
path in G. Similarly, there is a self-loop in digraph D* at vertex x if and only if there 
is a directed cycle in digraph G that contains x. 


EXAMPLES 


E7: Suppose a relation R on the set S = {a, b,c, d} is given by 


{(a, 4), (4,6); (6, €); (6), (4 @)} 


Then the digraph G representing the relation R and the transitive closure G* are as 
shown in Figure 3.1.7. 


G G* 


Figure 3.1.7: The digraph G and its transitive closure G*. 
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E8: ‘Transitive Closure in a Paging Network: Suppose that the arcs of an n-vertex 
digraph G represent the one-way direct links between specified pairs of nodes in an 
n-node paging network. Thus, an arc from vertex 7 to vertex j indicates that a page 
call can be transmitted from person 7 to person 7. 


To send an alert from person 7 to person J, it is not necessary to have a direct link from 
i to j. There need only be a directed i-j7 path. The transitive closure G* of digraph G 
specifies all pairs 7,7 of vertices for which there exists a directed i-7 path in G. 


Constructing the Transitive Closure of a Digraph: Warshall’s 
Algorithm 


Let G be an n-vertex digraph with vertices v1, v2,...,Un. A computationally efficient 
algorithm, due to Warshall [Wa62], constructs a sequence of digraphs, 


Do, D1,--+5 Pn; 


such that Do = G, Dj;_-1 is a subgraph of D;, 1 = 1,...,n, and such that D,, is the 
transitive closure of D. Digraph D; is obtained from digraph D;_-1 by adding to Dj, 
an arc (vj, Ux) (if it is not already in D;_1) whenever there is a directed path of length 
2 in D;-1 from v; to vg, having v; as the internal vertex (see Figure 3.1.8). 


A related algorithm of Floyd [F162] determines the shortest distance between all pairs 
of vertices in an edge-weighted digraph. 


Vj Vj Vk Vi V; Ve 
Ce eee 
D.., D; 


Figure 3.1.8: The arc (v;,v,) is added to digraph D,_1. 


Algorithm 3.1.2: Warshall’s Transitive Closure [Wa62] 


Input: an n-vertex digraph D with vertices v1, v2,...,Un. 
Output: the transitive closure of digraph D. 

Initialize digraph Do to be digraph G. 

Fori=1 ton 

For 7 =1 ton 
If (v;,v;) is an arc in digraph D;_1 
Fork =1ton 
If (v;, Ue) is an arc in digraph Dj;-1 
Add arc (vj, vx) to Dj_1 (if it is not already there). 
Return digraph D,,. 
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Activity-Scheduling Networks 


In large projects, often there are some tasks that cannot start until certain others are 
completed. Figure 3.1.9 shows a digraph model of the precedence relationships among 
some tasks for building a house. Vertices correspond to tasks. An arc from vertex u 
to vertex v means that task v cannot start until task u is completed. To simplify the 
drawing, arcs that are implied by transitivity are not drawn. This digraph is the cover 
diagram of a partial ordering of the tasks. Section 3.2 discusses this model further and 
introduces a different model in which the tasks are represented by the arcs of a digraph. 


Activit 8 
ctivity 3 


Foundation 
Walls and ceilings 2 
Roof 6 


Electrical wiring 
Windows 1 


Siding 
Paint interior 4 
Paint exterior 5 


ANOoaRWND — 


Figure 3.1.9: An activity digraph for building a house. 


Scheduling the Matches in a Round-Robin Tournament 


Suppose that each pair of n teams is to play one match in a tournament. Typically, 
one would like to schedule the matches so that all matches are completed in a minimum 
number of days (assume that each team plays at most one match on a given day). 
If the teams are from different cities, an additional objective is to have an equitable 
distribution of home and away matches. We preview here a strategy that is discussed 
in §5.6. 


DEFINITIONS 


D28: A compact schedule for a round-robin tournament is one in which each team 
plays a match each day. 


D29: <A team is said to have a break if it is either home for two consecutive matches 
or away for two. 


D30: A proper arc-coloring of a digraph G = (V,£) is an assignment of colors 
to the arcs in G so that any two arcs that have an endpoint in common are assigned 
different colors. Graph coloring is discussed in 85.1 and 85.2, and the related concept 
of graph factorization is discussed in §5.4. 


REMARK 


R5: An algorithm for constructing a compact schedule for a n-team round-robin tour- 
nament, where n is even, that minimizes the total number of breaks is given in §5.6 
(Algorithm 5.6.1). The strategy is based on orienting the edges of a complete graph 
and then producing a proper arc-coloring so that each color is assigned to exactly n/2 
arcs. 
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Flows in Networks 


A pipeline network for transporting oil from a single source to a single sink is one proto- 
type of a network model. Each arc represents a section of pipeline, and the endpoints of 
an arc correspond to the junctures at the ends of that section. The arc capacity is the 
maximum amount of oil that can flow through the corresponding section per unit time. 
A network could just as naturally represent a system of truck routes for transporting 
commodities from supply points to demand points, or it could represent a network of 
phone lines from one distribution center to another. 


DEFINITIONS 


D31: A cost flow network G = (V,E,cap,c,b) is a directed graph with vertex-set 
V, arc-set FE, a nonnegative capacity function cap : E — N, a linear cost function 


c: E > Z, and an integral supply vector b: V + Z that satisfies )* b(w) = 0. 
wEeV 


D32: An s-t flow network G = (V, E, cap, s,t) is a directed graph (typically without 
the cost and supply functions) with a nonnegative capacity function cap : E > N, 
that has a distinguished vertex s, called the source, with nonzero outdegree, and a 
distinguished vertex t, called the sink, with nonzero indegree. 


D33: The maximum-flow problem is to determine the maximum flow that can be 
pushed through an s-t network from source s to sink t such that the flow into each 
intermediate node equals the flow out (conservation of flow) and the flow across any arc 
does not exceed the capacity of that arc. (See §11.1.) 


D34: The minimum-cost-flow problem is to find an assignment of flows on the arcs 
of the flow network that satisfy the supply and demand (negative supply) requirements 
at minimum cost. (See $11.2.) 


Software Testing and the Chinese Postman Problem 


During execution, an application software’s flow moves between various states, and the 
transitions from one state to another depend on the input. In testing software, one would 
like to generate input data that forces the program to test all possible transitions. 


DEFINITIONS 


D35: An eulerian tour of a digraph G is a closed directed walk that uses each arc 
exactly once. 


D36: A postman tour (or covering walk) is a closed directed walk that uses each 
arc at least once. 


D37: Given a directed edge-weighted graph G, the Directed Chinese Postman 
Problem is to find a minimum-weight postman tour. 


Digraph Model: The software’s execution flow is modeled as a digraph, where the 
states of the program are represented by vertices, the transitions are represented by arcs, 
and each of the arcs is assigned a label indicating the input that forces the corresponding 
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transition. Then the problem of finding an input sequence for which the program invokes 
all transitions and minimizes the total number of transitions is equivalent to the Directed 
Chinese Postman Problem, where all arc-weights equal one. 


REMARKS 


R6: Since certain transitions take more execution time than others, one might want 
to minimize the total time of execution during the testing (instead of the number of 
transitions). In that case, each arc is assigned a weight equal to the transition time 
corresponding to that arc. 


R7: Under certain reasonable assumptions, the flow digraph modeling a program’s 
execution can be assumed to be strongly connected, which guarantees the existence of 
a postman tour. 


R8: Eulerian digraphs and graphs, along with algorithms to construct eulerian tours, 
are discussed in detail in 84.2, and various versions of the Chinese Postman Problem 
and its algorithms are discussed in §4.3. 


Lexical Scanners 


The source code of a computer program may be regarded as a string of symbols. A 
lexical scanner must scan these symbols, one at a time, and recognize which symbols go 
together to form a syntactic token or leceme. We now consider a single-purpose scanner 
whose task is to recognize whether an input string of characters is a valid identifier in 
the C programming language. Such a scanner is a special case of a finite-state recognizer 
and can be modeled by a labeled digraph, as in Figure 3.1.10. One vertex represents 
the start state, in effect before any symbols have been scanned. Another represents the 
accept state, in which the substring of symbols scanned so far forms a valid C identifier. 
The third vertex is the reject state, indicating that the substring has been discarded 
because it is not a valid C identifier. Each arc label tells what kinds of symbols cause a 
transition from the tail state to the head state. If the final state after the input string 
is completely scanned is the accept state, then the string is a valid C identifier. 


letter 
> (accept) letter or 
digit 


any char except 
letter or digit 


non-letter 


any 
char 


Figure 3.1.10: Finite-state recognizer for identifiers. 
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3.1.3. Binary Trees 


At first glance, a discussion of binary trees does not seem to belong in a section on 
digraphs. In fact, binary trees are digraphs. In particular, they are special rooted trees. 
Here we describe a few applications. 


Rooted Tree Terminology 


DEFINITIONS 


D38: In a rooted tree, the depth or level of a vertex v is its distance from the root, 
that is, the length of the unique path from the root to v. (Thus, the root has depth 0.) 


D39: The height of a rooted tree is the length of a longest path from the root (which 
equals the greatest depth in the tree). 


D40: If vertex v immediately precedes vertex w on the path from the root to w, then 
v is the parent of w and w is the child of v. 


D41: A vertex w is called a descendant of a vertex v (and v is called an ancestor 
of w), if v is on the unique path from the root to w. If, in addition, w 4 v, then w isa 
proper descendant of v (and v is a proper ancestor of w). 


D42: An ordered tree is a rooted tree in which the children of each vertex are assigned 
a fixed ordering. 


D43: <A standard plane representation of an ordered tree is a standard plane 
drawing of the tree such that at each level, the left-to-right order of the vertices agrees 
with their prescribed order. 


D44: A binary tree is an ordered tree in which each vertex has at most two children, 
and each child is designated either a left-child or a right-child. 


Figure 3.1.11: A binary tree of height 4. 


D45: The left (right) subtree of a vertex v in a binary tree T is the binary subtree 
spanning the left (right)-child of v and all of its descendants. 


FACT 


F7: Every binary tree of height h has at most 2”*+! — 1 vertices. 
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Binary Search 


An entry in a random-access table consists of two fields. One field is for the actual data 
element, and the other one is for the key. An entry is found in a random-access table by 
searching for its key, and the most generally useful implementation of a random-access 
table uses the following information structure. 


DEFINITIONS 


D46: A binary-search tree (BST) is a binary tree, each of whose vertices is assigned 
a key, such that the key assigned to any vertex v is greater than the key at each vertex 
in the left subtree of v, and is less than the key at each vertex in the right subtree of v. 


D47: A binary tree is balanced if for every vertex, the number of vertices in its left 
and right subtrees differ by at most one. 


22 


12 21 23 40 


Figure 3.1.12: A balanced binary-search tree and an unbalanced one. 


Algorithm 3.1.3: Binary-Search-Tree Search 


Input: a binary-search tree T and a target key t. 
Output: a vertex v of T such that key(v) = t if t is found, 
or NULL if ¢ is not found. 
v := root(T) 
While (v 4 NULL) and (t 4 key(v)) 
If t > key(v) 
v := rightchild(v) 
Else v := leftchild(v) 
Return v. 


COMPUTATIONAL NOTE: Since each comparison of a binary search performed on a 
binary-search tree moves the search down to the next level, the number of comparisons 
is at most the height h of the tree plus one. If the tree is balanced, then it is not hard 
to show that the number of vertices n is between 2” and 2”+!. Hence, the worst-case 
performance of the binary search on a perfectly balanced binary-search tree is O(log2n). 
The other extreme occurs when each internal vertex of the binary tree has only one child. 
Such a binary tree is actually an ordinary linked list, and therefore the performance of 
the search degenerates to O(n). 
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INTRODUCTION 

When a digraph has no directed cycles, it is called a directed acyclic graph, or a DAG. 
While being acyclic may seem to be a stringent condition, it arises quite naturally be- 
cause vertices often have a natural ordering. For instance, vertices may represent events 
ordered in time or ordered by hierarchy. This ordering makes results and algorithms for 
DAGs relatively simple. 


3.2.1 Examples and Basic Facts 


DEFINITIONS 

D1: A digraph is acyclic if it has no directed cycles. 
D2: DAG is an acronym for directed acyclic graph. 
D3: A source in a digraph is a vertex of indegree zero. 
D4: A sink in a digraph is a vertex of outdegree zero. 


D5: A basis of a digraph is a minimal set of vertices such that every other vertex can 
be reached from some vertex in this set by a directed path. 
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EXAMPLES 


El: Operations Research. A large project consists of many smaller tasks with a 
precedence relation — some tasks must be completed before certain others can begin. 
One graphical representation of such a project has a vertex for each task and an arc from 
u to v if task u must be completed before v can begin. For instance, in Figure 3.2.1, the 
food must be loaded and the cabin cleaned before passengers are loaded, but luggage 
unloading is independent of the timing of cabin activities. This model of a project will 
always be a DAG, because if there were a directed cycle, the project could not be done: 
every task on the cycle would have to be started before every other one on the cycle. 


unload load 
Papeete passengers 
clean cabin 
e——® 
unload luggage load new luggage 


Figure 3.2.1: A digraph of precedence in an airplane stopover. 


E2: Sociology and Sociobiology. A business (or army, or society, or ant colony) has a 
hierarchical dominance structure. The nodes are the employees (soldiers, citizens, ants) 
and there is an arc from u to v if u dominates v. If the chain of command is unique, 
with a single leader, and if only arcs representing immediate authority are included, 
then the result is a rooted tree, as in Figure 3.2.2. (Also see §3.2.2.) 


Dept. Head 


Supervisors 


Office workers 


Figure 3.2.2: A corporate hierarchy. 


E3: Computer Software Design. A large program consists of many subprograms, some 
of which can invoke others. Let the nodes of D be the subprograms, and let there be 
an arc from u to v if subprogram u can invoke subprogram v. Then this call graph D 
encapsulates all possible ways control can flow within the program. Must D be a DAG? 
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No, but each directed cycle represents an indirect recursion and serves as a warning 
to the designer to ensure against infinite loops. See Figure 3.2.3, where Proc 2 can 
call itself indirectly. To determine if a digraph is a DAG or not, do a topological sort 
(§3.2.4). 


Main 
Program 


Figure 3.2.3: The call graph of a computer program. 


E4: Ecology. A food web is a digraph in which nodes represent species and in which 
there is an arc from u to v if species u eats species v. Figure 3.2.4 shows a small food 
web. In general, food webs are acyclic, because animals tend to eat smaller animals or 
animals in some way “lower down” in the “food chain.” The very fact that phrases like 
this are used indicates that there is a hierarchy, and thus no directed cycles. 


frog ———> spider <———— grackle 


a 


beetle —————> cherry tree 


Figure 3.2.4: A small food web. 


E5: Genealogy. A “family tree” is a digraph, where the orientation is traditionally 
given not by arrows but by the direction down for later generations. Despite the name, a 
family tree is usually not a tree, since people commonly marry distant cousins, knowingly 
or unknowingly. However, it is always a DAG, because if there were a cycle, everyone 
on it would be older than everyone else on the cycle. 


E6: State Diagrams. Let the vertices of D be a set of states of some process, and 
let the arcs represent possible transitions. For instance, the process might be a board 
game, where the states are the configurations and each arc represents the transition of 
a single move. Then walks through D represent “histories” that the process/game can 
follow. If the game can never return to a previous configuration (e.g., as in tic-tac-toe), 
the state diagram of the game is a DAG. 


FACTS 


F1: Every DAG has at least one source and at least one sink. 
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F2: Every DAG has a unique basis, namely, the set of all its sources. 
F3: Every subgraph of a DAG is a DAG. 

F4: The transitive closure of a DAG is a DAG. 

F5: A digraph is a DAG if and only if every walk in it is a path. 


F6: A digraph is a DAG if and only if it is possible to order the vertices so that, in the 
adjacency matrix, all nonzero entries are above the main diagonal. (Topological sort in 
§3.2.4 finds the ordering.) 


F7: The condensation of any digraph is a DAG. Figure 3.2.5 shows a digraph and its 
condensation. 


Figure 3.2.5: A digraph and its condensation. 


F8: A digraph is a DAG if and only if it is isomorphic to its condensation. 


F9: A digraph is strongly connected (unilateral, weakly connected) if and only if its 
condensation is strongly connected (unilateral, weakly connected). 


F10: A DAG is never strongly connected, unless it consists of a single vertex. 
Fi1i1: A DAG is unilateral if and only if it is a path. 


F12: Every undirected graph without self-loops can be given an acyclic orientation, 
in fact, usually many. Namely, arbitrarily index the vertices as v1, v2,...,Un and direct 
each edge from its lower indexed end to its higher indexed end. 


REMARKS 
R1: For more basic information on DAGs, see [Ha94, Ch. 16] and [Ro76, §2.2-2.3]. 


R2: Most of the acyclic orientations in Fact F12 are arbitrary and uninteresting, but 
occasionally an acyclic orientation is natural. In a tree, it is natural to orient edges 
away from a root; see §3.2.2. In a bipartite graph, it is natural to direct all edges from 
one side to the other. Still, most interesting orientations are already imposed by the 
nature of the problem, and the question is whether they are acyclic. 
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3.2.2 Rooted Trees 


If the underlying graph of a digraph D is a tree, then D is certainly a DAG, because 
it doesn’t even have any undirected cycles. However, the important tree DAGs have 
further restrictions on their edge directions. 

For more on rooted trees, see [GrYe06, §3.2]. 


DEFINITIONS 
D6: A directed tree is a digraph whose underlying graph is a tree. 


D7: <A rooted tree is a directed tree with a distinguished vertex r, called the root, 
such that for every other vertex v, the unique path from r to v is a directed path from 
r to v. 


CONVENTION: In drawing a rooted tree with the root marked, the arrows are usually 
omitted because the direction of each arc is always away from the root. In fact, if the 
direction is always down or left-to-right, as in Figure 3.2.6, it is not even necessary to 
indicate the root. 


Figure 3.2.6: Two standard ways to draw a rooted tree. 


D8: A rooted tree is also called an out-tree. This alternative name is typically used 
when the arc directions are shown explicitly, for instance, when the tree is a spanning 
subgraph of a larger digraph. 


D9: An in-tree is an out-tree with all the directions reversed, so that all paths are 
directed toward the root. 


EXAMPLES 
Previous Example E2 is about rooted trees. Here are some others. 


E7: Decision trees. Any branching process leads to a rooted tree, where each node is 
a decision point, each arc from a node is an allowed decision, and the root is the start. 
For instance, the stages in a game may be represented this way. Figure 3.2.7 shows the 
first two moves in a game of tic-tac-toe, one by each player. Each node is represented 
by the way the board looks just before the decision. If we take into account symmetry, 
the figure is complete through the first two moves. 
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CONVENTION: In Figure 3.2.7 the two nodes on the bottom level (3rd move) illustrate 
that different nodes in the tree can represent the same state. While the board looks 
the same at these two nodes, the two ordered sequences of decisions leading to these 
nodes are different. Thus in a decision tree, each node represents both a state and the 
complete history of how it was achieved. Compare with Example 6, where these nodes 
would be one, and the digraph would not be a tree. 


HE EE EE ep ae HE aed lo 


Figure 3.2.7: The first two moves in the tic-tac-toe game tree and a bit of the third 
level. 


E8: Decomposition trees. Any decomposition of an object or structure into finer and 
finer parts can be modeled with a rooted tree. Figure 3.2.8 shows an example of sentence 
parsing. 


sentence 
subject phrase predicate phrase 
nour verb indirect object phrase 
preposition pia 
tran sings to me 


Figure 3.2.8: A sentence parse tree. 


FACTS 
F13: Every directed tree is a DAG. 


F14: A digraph is a rooted tree if and only if its underlying graph is connected, exactly 
one vertex (the root) has indegree 0, and all others have indegree 1. 
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DEFINITIONS FOR ROOTED TREES 


D10: The depth or level of a vertex v is its distance from the root, that is, the number 
of edges in the unique directed path from the root to v. 


D11: The height of a rooted tree is the greatest depth of a vertex. 
D12: If (u,v) is an edge, the u is the parent of v and v is the child of u. 
D13: Vertices having the same parent are siblings. 


D14: If there is a directed path from vertex u to vertex v, then u is an ancestor of v 
and v is a descendant of u. 


D15: A leaf is a vertex with outdegree 0 (no children). 
D16: An internal vertex is a vertex that is not a leaf. 
D17: An m-ary tree is a rooted tree in which every vertex has m or fewer children. 


D18: A complete m-ary tree is an m-ary tree in which every internal vertex has 
exactly m children and all leaves are at the same level. See Figure 3.2.9. 


Figure 3.2.9: Complete and incomplete ternary (3-ary) trees. 


D19: A ordered tree is a rooted tree in which the order of the children at each vertex 
makes a difference. 


D20: A binary tree is an ordered 2-ary tree in which, even when a vertex has only 
one child, it makes a difference whether it is a left child or a right child. 


REMARKS 


R3: Trees, rooted trees, ordered trees, and binary trees make finer and finer distinc- 
tions, which should only be used if the distinctions are important in the application 
being modeled. For instance, binary trees are used to model computations with binary 
operations, as in 3 x (4/5). Since division is noncommutative (4/5 4 5/4), binary trees 
are an appropriate model for such computations. 


R4: Figure 3.2.10 shows four graphs. As trees they are all the same (that is, isomor- 
phic). However, as rooted trees, G; = Gz and G3 = G4, so there are two rooted trees. 
There are three ordered trees, as G; and G2 are still the same, but G3, G4 are different. 
Finally, as binary trees they are all different. In G1, vertex c is a right child; in G2 it is 
a left child. 
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. p /\ a 
a c Cc a 
Cc Cc 
Gl G2 G3 G4 
Figure 3.2.10: Four trees: the same and not the same. 


FACTS 
F15: An m-ary tree has at most m” vertices at level k. 


F16: Let T be an n-vertex m-ary tree of height h. Then 


ht+l_y 
htl<n< 


m—-1 
The lower bound is attained if and only if T is a path. The upper bound is attained if 
and only if T is a complete m-ary tree. 


Spanning Directed Trees 


Since every connected graph has a spanning tree, every digraph has a spanning directed 
tree. In a graph, a spanning tree connects all the vertices, while using the minimum 
number of edges. However, in a digraph, a spanning directed tree may contain few 
directed paths and thus may allow fewer connections than the whole digraph does. So 
the more interesting question is whether a digraph has a spanning rooted tree. This 
question is answered algorithmically by the directed version of depth first search; see 
§10.1 and [GrYe06, §12.1]. It is answered algebraically by the directed matrix tree 
theorems; see $6.4. Here we simply state two key facts. 


FACTS 


F17: A digraph D has a spanning tree rooted at v if and only if directed depth first 
search starting at vu finds one. 


F18: For every vertex of a digraph D there is a spanning tree rooted at that vertex if 
and only if D is strongly connected. 


Functional Graphs 


Closely related structurally to rooted trees, but devised for a different purpose, are 
functional graphs. 


DEFINITIONS 


D21: A functional graph is a digraph in which each vertex has outdegree one. 
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EXAMPLES 


E9: For each function f from a finite domain U to itself, define a digraph D whose 
vertex set is U and for which (u,v) is an arc if and only if f(u) =v. By definition of a 
function, there is one such v for every u € U. Hence, D is a functional graph (whence 
the name). 


E10: Specifically, consider the doubling function on the positive integers, but consider 
only the effect on the ones digit. This function is completely described by its effect on 
the domain {0,1,...,9}. Its functional graph is shown in Figure 3.2.11. 


Figure 3.2.11: The functional graph for doubling (mod 10). 


FACTS 


F19: Let D bea functional graph, and let G be the underlying undirected graph. Then 
each component of G contains exactly one cycle. In D this cycle is a directed cycle, and 
the removal of any arc in it turns that component into an in-tree. 


3.2.3 DAGs and Posets 


There is a very close connection between DAGs and posets. Every DAG repre- 
sents a poset, and every poset can be represented by DAGs in several ways. For more 
information, see [Bo00, §7.1-7.2]. 


DEFINITIONS 


D22: A partial order is a binary relation X on a set X that is 
e reflexive: for allxa Ee X, x4 ~<a; 
e antisymmetric: for allz,ye X,ifaxy and yxza, then «= y; 
e transitive: for all x,y,z € X, if x ~<yand y =X z, then z X z. 


D23: A poset, or partially ordered set P = (X,~) is a pair consisting of a set X, 
called the domain, and a partial order X on X. 
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D24: Elements x,y of P are comparable if either x xX y or y X x. 
D25: Element x is less than element y, written x <~ y, ifaxyandaFy. 


D26: The comparability digraph of the poset P = (X, x) is the digraph with vertex 
set X such that there is an arc from x to y if and only if x x y. 


D27: The element y covers the element x in a poset if x < y and there is no element 
z such that x ~ z < y. 


D28: The cover graph of a poset P = (X,-) is the graph with vertex set X such 
that x,y are adjacent if and only if one of them covers the other. 


D29: A Hasse diagram of poset P is a straight-line drawing of the cover graph such 
that the lesser element of each adjacent pair is lower in the drawing. 


EXAMPLES 


Ell: Let X = {2,4,5,8,10,20} and let = be the divisibility relation on X. That is, 
x x y if and only if y/z is an integer. The comparability digraph and the Hasse diagram 
for P = (X, X) are as shown in Figure 3.2.12. 


Figure 3.2.12: Comparability digraph and Hasse diagram for a poset. 


FACTS 
F20: If the loops are deleted, the comparability digraph of any poset is a DAG. 


F21: Every Hasse diagram is a DAG if one considers all edges to be directed up (or 
all down). 


F22: Every DAG D represents a poset in the following sense. The domain of P is the 
vertex set of D, and x x y if there is a directed path from x to y. 


TERMINOLOGY NOTE: In passing from DAG D to poset P, null paths are included, so 
that x x x for all x. Alternatively, we obtain the poset by taking the transitive closure 
D* of D. Then x ~ y if and only if (a, y) is an arc of D*. 
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3.2.4 Topological Sort and Optimization 


In a DAG, the vertices can always be numbered consecutively so that all arcs go 
from lower to higher numbers. Using this numbering, many optimization problems can 
be solved by essentially the same algorithm, one that makes a single pass through the 
vertices in numbered order. For more general digraphs, algorithms for these optimization 
problems are less efficient or at least more complicated to describe. 


DEFINITIONS 


D30: A linear extension ordering of a digraph is a consecutive numbering of the 
vertices aS U1, U2,...,Un so that all arcs go from lower-numbered to higher-numbered 
vertices. 


D31: A topological sort, or topsort, is any algorithm that assigns a linear extension 
ordering to a digraph when it has one. (This name is traditional, but the relation to 
topology in the sense understood by topologists is obscure.) A simple topological sort 
algorithm is shown as Algorithm 3.2.1. See also [RoTe03, §11.6.2]. 


FACTS 
F23: A digraph has a linear extension ordering if and only if it is a DAG. 


F24: Topological sort determines if a digraph is a DAG and finds a linear extension 
ordering if it is. 


Algorithm 3.2.1: Topological Sort 


Input: a digraph D. 
Output: A linear extension ordering if D is a DAG; failure otherwise 
A :=D;k:=1 
while Vi # {vertex set of H non-empty} 
Up, = any vertex in H of indegree zero. 
{If no such vertex exists, exit: D is not a DAG} 
H:=H—-v, {New H isa DAG if old H was} 
k:=k+1 


REMARK 


R5: Because of the close connection between DAGs and posets, this whole discussion 
of linear extensions and topological sort can just as well be stated in the poset context. 
For instance, every poset has a linear extension, which may be found by a topological 
sort. See [GrYe06, pp. 507-510]. 
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Optimization 


There are many computational problems about graphs, with important real-world ap- 
plications, when the graphs have weights on their vertices and/or edges. For DAGs, 
many of these problems can be solved by essentially the same single-pass algorithm. 
This algorithm is the basic form of the sort of staged algorithm called dynamic pro- 
gramming in operations research circles [HiLil0, Ch. 10]. Algorithms 3.2.2 and 3.2.3 
provide templates for two versions of this algorithm. The examples that follow fill 
in the templates by giving specific formulas for updating the functions they compute. 


In Algorithm 3.2.2, topsort is done first, and then the function F' is computed vertex 
by vertex in topsort order. In Algorithm 3.2.3, the topsort is done simultaneously with 
improving F' on vertices not yet sorted. 


Algorithm 3.2.2: Basic Dynamic Programming, First Version 


Input: DAG D with vertices numbered v1, v2,...,Un in topsort order; weights 
w(v) on vertices or w(v, u) on arcs, as needed. 
Output: Correct values of desired function F’. 
Initialize F(v,) 
Fork =2 ton 
Determine F(v,) in terms of weights and all F(v;) with i < k. 


Algorithm 3.2.3: Basic Dynamic Programming, Second Version 


Input: DAG D with n vertices and weights w(v) on vertices or w(v, wu) on arcs, 
as needed. 
Output: Correct values of desired function F’. 


Initialize F(v) for all v. 

Hf :=D 

Fork=1ton 
Uy, += asourcein H {exists since H is a DAG} 
Update F(u) for all u for which (vz, u) is an edge in H. 
HA :=H—-v, 


EXAMPLES 


CONVENTION: Below we assume that each DAG has just one source, the vertex Start. In 
other words, every other vertex has at least one predecessor. Without this assumption, 
some formulas below involve operations on empty sets (e.g., sum or max), and each 
example would need to specify how to correctly interpret those operations. For simplicity 
we also assume that the DAGs have no multiple edges. 
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E12: Project Scheduling. Consider Figure 3.2.13, which repeats Figure 3.2.1 with the 
following additions: Start and Finish vertices, a topsort ordering, and times for the 
tasks as weights on the vertices. Start and Finish, being merely marker vertices, take 
time 0. Recall that (u,v) is an arc if task w must be completed directly before task v 
begins, and that these tasks are the steps necessary to complete an airplane stopover. 
How quickly can the stopover be completed? The bottleneck is the longest path from 
Start to Finish, where the length of a (directed) path is the sum of the weights on its 
vertices. Dynamic programming can answer this question as follows. 


Let 
F'(u) = the length of the longest path (using vertex weights) from Start to w. 
Then in Algorithm 3.2.2 use 
Initialization: F(v,) = w(v1) =0, (Note: v; = Start) 
Update: F(v,) = w(v~) + max{F'(v;) | (vi, ve) is an arc}. 
In Algorithm 3.2.3 use 
Initialization: For all v, F(v) = w(v), 
Update: For all u such that (vg, u) is an arc, F(w) = max{F(u), F(vg,)+w(u)}. 


For either algorithm, at termination the desired answer is F'(Finish), that is, F'(v,). 


v5: 10 


v7: 25 

unload load 
passengers passengers 

vi: : v8: 0 
clean cabin 
Start Finish 
3: 20 v6: 20 

unload luggage load new luggage 


Figure 3.2.13: Airplane stopover as CPM graph. 


This method of finding the optimal schedule by iteratively finding the longest path is 
the essence of the critical path method, or CPM [HiLi10, §9.8]. This example uses the 
activity on node model, or AON. See Example 13 for the activity on arc model, or AoA. 


E13: Project Scheduling, second model. If edges represent subtasks, and tasks earlier 
on directed paths must be completed before those later are begun, then the longest path 
from the Start to Finish vertex is the shortest time in which the whole project can be 
completed, where now the length of a path is the sum of the weights on its edges. Let 


F'(u) = the length of the longest path (using edge weights) from Start to uw. 
Then in Algorithm 3.2.2 use 


Initialization: F(v,) = 0, 
Update: F (vp) = max{F'(v;) + w(ui, ve) | (Vi, UR) is an arc}. 
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In Algorithm 3.2.3 use 
Initialization: For all v, F(v) = 0, 
Update: For all u such that (vz, u) is an arc, 
F(u) = max{F(u), F(ug)+w(vr,u)}. 


For either algorithm, at termination the desired answer is F'(Finish). 


E14: Shortest Paths. What is the shortest directed path between two vertices u and w’, 
where the length of a path is the sum of the weights on its edges? If a graph represents 
a road network, and the weights on the edges are the lengths of the road segments (or 
the travel times, or the toll on that segment), then shortest path means the shortest 
road distance (or least time, or lowest toll). If the graph is a DAG, and we make u the 
Start vertex (by eliminating earlier vertices in the topsort if necessary), then dynamic 
programming finds the shortest path as follows. Let 


F(v) = the length of the shortest path (using edge weights) from Start to v. 
Then in Algorithm 3.2.2 use 


Initialization: F(v,)=0, (Note: v; =u = Start) 
Update: F (vp) = min{F(v;) + w(ui, vp) | (vi, vx) is an arc}. 


In Algorithm 3.2.3 use 


Initialization: F(v,) =0, F(v) =o for v 4 vy, 
Update: For all v such that (v,,v) is an arc, 
F(v) = min{F(v), F(vug)+w(vz, v)}. 


For either algorithm, at termination the desired answer is the value of F'(u’). 


E15: What is the shortest directed path between two vertices, where the length of 
a path is the sum of the weights on its vertices? Dynamic programming solves this 
problem too for DAGs, with a slight change in the formulas in Example 14 (replace 
edge weights with vertex weights). 


E16: Counting Paths. How many directed paths are there between a given pair of 
vertices? If the digraph is a DAG, and the vertices are Start and Finish, let 


F(u) = the number of directed paths from Start to uw. 
Then in Algorithm 3.2.2 use 


Initialization: F(v,) = 1, (v1 = Start) 
Update: F (vp) = o{F (v:) | (vi, ve) is an arc}. 


In Algorithm 3.2.3 use 


Initialization: F (Start) = 1, F(v) =0 for v F Start, 
Update: For all v such that (vz, v) is an arc, 
F(v) = F(v) + F (vp). 


For either algorithm, at termination the desired answer is the value of F'(Finish). 
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E17: Maximin Paths. What is the directed path between two vertices for which 
the minimum edge weight on that path is maximum among all paths between those 
two vertices? This is called the maximin path and that maximum value is called the 
maximin value. In Figure 3.2.14 the maximin path from v; to vg is vj v3v4vg and the 
maximin value is 4. If the edges represent railroad segments, and each edge weight is 
the weight limit on that railroad segment, then this is the path between the two points 
over which the heaviest load can be shipped. 


Figure 3.2.14: The maximin path v ,v3v4vg has value 4 and the minimax path v,v3u5u¢ 
has value 6. 


If the digraph is a DAG, and the vertices of interest are Start and Finish, let F(u) = 0 
mean that Fu) is so far undefined, and by convention let max{@,a} = min{@,a} = a. 
Our goal is at the end to have 


F'(u) = the maximin value for directed paths from from Start to uw. 


Then in Algorithm 3.2.2 use 


Initialization: F'(v1) = 0, (v1 = Start) 
=m. 


Update: F'(v,z) ax{min{F'(v;), w(u;, ve) } | (Vi, ve) is an arc}. 


In Algorithm 3.2.3 use 


Initialization: For all v, F(v) =. 
Update: For all v such that (vz, v) is an arc, 
F(v) = max{F(v), min{F (vg), w(ve,v)}}. 


For either algorithm, at termination the desired answer is the value of F'(Finish). 


E18: Minimax Paths. What is the directed path between two vertices for which the 
maximum edge weight on the path is minimum? This minimax question is relevant if 
the graph represents a pipeline network, and each edge weight is the maximum elevation 
on that segment, because the work necessary to push a fluid through a pipeline route 
is related to the maximum height to which the fluid must be raised along the way. In 
Figure 3.2.14 the minimax path from v; to vg is vj v3v5ve6 and the minimax value is 6. 
Dynamic programming solutions to the minimax problem are found by interchanging 
the roles of min and max in the algorithms for Example E17. 
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FACTS 


F25: Algorithms 3.2.2—3.2.3 each solve critical path problems and many other opti- 
mization and computation problems on DAGs. (See the examples above.) 


F26: In project scheduling problems modeled by DAGs, the minimum completion time 
is the length of the longest path from the Start node to the Finish node. 


F27: Any DAG may be augmented to have just one source and one sink (just create a 
new node named Start adjacent to all existing sources, and a new node named Finish 
adjacent from all existing sinks). 
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INTRODUCTION 

Tournaments comprise a large and important class of directed graphs. Application 
areas in which tournaments arise as models include round-robin tournaments (hence the 
name), paired-comparison experiments, domination in some animal societies, majority 
voting, population ecology, and communication networks. Many early results were 
motivated by applications; more recently, much focus has been on the combinatorial 
structure of tournaments as a separate area of graph theory. J. W. Moon’s excellent 
monograph [Mo68] contains most of the results on tournaments up to 1968. In large 
part because of the influence of that work, tournament theory has so flourished during 
the past 35 years that subsequent surveys covered only a fraction of the results available. 
However, these surveys remain good sources for results about tournaments and directed 
graphs related to tournaments (see [HaNoCa65], [HaMo66], [BeWi75], [ReBe79], [Be81], 
[ZhSo91], [Gu95], [BaGu96], and [Re96]). Much work has been done on generalizations 
and extensions of tournaments (see [BaGu98]). A good source for digraphs in general, 
with extensive coverage of tournaments and various generalizations, is the book by 
Bang-Jensen and Gutin [BaGu01]. 


3.3.1 Basic Definitions and Examples 
NOTATION: An arc from vertex x to vertex y will be denoted (x,y) or by x > y. 
DEFINITIONS 


D1: <A tournament is an oriented complete graph, i.e., there is exactly one arc 
between every pair of distinct vertices (and no loops). 
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D2: The order of a tournament T is the number of vertices in T. A tournament 
of order n will be called an n-tournament. 


D3: A vertex x in a tournament T’ dominates (or beats) vertex y in T whenever 
(x,y) is an arc of T. We also say that y is dominated (or beaten) by zx. 


D4: A vertex that dominates every other vertex in a tournament is called a trans- 
mitter. A vertex that is dominated by every other vertex in a tournament is called a 
receiver. 


D5: The score (or out-degree) of a vertex v in a tournament T is the number of 
vertices that v dominates. It is denoted by d}(v). Note that if the tournament T under 
consideration is clear from the context, then T will be dropped and the score of v will 
be denoted d*(v). The in-score (or in-degree) of a vertex u in a tournament T is the 
number of vertices that dominate u. It is denoted dp (u) (or d- (u)). 


D6: The score sequence (or score vector) of an n-tournament T is the ordered 
n-tuple (81, 82,.-.,Sn—1, $n), where s; is the score of vertex v;, 1 <i <n, and 


8, S82 <<... Sn_1 < Sy 


D7: <A tournament is reducible if its vertex-set can be partitioned into two non- 
empty subsets Vj and V2 such that every vertex in V; dominates every vertex in V3. A 
tournament that is not reducible is said to be irreducible. 


D8: The out-set of a vertex x in a digraph D, denoted O(z), is the set of all vertices 
that « dominates, and the in-set of x, denoted I(x), is the set of all vertices that 
dominate 2. 


TERMINOLOGY: In a digraph D, the out-set of a vertex « is also called the netghborhood 
of x, denoted N# (ax) (or N+ (x) if D is understood). 


FACTS 


F1: There are 2(2) different labeled n-tournaments using the same n distinct labels, 
since for each pair of distinct labels {a,b}, either the vertex labeled a dominates the 
vertex labeled b or b dominates a. 


F2: [Da54] The number ¢(n) of non-isomorphic (unlabeled) n-tournaments is given 
by a rather complicated formula involving a summation over certain partitions of n. 
Moreover, 


t(n) 


t(n) > and lim ———— =1 
( ) n! n—-0o (2) jn! 
The first few values of t(n) are given by 
n |1)/2/3]4]5 | 6 7 8 9 10 


t(n) | 1 | 1 | 2 | 4 | 12 | 56 | 456 | 6880 | 191536 | 9733056 
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EXAMPLES 


E1: Tournaments of orders 1 through 4 are illustrated in Figure 3.3.1. 


Figure 3.3.1: Tournaments of orders 1 through 4. 


E2: Tournament Scheduling. To minimize the number of days of an n-tournament, 
games are scheduled in parallel. If n is even, then at least n—1 rounds are needed (since 
at most n/2 of the n(n—1)/2 games may be played at once). If n is odd, then at least 
n rounds are needed. There are several scheduling methods that achieve these compact 
schedules (see [Mo68]). Round-robin sports tournament scheduling and its relationship 
to edge-coloring is discussed in 85.6. Another related topic, factors and factorization, is 
discussed in 85.4. 


Regular Tournaments 


DEFINITIONS 


D9: A regular tournament is a tournament T in which all scores are the same (i.e., 

there is an integer s so that dt (v) = s for all vertices v € V(T)). An almost regular (or 

near regular) tournament is a tournament T in which ee (v) —d-(v)|} = 1. 
veV(T 


D10: A doubly-regular tournament is a tournament in which all pairs of vertices 
jointly dominate the same number of vertices (i.e., there is an integer k so that for all 
distinct pairs of vertices x,y € V(T), we have |O(x) MN O(y)| = k). 


D11: Let G be an abelian group of odd order n = 2m-+ 1 with identity 0. Let S be 
an m-element subset of G'— {0} such that for every x,y € S, «x+y #40. That is, choose 
exactly one element from each of the m 2-sets of the form {z,—a}, where x ranges over 
all  € G— {0}. Form the digraph D with vertex-set V(D) = G and arc-set A(D) 
defined by: arc (%,y) € A(D) if and only if y—a € S. Then D is called a rotational 
tournament with symbol set S and is denoted Rg(S), or simply R(S) if the group 
G is understood. 


D12: Let G=GF/(p") be the finite field with p* elements, where p is a prime, p = 3 
(modulo 4), and k is an odd positive integer, and let S be the set of elements that 
are multiplicative squares of G (called the quadratic residues). Then the rotational 
tournament R¢(S) is called a quadratic residue tournament. 
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FACTS 
F3: The rotational tournament R¢(S), where |G| =n, is a regular n-tournament. 


F4: [ReBe79] If T is a doubly-regular n-tournament, then T is regular and n = 3 
(modulo 4). Moreover, there exists a doubly-regular (44 + 3)-tournament if and only if 
there exists a (4k +4) by (4k+4) matrix H of +1’s and —1’s such that HH* = (4k+4)I 
and H + H'‘ = 21, where J is the identity matrix (such an H is called a skew-Hadamard 
matrix) [ReBr72]. 


REMARK 


R1: Frequently, the group G for the rotational tournament Rg(S) is taken to be Z,, 
the integers modulo n = 2m + 1. 


EXAMPLES 


E3: The 9-tournament shown in Figure 3.3.2 is regular since every vertex has score 
4, and it is also irreducible. Moreover, it is the rotational tournament Rg(S), where 
G = Zp and S = {2,4,6, 8}. 


Figure 3.3.2: The regular, rotational tournament R({2, 4,6, 8}). 


E4: The regular 7-tournament shown in Figure 3.3.3 is the quadratic residue tourna- 
ment Rg(S), where G = GF(7) and S = {1,2,4}. Observe that it is irreducible, and it 
is a doubly-regular tournament since |O(x) 7 O(y)| = 1 for all distinct pairs of vertices 
x and y. The quadratic residue 7-tournament is notorious in tournament theory due to 
its occurrence as an exception to many results on tournaments. 


Figure 3.3.3: R({1,2,4}). 
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Arc Reversals 


Any n-tournament can be transformed into any other n-tournament by a sequence of 
reversals of arcs. 


FACTS 


F5: ([Ry64] If T and W are two n-tournaments with the same score sequence, then 
T can be transformed into an isomorphic copy of W by a properly chosen sequence of 
reversals of arcs in 3-cycles. 


F6: [Re73] If T and W are two n-tournaments and k is a fixed integer with 1 < k < n—1, 
then T can be transformed into an isomorphic copy of W by a properly chosen sequence 
of reversals of arcs in k-paths. 


REMARK 


R2: C. Thomassen [Th88] extended these results to pairs of tournaments with the same 
vertex-set. He also described an infinite class of non-tournament digraphs for which the 
reversal of no arc decreases the total number of cycles. However, little progress has been 
made on the corresponding tournament problem (see [Re84]). 


CONJECTURE 


Adam’s Conjecture [Ad64]: Every non-transitive tournament contains at least one 
arc whose reversal reduces the total number of cycles. 


3.3.2. Paths, Cycles, and Connectivity 


Paths and cycles are fundamental sub-structures in tournaments and have been well 
studied in tournament theory. Many more results than given here have been collected 
by Bang-Jensen and Gutin in their survey [BaGu96] and their book [BaGu01]. 


NOTATION: All paths in this section are directed unless specified otherwise. 
DEFINITIONS 


D13: A hamiltonian path (or spanning path) in a digraph D is a path that includes 
all vertices of D. A hamiltonian cycle (or spanning cycle) in a digraph D is a cycle 
that includes all vertices of D. (Hamiltonian paths and cycles are discussed in §4.5.) 


D14: A digraph D is strong (or strongly connected) if for every pair of distinct 
vertices x and y of D, there is a path from x to y and a path from y to z. 
EXAMPLE 


E5: The two tournaments in Figures 3.3.2 and 3.3.3 are strong and irreducible. For 
example, the hamiltonian cycle in R({2, 4,6, 8}) given by 


07-2-67>337>57>4>58>1>7>0 


implies that for every pair of distinct vertices x and y, there is a path from x to y and 
a path from y to x. 
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REMARK 


R3: Fact F7, below, is perhaps the most fundamental result about tournaments and 
is used frequently in their study. The first part has several inductive proofs. 


FACTS 


F7: [Re34] Every tournament contains a hamiltonian path. Moreover, every tourna- 
ment contains an odd number of hamiltonian paths. 


F8: The following four statements are equivalent for any n-tournament T: 


a) T is strong. 


b) T is irreducible. 


(a) 
(b) 
(c) T contains a hamiltonian cycle [Ca59]. 

(d) For every vertex x of T and for every integer k, 3 < k < n, x is contained in a 
cycle of length k [Mo68]. (See also [HaMo66].) 


F9: [Ga72] A curious fact: the number of n-tournaments containing a unique hamil- 
tonian cycle is equal to the (2n — 6)* Fibonacci number. 


F10: [MoMo62] The fraction of labeled n-tournaments that are strong approaches 1 
as N —> OO. 


F11: There is an O(n?) algorithm for finding a hamiltonian path in a tournament, 
and there is an O(n”) algorithm for finding a hamiltonian cycle in a tournament. (See 
[BaGu01] and [Ma92]).) 


F12: [Vo02] Every arc of a strongly connected n-tournament is contained in a path of 
length 
[(n + 3)/2] -—1 


Condensation and Transitive Tournaments 


DEFINITIONS 


D15: IfT is a tournament with vertex partition {Vi, Vo,..., Vi}, where each V; induces 
a maximal strongly connected sub-tournament of T, then the condensation tourna- 
ment of T, denoted T*, is the k-tournament with vertex-set {u1,u2,...,u,} and in 
which u; dominates u; whenever all of the vertices in V; dominate all of the vertices in 
V; in T. 


D16: A tournament T is transitive if for all three distinct vertices x, y, and z in T, 
if dominates y, and y dominates z, then x dominates z. 
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EXAMPLE 


E6: Consider the 9-tournament T consisting of three vertex-disjoint 3-cycles Ai, Ao, 
A3, in which every vertex of A, dominates every vertex of Az and every vertex of A3, 
and every vertex of Az dominates every vertex of A3. The vertex partition of T in 
Definition D15 is V(A,) UV(A2) UV(A3), and T™ is the transitive 3-tournament with 
vertex-set {u,, U2, u3}, where u, dominates wz and u3, and uz dominates us. 


FACTS 
F13: [HaNoCa65] The condensation T* of a tournament T is a transitive tournament. 


F14: The following five statements are equivalent for an n-tournament. See [Mo68] for 
references. 


(a) T is transitive. 

(b) T contains no cycles. 

(c) T contains a unique hamiltonian path. 

(d) T has score sequence (0,1,2,3,...,2—2,n—1). 


(e) The vertices of T can be labeled vj, v2, v3,...,Un—1, Un So that v; dominates v; if 
and only if 1<i<j <n (ie., T is a complete [linear] order). 


F15: Every (2”~')-tournament contains a transitive sub-tournament of order n. 


Cycles and Paths in Tournaments 


FACTS 


F16: [A167] Every arc in a regular n-tournament, n > 3, is in cycles of all lengths m, 
3<m<n. (See [Th80] for extensions.) 


F17: [Ja72] Every arc in an almost regular n-tournament, n > 8, is in cycles of all 
lengths m, 4 <m <n. (See [Th80] for extensions.) 


F18: [AlReRo74] For every arc (x,y) of a regular n-tournament T, where n > 7, and 
for every integer m, 3 <m <n-—1, T contains a path of length m from x to y. (See 
[Th80] for extensions.) 


F19: [Th80] For every arc (x,y) of an almost regular n-tournament T, where n > 10, 
and for every integer m, 3 < m < n—1, T contains a path of length m from z to y. 
(See also [GuVo97].) 
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Hamiltonian Cycles and Kelly’s Conjecture 


CONJECTURE 


Kelly’s conjecture (see [Mo68]): The arc-set of a regular n-tournament can be parti- 
tioned into (n — 1)/2 subsets, each of which induces a hamiltonian cycle. 


EXAMPLE 


E7: The arc-set of the quadratic residue rotational 7-tournament R({1,2,4}) can be 
decomposed into 3 hamiltonian cycles: 


07> 17527>537>47>5>6-0 


07-2>4>67>17>3755-0 
0>-451>57>52>56>53-50 


REMARK 


R4: Kelly’s conjecture has stimulated much work in tournament theory. Evidence 
for the conjecture includes: it is true for n < 9 (B. Alspach, see [BeTh81]); every 
n-tournament, n > 5, contains two arc-disjoint hamiltonian cycles [Zh80]; regular or 
almost regular n-tournaments contain at least |./n/1000| arc-disjoint hamiltonian cy- 
cles [Th82]. The best published result to date is the next result. A covering result then 
follows. 


FACTS 


F20: [Ha93] Then there exists a positive constant c, c > 2718, so that each regular 
n-tournament contains at least cn arc-disjoint hamiltonian cycles. 


F21: [Th85] Each regular n-tournament T contains 12n hamiltonian cycles so that 
each arc of T is in at least one of the cycles. 


Higher Connectivity 


DEFINITION 


D17: Dis k-strong (or k-strongly connected) if for every subset S of k—1 or fewer 
vertices of D, D — S is a strong digraph. 


FACTS 


F22: [Th80] Every arc in a 3-strong tournament is contained in a hamiltonian cycle. 
Moreover, this is false for infinitely many 2-strong tournaments. For every pair of 
distinct vertices « and y in a 4-strong tournament there is a hamiltonian path from x 
to y and there is a hamiltonian path from y to x. Moreover, this is false for infinitely 
many 3-strong tournaments. 
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F23: [FrTh87] If T is a k-strong tournament and B is any set of k — 1 or fewer arcs of 
T, then the arc-deletion digraph T — B contains a hamiltonian cycle. 


F24: [Th84] There is a function h so that given any k independent arcs, a1, @2,..., Ak, 
in an h(k)-connected tournament T, there is a hamiltonian cycle in T containing 
@1,@2,...,@x in cyclic order. 


F25: [S093] For any integer m, 3 < m < n—3, every 2-strong n-tournament T, n > 6, 
contains two vertex-disjoint cycles of lengths m and n — m, unless T is isomorphic to 
the quadratic residue rotational tournament RT(1,2,4). (This result is based on the 
case m = 3, which was established earlier in [Re85]. See also [BaGu00].) 


F26: [ChGoLi01] If T is a k-strong n-tournament with n > 8k, then T contains k 
vertex-disjoint cycles that use all of the vertices of T. 


Anti-Directed Paths 


During the last 30 years, researchers have also searched for copies of other orientations 
of undirected paths and cycles in tournaments. Initially, study focused on oriented 
paths and cycles that contain no directed path of length 2 (called anti-directed paths 
and cycles), and successes there led to more general results on arbitrary oriented paths 
and cycles. 


TERMINOLOGY: In a digraph, a directed path of length k is sometimes called a k-path. 
DEFINITION 


D18: An anti-directed path (or cycle) in a digraph D is a sequence of arcs that 
forms a path or cycle in the underlying graph of D but does not contain a directed path 
of length 2 in D. 


EXAMPLE 


E8: Two anti-directed paths and an anti-directed cycle are illustrated in Figure 3.3.4. 


4 
# 


e-_e -0-@ -0- @ 


Figure 3.3.4: Two anti-directed paths and an anti-directed cycle. 
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FACTS 


F27: [HaTh00a] Let T be a tournament different from the 3-cycle, the regular 5- 
tournament, and the quadratic residue rotational tournament R(1,2,4). Then T con- 
tains every orientation of an undirected hamiltonian path. (This was first proved for 
anti-directed hamiltonian paths in 1971 [Gr71].) 


F28: [Pe84] If n is even and n > 16, then every n-tournament contains an anti-directed 
hamiltonian cycle. (The major part of this result was first established for all even n > 50 
[Th73], and then the result was improved to all even n > 28 [Ro74].) 


F29: [Ha00a] Every n-tournament, n > 68, contains every orientation of a (undirected) 
hamiltonian cycle except possibly the (directed) hamiltonian cycle when the tournament 
is reducible. (This was first proved for n > 2'*8 [Th86].) 


3.3.3 Scores and Score Sequences 


Fact F30, below, due to the mathematical sociologist H. G. Landau, is another basic 
result that is useful in studies on tournaments. Nearly a dozen proofs appear in the 
literature (see the survey [Re96] and subsequent paper [GrRe99]). 


FACTS 


F30: [La53] A sequence of n integers (81, $2,...,8m—1, Sn), Where 
81 < 82 <... < Sn_1 < Sp, is the score sequence of some n-tournament if and only if 


ee or fork =1,2,...,n-—1, and t= (5) 


i=l 


F31: [HaNoCa65] A sequence of n integers (1, S2,..-,Sn—1, $n), where 
81 < 83 <... < Sn_1 < Sn, is the score sequence of some strong n-tournament if and 
only if 


k 


k m n 
S> si > Cc) fork =1,2,...,n-—1, and Dee (5) 


i=1 
(See also [HaMo66].) 
F32: Let S = (81, 82,53,..-,;Sn—1,$n) be a sequence of n > 2 nonnegative integers, 


where 8, < 589 <... < S,_1 < 8, <n-—1, and let m= s,. S' is the score sequence of 
some n-tournament if and only if the new sequence 


81, $2,-++58m3$m41 — 1, 8m42—1,---,$n-1—-1 


when arranged in non-decreasing order, is the score sequence of some (n—1)-tournament. 
(See [ReBe79].) 


F33: [Av80] A score sequence S = (81, $2,53,---,5n—1,$n) is the score sequence for 
exactly one n-tournament T if and only if each of the strong components of T is simple, 
and the simple strong score sequences are (0), (1, 1, 1), (1, 1, 2, 2), and (2, 2, 2, 2, 2). 
(See also [Te98].) 
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F34: [BrQi84] Let S = (s1,52,...,5n) be a score sequence. Every n-tournament with 
score sequence S' has a unique hamiltonian cycle if and only if 


S = (1,1,2,3,...,n—3,n—2,n— 2) 


F35: [Ya88] and [Ya89] Every non-empty set of nonnegative integers is the set of scores 
for some tournament. 


EXAMPLE 


E9: It is easy to verify that the sequence (1, 1, 1, 4, 4, 4) satisfies the conditions 
of Fact F30, and hence, it is the score sequence for some 6-tournament. In fact, the 
6-tournament consisting of two vertex-disjoint 3-cycles, A and B, where every vertex 
of A dominates every vertex of B, has score sequence (1, 1, 1, 4, 4, 4). Since the sum 
of the first three scores equals (os Fact F31 implies that no 6-tournament with score 
sequence (1, 1, 1, 4, 4, 4) is strong. 


REMARKS 


R5: The transitive n-tournament is the only n-tournament in which all of the scores 
are distinct. So, each score occurs with frequency 1. Regular n-tournaments are the 
only n-tournaments in which all of the scores are the same. So, each score in a regular 
n-tournament occurs with frequency n. Thus, each of the sets {1} and {n} is the set 
of frequencies of scores in some n-tournament. Given a non-empty set F' of positive 
integers, the least possible order of a tournament with a set of score frequencies given 
by F was explicitly determined in [AlRe78]. 


R6: Tournament Rankings. Given the results of a round-robin competition, one would 
like to rank the teams or at least pick a clearcut winner. Unfortunately, not one of 
the many ranking methods that have been proposed is entirely satisfactory. Ranking 
by the order of a hamiltonian path (whose existence is guaranteed by Fact F7 in the 
previous subsection) does not work unless the path is unique, which is only the case for 
transitive tournaments (Fact F14). Ranking by score vector usually results in ties, and 
a team that beats only a few teams, with those few teams having lots of wins, might 
deserve a better ranking. This suggests considering the second-order score vector, where 
each team’s score is the sum of the out-degrees of the teams it beats. One can continue 
by defining the nth-order score vectors recursively. There is an asymptotic ranking 
obtained this way, related to the eigenvalues of the digraph. See [Mo68] for more details 
and references. 


The Second Neighborhood of a Vertex 


DEFINITION 


D19: Let x be a vertex in a digraph D. The second neighborhood of x, denoted 
Nj* (2), is the set of all vertices of D reachable from x by a 2-path but not a 1-path. 
That is, NS*(2)=[ U  Ndy) |- NS @). 

yEN} (2) 
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FACTS 
F36: [Fi96] Every tournament T contains a vertex x so that |N£t (2)| > | NZ (2)|. 


F37: [HaTh0Ob] If a tournament T contains no transmitter, then there are at least 
two vertices that satisfy the condition in Fact F36. 


CONJECTURE 


Seymour’s second neighborhood conjecture (see [Fi96]): Every digraph D con- 
tains a vertex z for which NS (2)| > [V5 (2)|- 


3.3.4 Transitivity, Feedback Sets, and Consistent Arcs 


In a tournament that represents the outcomes of a paired-comparison experiment 
(or the results of a round-robin competition or the results of majority voting by an 
electorate in which there are no ties), there is much interest in attempts to measure 
the consistency of choices by the subject who made the comparisons (or the consistency 
of wins among the participants or the consistency of the electorate’s choice among the 
alternatives). Consistency corresponds to a lack of cycles. So, one measure is the largest 
number of vertices that induce a transitive sub-tournament in the outcome tournament. 
Another measure is the largest number of arcs of the outcome tournament that do not 
contain the arcs of a cycle. 


DEFINITION 


D20: A feedback set of arcs in a tournament T is a set S of arcs such that the 
digraph T — S contains no cycle. 


EXAMPLE 


E10: Let T be the tournament with vertex-set {1,2,...,n} in which j dominates k 
whenever j > k, except that 7 dominates 7+ 1, for i = 1,2,...,n—1. Then the set of 
arcs {(7,4 +1) | 1 <7 <mn-—1)} is a feedback set of arcs in T, and the smaller set of 
ares {(7,4 +1) | 1<i<mn-—1, i odd} is also a feedback set of arcs in T. 


Smallest Feedback Sets 


Finding a smallest feedback set in an n-tournament T is equivalent to finding a transitive 
n-tournament (or linear order) W such that V(W) = V(T) and the number of pairs 
{x,y} of distinct vertices in which « dominates y in T but not in W is as small as possible. 
The latter problem is known as a Slater problem in the voting literature involving 
tournaments. See [ChHuWo96]. Work on the computation of a minimum-weighted 
feedback set of arcs in an arc-weighted tournament is reported in [ChGuHuWo097] and 
[ShYu01]. 


FACTS 


F38: A smallest set of arcs in a tournament T whose reversal yields a transitive 
tournament is a smallest feedback set in T. (See [BaHulsRoTe95].) 
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F39: The number of arcs in a smallest feedback set in a tournament T is equal to the 
number of arcs in a smallest transversal of the cycles in T. (See [BaHulsRoTe95].) 


F40: [BaHulsRoTe95] If R is a smallest feedback set in a tournament T, then every 
arc of R is contained in some 3-cycle of T. 


F41: [BaHulsRoTe95] A digraph D is acyclic if and only if its arc set is a smallest 
feedback set of some tournament. 


Acyclic Subdigraphs and Transitive Sub-Tournaments 


DEFINITION 


D21: A set of arcs in a digraph D is a consistent set of arcs if it induces an acyclic 
subdigraph of D. 


FACTS 


F42: [Sp71/72, Fe83] If g(n) denotes the largest integer so that every n-tournament 
contains a consistent set of g(n) arcs, then there are positive constants c, and cz so 
that $(5) + cyn3/? < g(n) < $(5) + cyn3/2, Moreover, values for n < 12 are as follows 
([Re69, Be72]): 


N 12/3]/4|5|/6|7)8)])9|l)]ll| 12 
gm) [112/41] 6]9| 12 | 20 | 24] 30] 35 | 44-46 


F43: [PaRe70, Ne94, Sa94] If v(m) denotes the largest integer such that every n-tourn- 
ament contains a transitive sub-tournament with at least v(m) vertices, then 


for4<n<7 

for 8<n< 13 
for 14<n< 27 
for 28<n< 31 


v(n) = 


aD oOo FR W 


[logy (16n/7)| < v(m) <2 |logyn| +1 for 32<n< 54 
u(n) > |log(n/55)| +7 for n > 55 


F44: [AoHa98] and [GuGyThWe98] For any tournament score sequence S = (81, 52, 
++; S8n—1, $n), Where 81 < 82 < 83 <... < Sn_1 < Sn, there exists a tournament T 
on vertex set {1,2,...,n} so that the score of vertex i is s; and the sub-tournaments 
of T on both the even and the odd indexed vertices are transitive, i.e., 7 dominates 7 
whenever i > j and i = j (mod 2). (See also [BrSh01] for a shorter proof, and see 
[BaBeHa92] for origins of the result in terms of the so-called cyclic chromatic number 
of a tournament.) 
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Arc-Disjoint Cycles 


NOTATION: For a given tournament T’, a(T’) denotes the maximum number of arc-disjoint 
cycles in T, and c(T’) denotes the number of arcs in a smallest feedback set in T. Also, 
let a(n) = max{a(T)} and c(n) = max{c(T)}, where the maxima are taken over all 
n-tournaments T. 


REMARK 


R7: Note that the quantity a(n) equals the maximum number of edge-disjoint (undi- 
rected) cycles in the complete graph of order n, which has been shown by [ChGeHe7]1] 
to equal |(n/3)|(n — 1)/2] |. 


FACTS 
F45: For any tournament T, a(T) < c(T). 


F46: [BeKo76] For n > 10, a(n) < c(n). That is, for each n > 10, there exists an 
n-tournament T such that a smallest feedback set in JT’ contains more arcs than in a 
largest collection of arc-disjoint cycles in T. (See also [Be75] and the discussions in 
[BaGu01] and [Is95].) 


3.3.5 Kings, Oriented Trees, and Reachability 


Kings arose in an attempt to determine the “strongest” individuals in certain small 
animal societies in which there exists a pairwise “pecking” relationship (see work refer- 
enced in [La53]). The delightful article by Maurer [Ma80] stimulated early interest in the 
topic. Extensions of the idea led to new investigations into combinatorial sub-structures 
in tournaments involving oriented trees and other “reachability sub-structures.” The 
concept of a king appeared independently as “uncovered vertices” in some of the voting 
theory literature (see §3.3.8 below). Moreover, there is current interest in kings and 
generalizations in other digraphs, particularly in multi-partite tournaments (e.g., see 
the discussion in [Re96]). 


DEFINITIONS 


D22: A king in a tournament T is a vertex x such that for every other vertex y, there 
is a 1-path or a 2-path from z to y in T. 


D23: A serf in a tournament is a vertex x such that for every other vertex y, there 
is a 1-path or a 2-path from y to a. 


D24: A tournament is k-stable, k > 1, if every vertex is a king and more than & arcs 
must be reversed in order to reduce the number of kings. 


FACTS 


F47: [Va52] Every tournament contains a king. In fact, every vertex of maximum score 
is a king [La53]. 
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F48: [Ma80] For positive integers k and n, there exists an n-tournament with exactly 
k kings if and only if1<k <n, k #2, and (k,n) 4 (4,4) (see also [Re82]). 


F49: [Re80] For integers 0 < b<s <k <n, there exists an n-tournament with exactly 
k kings, exactly s serfs, and exactly b vertices that are both kings and serfs if and only 
if the following four conditions are satisfied: 

1) k+s—b<n; 
2) s#2Qandk #2; 

3) n=k=s #4 or {k<nand b< s}; 


( 
( 
( 
(4 


) 
) 
) 
) (n,k, s,b) is none of (n,4,3,2), (5,4,1,0), or (7, 6,3, 2). 


F50: [ReBr84] A k-stable tournament must have at least 4k + 3 vertices. Moreover, 
the following three statements are equivalent: 


(1) There exists a k-stable (4k + 3)-tournament. 
(2) There exists a (4k + 4) by (4k + 4) skew-symmetric Hadamard matrix. 


(3) There exists a doubly regular (4k + 3)-tournament. 


F51: [Ma80] In almost all tournaments every vertex is a king. (See also [Mo68].) In 
fact, for each positive integer &, almost all tournaments are k-stable [ReBr84]. 


REMARKS 


R8: Bounds on the least number of vertices that need be adjoined to an n-tournament 
T to form a new super tournament W so that the set of kings in W is exactly the 
vertices of T were described in [Re82] and [Wa84]. The least order of a tournament 
Z in which all vertices of Z are kings and that contains T as a sub-tournament was 
determined in [Re80]. For other work on kings in tournaments the reader is referred to 
the references in [Re96]. 


R9: By definition, a king is the root of a rooted spanning tree of depth at most 2. So, 
it is natural to consider the existence of other oriented trees in tournaments. 


Tournaments Containing Oriented Trees 


TERMINOLOGY: An out-branching (or out-tree) in a digraph is a rooted spanning 
tree, and an in-branching (or in-tree) is an out-branching with all the arcs reversed. 


CONJECTURE 


Sumner’s conjecture (see [ReWo83]): Every (2n — 2)-tournament contains every ori- 
entation of every tree of order n. 
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EXAMPLE 


E11: All of the 8 oriented trees of order 4 are shown in Figure 3.3.5. A copy of each 
can be found in any 6-tournament. 


eke ee: Ge es 


ee -© -© © -© -0 © @ -0-©@ -e© e © -0 -e 


Figure 3.3.5: The oriented trees of order 4. 


REMARK 


R10: Note that no integer smaller than 2n — 2 will suffice in the statement of the 
conjecture, for a score of at least n—1 is required to accommodate the “out-orientation” 
of the tree Ky,,-1; any regular (2n — 3)-tournament fails to have a score of n— 1. Over 
the last 20 years several papers reported partial and related results (see the references 
in [HaTh0Ob}), all of which support the conjecture. 


FACTS 


F52: [HaTh00b] If f(n) denotes the least integer m so that every m-tournament 
contains every orientation of every tree of order n, then f(n) < (7n — 5)/2. (Earlier 
efforts yielded f(m) < 12n [HaTh91] and then f(n) < n—6 [HaTh00b].) 


F53: [LuWaPa00] Every n-tournament, n > 800, contains a spanning rooted 2-tree of 
depth 2 so that, with at most one exception, all vertices that are not a leaf or the root 
have out-degree 2. 


F54: [Pe02] Each rotational (2n + 1)-tournament contains all rooted trees of order 
2n + 1 in which there are at most n branches, each of which is a directed path. 


F55: [Ba91] A tournament T contains an out-branching and an in-branching that are 
arc-disjoint, both rooted at a specified vertex v, if and only if T is strong and for each 
arc (x,y) of T, the digraph T — {(, y)} contains either an out-branching rooted at v or 
an in-branching rooted at v. If T’ is 2-arc-strong, then for every pair of vertices x and 
y, there is an out-branching rooted at x and an arc disjoint in-branching rooted at y. 


Arc-Colorings and Monochromatic Paths 


CONJECTURE 


P. Erdés conjecture (see [SaSaWo82]): for each positive integer k, there is a least 
positive integer s(k) so that every arc-colored tournament involving k colors contains 
a set S of s(k) vertices with the property that for every vertex y not in S, there is a 
monochromatic path from y to some vertex in S. 
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REMARK 


R11: This conjecture considers reachability in tournaments via monochromatic paths. 
Since every tournament contains a hamiltonian path, s(1) = 1. Fact F56 below implies 
that s(2) =1. A certain coloring of the 9-tournament that is the lexicographic product 
of a 3-cycle with a 3-cycle shows that s(3) > 2 (see [SaSaWo82]). In particular, Erdés 
asked if s(3) = 3. It is not even known that s(k) is finite for k > 3. Some progress on 
this conjecture is included below. (See also [LiSa96] and [Re00] for a relaxation of the 
problem and several open questions.) 


FACTS 


F56: [SaSaWo82] If the arcs of a tournament T are colored with two colors, then there 
exists a vertex x in T so that for every vertex y # x in T, there is a monochromatic 
path from y to x. (See [Re84] for another proof.) 


F57: [Sh88] If the arcs of a tournament T are colored with three colors and T does 
not contain a 3-cycle or a transitive sub-tournament of order 3 whose arcs use all three 
colors, then there exists a vertex x in T so that for every vertex y £ x in T, there is a 
monochromatic path from y to «x. 


3.3.6 Domination 


Issues concerning domination have played an important role in the development of 
tournament theory. However, exact results on domination numbers of tournaments are 
scarce. For example, the problem of determining the smallest order of a tournament T 
with domination number y(T') = k for a given integer k has only some partial results. 
Bounds are known, some of which are constructive, but the exact value is known only 
for small values of k. Domination in general (undirected) graphs is discussed in §9.2. 


DEFINITIONS 


D25: A dominating set in a tournament T is a set S of vertices in T such that every 
vertex not in S is dominated by some vertex in S. 


D26: The domination graph of a tournament T is an undirected graph G that has 
the same vertex-set as T, and x is adjacent to y in G whenever {x,y} is a dominating 
set in T. 


D27: A spiked cycle is a connected (undirected) graph with the property that when 
all vertices of degree 1 are removed, a cycle results. 


D28: The domination number of a tournament is the minimum cardinality of a 
dominating set in T, denoted 7(T). 
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EXAMPLE 


E12: Let T denote the transitive n-tournament with vertex-set {1,2,...,} in which j 
dominates 7 whenever j > i. Reversal of the arc (n, 1) yields a strong n-tournament W in 
which vertex 1 can reach vertex n via a 1-path, and 1 can reach vertices 2,3,...,(n—1) 
via 2-paths (through vertex n). So, 1 is a king in W. Since W has no transmitter, 
and every vertex in W is dominated by 1 or n, {1,n} is a dominating set in W and 
y(W) = 2. 


FACTS 


F58: [GrSp71] For a positive integer k, let p denote the smallest prime number greater 
than k?2?*-?, where p = 3 (mod 4). The domination number of the quadratic residue 
p-tournament is greater than k. (See also [ReMcHeHe02].) 


F59: [FiLuMeRe98] The domination graph of a tournament is either a spiked odd 
cycle with or without isolated vertices, or a forest of caterpillars. In particular, the 
domination graph of an n-tournament has at most n edges. Furthermore, any spiked 
odd cycle with or without isolated vertices is the domination graph of some tournament. 


F60: [FiLuMeRe99] A connected graph is the domination graph of a tournament if and 
only if it is either a spiked odd cycle, a star, or a caterpillar whose spine has positive 
length and has at least three vertices of degree 1 adjacent to one end of its spine. The 
tournaments that have a connected domination graph were characterized in [JiLu98]. 


F61: [SzSz65] If T is an n-tournament with n > 2, then the domination number of T 
satisfies y(T’) < logan — loggloggn + 2. 


3.3.7 Tournament Matrices 


Some early work on tournament matrices included the results by Brauer and Gentry 
and by J. W. Moon and N. J. Pullman (see [Mo68] and [BrGe72]); work by H. J. 
Ryser [Ry64] on tournament matrices with given row and column sum that minimize 
the number of upsets, i.e., the number of 1’s above the main diagonal; and work by D. 
R. Fulkerson [Fu65] that described the tournament matrices with prescribed row sums 
that minimize and maximize the number of upsets. R. A. Brualdi and Q. Li [BrLi83] 
continued the upset theme and expanded on the work of Ryser and Fulkerson. These 
last three references may be thought of as papers on ranking since minimizing upsets 
gives rise to orderings of the vertices that minimize the number of losses by stronger 
players to weaker players. 


DEFINITION 


D29: A tournament matrix is a square matrix M = (m,;) of 0’s and 1’s, with 0’s 
on the main diagonal and m;; + m,; = 1, for all distinct i and j. 


TERMINOLOGY: For a given ordering of the vertices, v1, V2,...,Un, of a tournament T, 
the adjacency matrix M = |m,,| of T is the 0-1 matrix given by 


1 if v; dominates v; 


0 otherwise 


214 Chapter 3. Directed Graphs 


Thus, a tournament matrix is the adjacency matrix of some tournament for a given 
ordering of the vertices. 


EXAMPLE 


E13: A tournament matrix of order 6 is shown in Figure 3.3.6. 


Corecoc oS 
See OO 
Sree orr 
SCocrcore 
pe Orecoco 
CORK Se 


Figure 3.3.6: A tournament matrix. 


REMARK 


R12: An elementary observation about an n by n tournament matrix M is that 
M+M't+1, = Jy, where I, is the n by n identity matrix, M* is the transpose matrix 
of M, and J, is the n by n matrix of all 1’s. Moreover, any adjacency matrix M of T 
can be obtained from any other adjacency matrix N of T by permuting the order of the 
vertices used to obtain N, i.e., there is a permutation matrix P such that M = P-!NP. 
Thus, the eigenvalues are the same for all of the tournament matrices corresponding to 
a particular tournament. 


FACTS 


F62: [BrGe68] Let A1,A2,...,An denote the eigenvalues of an n by n tournament 
matrix A, where |A\;| > |A2| >... > |An|. Then 0 < Ay < (n— 1)/2, and |A;| < 
[n(n — 1)/2j|/?, 7 = 2,3,...,n. Moreover, if M is an n by n tournament ma- 
trix and Ay denotes max{|A| : A an eigenvalue of M}, then for odd n, max{Ay : 
M an n by n tournament matrix} is attained by the regular tournament matrices. 


F63: [CaGrkiPuMa92] For all n > 3, each irreducible n by n tournament matrix M has 
at least three distinct eigenvalues; such a matrix has exactly three distinct eigenvalues 
if and only if it is a Hadamard tournament matrix (i.e., M'M = nl). There is an 
irreducible n by n tournament matrix with exactly n distinct eigenvalues. 


F64: [Mi95] If A is an n by n tournament matrix, then the rank of A is equal to 
(n — 1)/2 if and only if n is odd and AA‘ = 0. Equality implies that the characteristic 
of the field divides (n — 1)/2 (without the hypothesis of regularity). Examples of order 
n having rank (n — 1)/2 for n = 1 (mod 4) for fields of characteristic p, where p divides 
(n — 1)/4, can be obtained from doubly regular tournament matrices of order (n — 2) 
by adding an (n — 1)st row of n 0’s (and hence, an (n — 1)st column of all 1’s save for 
the 0 in the (n — 1,n — 1) position) followed by an n” row of (n — 1) consecutive 1’s 
and a 0 in the (n,n) position (and hence, an n‘” column of n 0’s). 
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F65: [Sh92] A tournament matrix is singular if more than one-fourth of the triples 
of vertices in the corresponding tournament induce 3-cycles. All tournament matrices 
realizing a given score sequence are nonsingular if and only if the scores are “sufficiently 
close” to one another. The spectral radius of a singular n by n tournament matrix is 
less than or equal to (1/2)(n — 1), and equality implies that exactly one-fourth of the 
triples of vertices in the corresponding tournament induce 3-cycles. 


3.3.8 Voting 


Work on acyclic digraphs in tournaments, including transitive sub-tournaments, is of 
interest in voting theory since such structures give a measure of group consistency by the 
voters. Readers can find a rich source of problems and issues in selected articles in the 
social choice literature that treats voting theory; particular examples of such literature 
include the periodicals Social Choice and Welfare, Mathematical Social Sciences, Public 
Choice, and The American Journal of Political Science. 


Deciding Who Won 


A central issue in voting theory is to pick a “best” alternative (or subset of the alterna- 
tives) given that voter preferences have been aggregated. A “best” alternative or subset 
of alternatives is called a solution and is thought of as the winners. Several tournament 
solutions have been considered in the literature. Each is to be non-empty, invariant 
under isomorphism, and uniquely the Condorcet winner if there is one in the tourna- 
ment. Some of these solutions are: the vertices of largest score (the Copeland solution), 
vertices based on the maximum eigenvalue of the adjacency matrix of the tournament, 
vertices associated with a Markov method, the Condorcet winner of a transitive tour- 
nament that is “closest” to the given tournament (called the Slater solution), vertices 
that are uncovered relative to a certain covering relation, vertices that are transmitters 
of maximal transitive sub-tournaments (the Banks set), and vertices satisfying a special 
axiomatic formulation (the tournament equilibrium set). These are discussed in detail 
in J.-F. Laslier’s monograph [La97]. 


DEFINITION 


D30: The Condorcet winner is a candidate (or alternative) x such that for every 
other candidate (or alternative) y, x is preferred over y by a majority of the voters. 


D31: The majority digraph D of a set of n-tournaments, all with the same vertex-set 
V, has vertex-set V, and vertex x dominates vertex y in D if and only if « dominates 
y in a majority of the n-tournaments. 


D32: A digraph D is induced by a set of voters if D is the majority digraph based 
on a collection of linear orderings of the vertices of D, exactly one for each voter. (The 
linear orders represent preferences by the voters for the alternatives that are the vertices 
of D. Different voters might have the same linear order.) 


D33: The Condorcet paradoz is that the voters may be consistent in their preferences 
(i.e., each of their rankings of the n candidates is a linear order), but the amalgamation 
of voters’ preferences using majority rule can result in inconsistencies (i.e., cycles in the 
majority digraph). 
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REMARK 


R13: In the definition of a majority digraph, the common vertex set may be thought 
of as a finite set of n “alternatives,” and each n-tournament may be thought of as the 
pairwise preferences of the alternatives by a “voter.” So, the resulting majority digraph 
represents voters’ preferences under majority voting. If there are an odd number of 
voters or there are no ties, then the majority digraph is a tournament. 


EXAMPLE 


E14: Figure 3.3.7 illustrates the majority tournament T of the set of three transitive 
5-tournaments 7), T>, and T3. In the drawings of T), T>, and 73, the long lines directed 
downward mean that each vertex dominates exactly the vertices below it. For example, 
in the second tournament from the left, vertex c dominates exactly vertices a and d, 
and vertex a is dominated by exactly vertices b, e, and c. The existence of cycles in 
the majority digraph illustrates the Condorcet paradox. For instance, it shows that a 
majority of voters prefer a to b, a majority prefer b to c, and yet, a majority prefer c to 
a. 


2@ be de $ 
be ce ce \ 
c@ c@ or " 
de a@ 2@ 
co ' dev vel P 


Figure 3.3.7: The majority digraph of 3 tournaments. 


Tournaments That Are Majority Digraphs 


FACTS 


F66: [St59] Every n-tournament (indeed, every oriented graph of order n) is the 
majority digraph of some collection of n + 1 tournaments, for n odd, and of n + 2 
tournaments, for n even. 


NOTATION: Let m(n) denote the smallest integer such that any n-vertex digraph can 
be induced as a majority digraph by a collection of m(n) or fewer voters, and let g(n) 
denote the smallest integer such that any n-tournament can be induced as a majority 
digraph by a collection of g(n) or fewer voters. 


F67: For large n, m(n) > (.55n/logn) [St59], and there exists a constant c so that 
m(n) < (en/logn) [ErMo64]. 


F68: [Mo68] The integer g(n) is always odd, g(3) = g(4) = 9(5) = 3, g(n4+1) < g(n)42, 
and m(n) < 2g(n). 
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F69: [Ma99] In contrast to the situation for majority tournaments, for any \, 1/2 <A< 
1, there exists an integer n and a labeled n-tournament T, so that for every collection 
C of transitive tournaments on the same label set as T, there is an arc (u,v) of T such 
that the proportion of C in which u dominates v is less than X. In short, T is not the 
A-majority tournament for any collection of transitive n-tournaments. 


Agendas 


DEFINITIONS 


D34: An agenda is an ordered list of alternatives (i-e., an ordered list of the vertices 
of a majority tournament). 


D35: An amendment procedure of voting is a sequential voting process in which, 
given an agenda (a1, d2,...,@n) of alternatives, alternative a1 is pitted against a2 in the 
first vote, then the winner is pitted against a3 in the second vote, then the winner is 
pitted against a4 in the third vote, etc. 


D36: Given a majority n-tournament T and an agenda (a1, da2,...,@n) of alternatives 
given by the vertices of T, the sincere decision is the alternative surviving the last 
vote (i.e., the (n—1)™ vote) in an amendment procedure of voting using majority voting 
at each stage. It is a function of the agenda and T. The decision tree is the spanning, 
rooted subtree of T, rooted at the sincere decision, induced by the n — 1 arcs of T that 
describe the n—1 votes taken in the amendment procedure using T and (a1, a2,...,@n). 


EXAMPLE 


E15: Given the agenda (b,e,c,a,d) and the majority tournament shown in Figure 
3.3.8, alternative a is the sincere decision. The corresponding decision tree rooted at 
vertex a is also shown. 


(b, e, c, a, d) 
d c d Cc 


Figure 3.3.8: Majority tournament, agenda, and decision tree. 


FACT 


F70: [Mi77] For any tournament T’, the set of vertices that can be obtained as the 
sincere decision under an amendment procedure is exactly the set of vertices in the 
initial strong component of T. (For another proof, see [Re91a].) 
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Division Trees and Sophisticated Decisions 


DEFINITIONS 


D37: Given an agenda (a1,a2,...,a@n), the division tree of (a1,a2,...,@n) is the 
labeled, balanced, binary, rooted tree on 2” — 1 vertices labeled by non-empty sub- 
sequences of the agenda (a1, d2,...,@,); the root is labeled (a;,a2,...,a@,); and, for 
0<j<n-—2, a vertex at level 7 which is labeled by a subsequence of (a1, a2,...,@n), 
say (b1, bo, b3,...,6,—-;), dominates exactly two vertices at level 7 + 1, one labeled 
(b1, bs, eae On—5), and one labeled (bo, bs, seey bn—j): 


D38: Let T be a majority n-tournament and let (a1,a2,...,@,) be an agenda of 
alternatives given by the vertices of T. The sophisticated decision is the anticipated 
decision at the root of the division tree relative to (a1, a2,...,@n,) and T, where the 


anticipated decision at each vertex at level n — 2 of the division tree is the majority 
choice in T’ between the two alternatives that make up the ordered pair labeling that 
vertex in the division tree; and inductively, for 0 < 7 < n— 2, the anticipated decision 
at each vertex vu of level 7 in the division tree is the majority choice in T’ between the 
anticipated decisions at the two vertices at level 7 + 1 that are dominated by v. 


FACTS 


F71: [Ba85] The set of vertices in a tournament T that can be obtained as the sophis- 
ticated voting decision under an amendment procedure relative to some agenda consists 
of those vertices of T that are transmitters of maximal transitive sub-tournaments of T’. 


F72: No alternative is unanimously preferred to the sophisticated voting decision. 
(Observed in [Mi77] and [Mi80] and proved in [Re91b].) 


F73: [Re97] A tournament T admits an agenda for which the sincere voting decision and 
the sophisticated voting decision are identical if and only if the initial strong component 
of T is not a 3-cycle. As a result, asymptotically, most tournaments admit such an 
agenda. 


EXAMPLES 


E16: The division tree of the agenda (x,y,z) is shown in Figure 3.3.9. Given the 
majority tournament shown, the anticipated decisions at levels 1 and 0 of the division 
tree are underlined in the vertex labels. The anticipated decision at the root is y, so 
y is the sophisticated decision relative to this tournament and agenda. Note that the 
sincere decision is z, which illustrates Fact F73. 


xyz 


(x, y> z) XZ MZ 


Figure 3.3.9: Agenda, division tree, and majority tournament. 
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E17: The majority 4-tournament shown in Figure 3.3.10 illustrates the positive case 
for Fact F73. As before, the anticipated decisions are underlined. For the agenda 
(y,v,u, x), the sincere decision and the sophisticated decision are both uw. 


yvux 


(y, Vv, U, x) 


Figure 3.3.10: A majority 4-tournament. 


Inductively Determining the Sophisticated Decision 


The following result yields an algorithm for determining the sophisticated decision that 
is much more straightforward than using the definition. (Recall that I(z) denotes the 
in-set of vertex z.) 


FACT 


F74: [ShWe84] Let T be a majority n-tournament and let (a;,a2,...,@,) denote an 
agenda composed of the alternatives that make up the vertices of T. Inductively define 
the sequence (21, 22,.--,2n) as follows: 2, = an, and for 1 <j <n, 


n 
a; ifa;€ f) I(%) 
a= i=jtl 
2341 Otherwise 


Then z, is the sophisticated decision. 


References 


Ad64] A. Adam, Problem, 12-18 in Theory of Graphs and Applications, Proc. Coll. 
Smolenice, Czech. Acad. Sci. Publ., 1964. 


Al67] B. Alspach, Cycles of each length in regular tournaments, Canad. Math. Bull. 
10 (1967), 283-285. 


AlRe78] B. Alspach and K. B. Reid, Degree frequencies in digraphs and tournaments, 
J. Graph Theory 2 (1978), 241-249. 


AlReRo74] B. Alspach, K. B. Reid, and D.P. Roselle, Bypasses in asymmetric digraphs, 
J. Combin. Theory B 17 (1974), 11-18. 


AlTa82] B. Alspach and C. Tabib, A note on the number of 4-circuits in a tournament, 
Annals Discrete Math. 12 (1982), 13-19. 


220 Chapter 3. Directed Graphs 
AoHa98] S. Ao and D. Hanson, Score vectors and tournaments with cyclic chromatic 
number 1 or 2, Ars Combin. 49 (1998), 185-191. 


Av80] P. Avery, Condition of a score sequence to be simple, J. Graph Theory 4 (1980), 
157-164. 


Ba85| J. S. Banks, Sophisticated voting outcomes and agenda control, Social Choice 
and Welfare 1 (1985), 295-306. 


Ba91] J. Bang-Jensen, Edge-disjoint in- and out-branchings in tournaments and related 
path problems, J. Combin. Theory B 51 (1991), 1-23. 


BaBeHa92] K. S. Bagga, L. W. Beineke, and F. Harary, Two problems on coloring 
tournaments, Vishrva Internat. J. Graph Theory 1 (1992), 83-94. 


BaGu00] J. Bang-Jensen, Y. Guo, and A. Yeo, Complementary cycles containing pre- 
scribed vertices in tournaments, Discrete Math. 214 (2000), 77-87. 


BaGu0l] J. Bang-Jensen and G. Gutin, Digraphs: Theory, Algorithms, and Applica- 
tions, Springer, 2001. 


BaGu96] J. Bang-Jensen and G. Gutin, Paths, trees, and cycles in tournament, Con- 
gressus Numer. 115 (1996), 131-170. 


BaGu98] J. Bang-Jensen and G. Gutin, Generalizations of tournaments: a survey, J. 
Graph Theory 28 (1998), 171-202. 


BaHulsRoTe95] J.-P. Barthélémy, O. Hurdy, G. Isaak, F. S. Roberts, and B. Tesman, 
The reversing number of a digraph, Discrete Appl. Math. (1995), 39-76. 


Be72] J. C. Bermond, Ordes a distance minimum d’un tournoi et graphes partiels sans 
circuits maximal, Math. Inform. Sci. Humaines 37 (1972), 5-25. 


Be75| J.-C. Bermond, The circuit hypergraph of a tournament, 165-180 in Infinite and 
Finite Sets, North Holland, 1975. 


Be81] L. W. Beineke, A tour through tournaments or bipartite and ordinary tour- 
naments: a comparative survey, 41-55 in Combinatorics; Swansea, 1981, London 
Math. Soc. Lecture Notes 52, Cambridge University Press, Cambridge, 1981. 


BeKo76] J.-C. Bermond and Y. Kodratoff, Une heuristic pour le calcul de l’indice de 
transitivite d’un tournoi, R. A. I. R. O. Informatique Theorique 10 (1976), 83-92. 


BeTh81] J.-C. Bermond and C. Thomassen, Cycles in digraphs: a survey, J. Graph 
Theory 5 (1981) 1-43. 


BeWi75] L. W. Beineke and R. J. Wilson, A survey of recent results on tournaments, 
31-48 in Recent Advances in Graph Theory, Proc. Symp. Prague 1974, Academia, 
1975. 


[BrGe68] A. Brauer and I. C. Gentry, On the characteristic roots of tournament matri- 
ces, Bull. Amer. Math. Soc. 74 (1968), 1133-1135. 


[BrGe72] A. Brauer and I. C. Gentry, Some remarks on tournament matrices, Linear 
Algebra Appl. 5 (1972), 311-318. 


Section 3.3. Tournaments 221 


BrLi83] R. A. Brualdi and Q. Li, Upsets in round robin tournaments, J. Combin. 
Theory B (1983), 62-77. 


BrQi84] R. A. Brualdi and Q. Li, The interchange graph of tournaments with the same 
score vector, 129-151 in Progress in Graph Theory, Academic Press, 1984. 


BrSh01] R. A. Brualdi and J. Shen, Landau’s inequalities for tournament scores and 
a short proof of a theorem on transitive sub-tournaments, J. Graph Theory 38 
(2001), 244-254. 


[Ca59] P. Camion, Chemins et circuits hamiltonioens des graphes complets, C. R. Acad. 
Sci. Paris 249 (1959), 2151-2152. 


[CaGrKiPuMa92] D. de Caen, D. A. Gregory, S. J. Kirkland, N. J. Pullman, and J. S. 
Maybe, Algebraic multiplicity of the eigenvalues of a tournament matrix, Linear 
Algebra Appl. 169 (1992), 179-193. 


ChGeHe71] G. Chartrand, D. Geller, and S. Hedetniemi, Graphs with forbidden sub- 
graphs, J. Combin. Theory B 10 (1971), 12-41. 


ChGoLi01] G. Chen, R. J. Gould, and H. Li, Partitioning vertices of a tournament into 
independent cycles, J. Combin. Theory B 83 (2001), 213-220. 


ChGuHuWo97] I. Charon-Fournier, A. Guénoche, O. Hudry, and F. Woirgard, New 
results on the computation of median orders, Discrete Math. 165/166 (1997), 139— 
153. 


ChHuWo96] I. Charon-Fournier, O. Hudry, and F. Woirgard, Ordres medians et orders 
de Slater des tournois, Math. Inform. Sci. Humaines 133 (1996), 23-56. 


Da5d4] R. L. Davis, Structures of dominance relations, Bull. Math. Biophys. 16 (1954), 
131-140. 


ErMo64] P. Erdés and L. Moser, On the representation of directed graphs as unions of 
orderings, Publ. Math. Inst. Hung. Acad. Sci. 9 (1964), 125-132. 


Fe83] W. Fernandez de la Vega, On the maximum cardinality of a consistent set of arcs 
in a random tournament, J. Combin. Theory B (1983), 328-332. 


Fi96] D. C. Fisher, Squaring a tournament: proof of Dean’s conjecture, J. Graph 
Theory 23 (1996), 43-48. 


FiLuMeRe98] D. C. Fisher, J. R. Lundgren, S$. Merz, and K. B. Reid, The domination 
and competition graphs of a tournament, J. Graph Theory 29 (1998), 103-110. 


FilLuMeRe99] D. C. Fisher, J. R. Lundgren, S$. Merz, and K. B. Reid, Connected dom- 
ination graphs of tournaments, J. Comb. Math. and Combin. Comput. 31 (1999), 
169-176. 


[FrTh87] P. Fraise and C. Thomassen, Hamiltonian dicycles avoiding prescribed arcs in 
tournaments, Graphs and Combin. 3 (1987), 239-250. 


[Fu65] D. R. Fulkerson, Upsets in round robin tournaments, Canad. J. Math. 17 (1965), 
957-969. 


222 Chapter 3. Directed Graphs 
Ga72] M. R. Gary, On enumerating tournaments that admit exactly one Hamiltonian 
circuit, J. Combin. Theory B 13 (1972), 266-269. 


Gr71] B. Griinbaum, Anti-directed Hamiltonian paths in tournaments, J. Combin. 
Theory B 11 (1971), 249-257. 


GrRe99] J. R. Griggs and K. B. Reid, Landau’s Theorem revisited, Australasian J. 
Combin. 20 (1999), 19-24. 


GrSp71] R. L. Graham and J. H. Spencer, A constructive solution to a tournament 
problem, Canad. Math. Bull. 14 (1971), 45-48. 


Gu95] G. Gutin, Cycles and paths in semicomplete multipartite digraphs, theorems, 
and algorithms: a survey, J. Graph Theory 19 (1995), 481-505. 


GuGyThWe98] B. Guiduli, A. Gyarfas, S. Thomassé, and P. Weidl, 2-partition-trans- 
itive tournaments, J. Combin. Theory B 72 (1998), 181-196. 


GuVo97] Y. Guo and L. Volkman, Bypasses in tournaments, Discrete Appl. Math. 79 
(1997), 127-135. 


Ha00a] F. Havet, Oriented Hamiltonian cycles in tournaments, J. Combin. Theory B 
(2000), 1-31. 


Ha93] R. Haggkvist, Hamiltonian cycles in oriented graphs, Combin. Probab. Comput. 
2 (1993), 25-32. 


HaMo66] F. Harary and L. Moser, The theory of round robin tournaments, Amer. 
Math. Monthly 73 (1966), 231-246. 


HaNoCa65] F. Harary, R. Z. Norman, and D. Cartwright, Structural Models: An 
Introduction to the Theory of Directed Graphs, John Wiley & Sons, 1965. 


HaTh00a] F. Havet and S. Thomassé, Oriented Hamiltonian paths in tournaments: a 
proof of Rosenfeld’s conjecture, J. Combin. Theory B 78 (2000), 243-273. 


HaTh00b] F. Havet and S. Thomassé, Median orders of tournaments: a tool for the 
second neighborhood problem and Sumner’s conjecture, J. Graph Theory 35 (2000), 
244-256. 


[HaTh91] R. Haggkvist and A. Thomason, Trees in tournaments, Combinatorica 11 
(1991), 123-130. 


HaTh97] R. Haggkvist and A. Thomason, Oriented Hamiltonian cycles in oriented 
gs y' 
graphs, 339-353 in Combinatorics, Geometry, and Probability (Cambridge, 1993), 
Cambridge University Press, 1997. 


Is95] G. Isaak, Tournaments as feedback arc sets, Electronic J. Combin. 2 (1995), 19. 


Ja72] O. S. Jakobsen, Cycles and paths in tournaments, Thesis, University of Aarhus, 
1972. 


JiLu98] G. Jimenez and J. R. Lundgren, Tournaments which yield connected domina- 
tion graphs, Congressus Numer. 131 (1998), 123-133. 


La53] H. G. Landau, On dominance relations and the structure of animal societies. IIT. 
The condition for a score structure, Bull. Math. Biophys. 15 (1953), 143-148. 


Section 3.3. Tournaments 223 


La97] J.-F. Laslier, Tournament Solutions and Majority Voting, Springer, 1997. 


LiSa96] V. Linek and B. Sands, A note on paths in edge-colored tournaments, Ars 
Comb. 44 (1996), 225-228. 


LuWaPa00] X. Lu, D.-W. Wang, and J. Pan, Rooted spanning trees in tournaments, 
Graphs and Combinatorics 16 (2000), 411-427. 


Ma8g0] S. Maurer, The king chicken theorems, Math. Mag. 53 (1980), 67-80. 
Ma99] J. Mala, On A-majority voting paradoxes, Math. Soc. Sci. 37 (1999), 39-44. 


Ma92] Y. Manoussakis, A linear-time algorithm for finding Hamiltonian cycles in tour- 
naments, Discrete Appl. Math. 36 (1992), 199-201. 


Mi77| N. R. Miller, Graph-theoretical approaches to the theory of voting, Amer. J. 
Polit. Sci. 21 (1977), 769-803. 


Mi80] N. R. Miller, A new solution set for tournaments and majority voting: further 
graph-theoretical approaches to the theory of voting, Amer. J. Polit. Sci. 24 (1980), 
68-96. 


Mi95] T. S. Michael, The ranks of tournament matrices, Amer. Math. Monthly 102 
(1995), 637-639. 


Mo68] J. W. Moon, Topics on Tournaments, Holt, Rinehart, and Winston, 1968. 


MoMo62] J. W. Moon and L. Moser, Almost all tournaments are irreducible, Canad. 
Math. Bull. 5 (1962), 61-65. 


Ne94] V. Neumann Lara, A short proof of a theorem of Reid and Parker on tourna- 
ments, Graphs and Combin. 10 (1994), 363-366. 


PaRe70] E. T. Parker and K. B. Reid, Disproof of a conjecture of Erdés and Moser on 
tournaments, J. Combin. Theory 9 (1970), 225-238. 


Pe02] V. Petrovic, Claws in rotational tournaments, Graphs and Combin. (2002), 591— 
596. 


Pe84] V. Petrovic, Antidirected Hamiltonian circuits in tournaments, 259-269 in Graph 
Theory, Novi Sad, 1983, Univ. Novi Sad, Novi Sad, 1984. 


Re00| K. B. Reid, Monotone reachability in arc-colored tournaments, Congressus Nu- 
mer. 146 (2000), 131-141. 


Re34| L. Rédei, Ein kombinatorischer satz, Acta Litterarum ac Scientiarum, Szeged. 7 
(1934), 39-43. 


Re69] K. B. Reid, On sets of arcs containing no cycles in a tournament, Canad. Math. 
Bull. 12 (1969), 261-267. 


Re73] K. B. Reid, Equivalence of n-tournaments via k-path reversals, Discrete Math. 
6 (1973), 263-280. 


Re80| K. B. Reid, Tournaments with prescribed numbers of kings and serfs, Congressus 
Num. 29 (1980), 809-826. 


224 Chapter 3. Directed Graphs 


[Re82] K. B. Reid, Every vertex a king, Discrete Math. 38 (1982), 93-98. 


[Re84] K. B. Reid, Monochromatic reachability, complementary cycles and single arc re- 
versals in tournaments, Springer-Verlag Lecture Notes in Mathematics 1073 (1984), 
11-21. 


Re85| K. B. Reid, Two complementary cycles in two-connected tournaments, 171-211 
in Cycles in Graphs, vol. 115 of North-Holland Math. Stud., North-Holland, 1985. 


Re91la] K. B. Reid, Majority tournaments: Sincere and sophisticated voting decisions 
under amendment procedure, Math. Soc. Sci. 21 (1991), 1-19. 


Re91b] K. B. Reid, The relationship between two algorithms for decisions via sophis- 
ticated majority voting with an agenda, Discrete Appl. Math. 31 (1991), 23-28. 


Re96] K. B. Reid, Tournaments: scores, kings, generalizations and special topics, Con- 
gressus Numer. 115 (1996), 171-211. 


Re97| K. B. Reid, Equitable agendas: agendas ensuring identical sincere and sophisti- 
cated voting decisions, Social Choice and Welfare 14 (1997), 363-378. 


ReBe79| K. B. Reid and L. W. Beineke, Tournaments, 169-204 in Selected Topics in 
Graph Theory, Academic Press, London, 1979. 


ReBr72] K. B. Reid and E. Brown, Doubly regular tournaments are equivalent to skew 
Hadamard matrices, J. Combin. Theory 12 (1972), 332-338. 


ReBr84] K. B. Reid and M. F. Bridgland, Stability of kings in tournaments, 117-128 
in Progress in Graph Theory, Academic Press, 1984. 


ReMcHeHe02] K. B. Reid, A. A. McRae, S. M. Hedetniemi, and S. T. Hedetniemi, 
Domination and irredundance in tournaments, preprint, 2002. 


ReWo83] K. B. Reid and N. Wormald, Embedding oriented n-trees in tournaments, 
Studia Sci. Math. Hungarica 18 (1983), 377-387. 


Ro74] M. Rosenfeld, Anti-directed Hamiltonian circuits in tournaments, J. Combin. 
Theory B 16 (1974), 234-242. 


Ry64] H. J. Ryser, Matrices of zeros and ones in combinatorial mathematics, 103-124 
in Recent Advances in Matrix Theory, Univ. Wisconsin Press, 1964. 


Sa94] A. Sdnchez-Flores, On tournaments and their largest transitive subtournaments, 
Graphs and Combin. 10 (1994), 367-376. 


SaSaWo82] B. Sands, N. Sauer, and R. Woodrow, On monochromatic paths in edge- 
colored digraphs, J. Combin. Theory B 45 (1982), 108-111. 


Sh88] M. Shen, On monochromatic paths in m-coloured tournaments, J. Combin. The- 
ory B (1988), 108-111. 


Sh92] B. L. Shader, On tournament matrices, Linear Algebra Appl. 162/164 (1992), 
335-368. 


ShWe84] K. A. Shepsle and B. Weingast, Uncovered sets and sophisticated voting 
outcomes with implications for agenda institutions, Amer. J. Polit. Sci. 28 (1984), 
49-74. 


Section 3.3. Tournaments 225 


[ShYu01] I. Sharon (I. Charon-Fournier) and O. Yudri (O. Hudry), The branch and 
bound method for solving the problem of linear ordering on weighted tournaments 
(Russian), Diskretn. Anal. Issled. Oper. 8 (2001), 73-91. 


S093] Z. M. Song, Complementary cycles of all lengths in tournaments, J. Combin. 
Theory B (1993), 18-25. 


Sp71/72] J. Spencer, Optimal ranking of tournaments, Networks 1 (1971/72), 135-138. 
St59) R. Stearns, The voting problem, Amer. Math. Monthly 66 (1959), 761-763. 


SzSz65] E. Szekeres and G. Szekeres, On a problem of Schiitte and Erdés, Math. Gaz. 
49 (1965), 290-293. 


Te98] P. Tetali, A characterization of unique tournaments, J. Combin. Theory B 72 
(1998), 157-159. 


Th73] C. Thomassen, Antidirected Hamiltonian circuits and paths in tournaments, 
Math. Ann. 201 (1973), 231-238. 


Th80] C. Thomassen, Hamiltonian-connected tournaments, J. Combin. Theory B 28 
(1980), 142-163. 


Th82|] C. Thomassen, Edge-disjoint Hamiltonian paths and cycles in tournaments, 
Proc. London Math. Soc. 45 (1982), 151-168. 


Th84] C. Thomassen, Connectivity in tournaments, 305-313 in Graph Theory and 
Combinatorics, Cambridge, 1983, Academic Press, 1984. 


Th85] C. Thomassen, Hamiltonian circuits in regular tournaments, Annals Discrete 
Math. 27 (1985), 159-162. 


Th86] A. Thomason, Paths and cycles in tournaments, Trans. Amer. Math. Soc. 296 
(1986), 167-180. 


Th88] C. Thomassen, Arc reversals in tournaments, Discrete Math. 71 (1988), 73-86. 


Va52] H. E. Vaughan, On well-ordered subsets and maximal elements of ordered sets, 
Pacific J. Math. 2 (1952), 407-412. 


Vo02] L. Volkman, Cycles in multipartite tournaments: results and problems, Discrete 
Math. 245 (2002), 19-53. 


Wa84] K. Wayland, Getting your chickens elected, Congressus Numer. 45 (1984), 311— 
318. 


Ya88| T. X. Yao, Reid’s conjecture on score sets in tournaments (in Chinese), Kexue 
Tongbao 33 (1988), 481-484. 


Ya89| T. X. Yao, On Reid conjecture of score sets for tournaments, Chinese Sci. Bull. 
34 (1989), 804-808. 


Zh80] C. Q. Zhang, Every regular tournament has two arc-disjoint Hamiltonian cycles, 
J. Qufu Normal College, Special Issue Oper. Res. (1980), 70-81. 


ZhSo91] K.-M. Zhang and Z.-S. Song, Cycles in digraphs — a survey, J. Nanjing Univ., 
Natural Sci. 27 (1991) 188-215. 


226 Chapter 3. Directed Graphs 


Glossary for Chapter 3 


Activity on Arc: a digraph scheduling model in which arcs represent subtasks to be 
scheduled as part of a large project. 
Activity on Node: a digraph scheduling model in which nodes represent subtasks to 
be scheduled as part of a large project. 
adjacency matrix — of a digraph: the |V| x |V| matrix in which the ij entry is the 
number of arcs from v; to v;. 
agenda — in voting: an ordered list of alternatives (i.e., an ordered list of the vertices 
of a majority tournament). 
almost regular tournament (or near regular): see regular tournament. 
amendment procedure — for voting: a sequential voting process in which, given an 
agenda (a1, d2,...,@n) of alternatives, alternative a; is pitted against a2 in the first 
vote, then the winner is pitted against ag in the second vote, then the winner is 
pitted against a, in the third vote, etc. 
ancestor of a vertex v — in a rooted tree: a vertex that lies on the unique path from 
v to the root; see also descendant. 
anti-directed cycle — in a digraph D: a sequence of arcs that forms a cycle in the 
underlying graph of D but does not contain a directed path of length 2 in D. 
anti-directed path — in a digraph D: a sequence of arcs that forms a path in the 
underlying graph of D but does not contain a directed path of length 2 in D. 
antisymmetric relation R: one in which, for all x,y, if Ry and yRa, then x = y. 
AoA: Activity on Arc. 
AON: Activity on Node. 
arc: see directed edge. 
arc-cut: synonym for edge-cut. 
basis of a digraph: a minimal set of vertices such that every other vertex can be 
reached from some vertex in this set by a directed path. 
beats: synonym for dominates. 
binary tree: an ordered tree in which each vertex has at most two children, and each 
child is designated either a left-child or a right-child. 
__, balanced: a binary tree such that for every vertex, the number of vertices in its 
left and right subtrees differ by at most one. 
binary-search tree (BST): a binary tree, each of whose vertices is assigned a key, such 
that the key assigned to any vertex v is greater than the key at each vertex in the 
left subtree of v, and is less than the key at each vertex in the right subtree of v. 
child of vertex v — in a rooted tree: a vertex to which there is an edge from v; see 
also parent. 
__, left — in a binary tree: a child which is designated to be on the left, whether or 
not there is another child. 
__, right — in a binary tree: a child which is designated to be on the right, whether 
or not there is another child. 
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comparability digraph — of a poset (X, <): the digraph with vertex set X such that 
there is an arc from «x to y if and only if x X y. 

comparable elements — of a poset (X,~<): elements x,y such that either x < y or 
yru. 

complete m-ary tree: an m-ary tree in which every internal vertex has exactly m 
children and all leaves are at the same level. 

complete digraph: a simple digraph such that between each pair of its vertices, there 
is an arc in both directions. 

condensation — of a digraph G whose strong components are $1, S2,...,S;: a digraph 
G* with vertex-set Vg» = {51,52,...,5,} such that (s;,s;) € E(G*) if and only if 
there is an arc in digraph G from a vertex in component 5S; to a vertex in component 


fore 
condensation of a tournament T: a tournament T* whose vertex-set {u1, U2,..., UK} 
corresponds to a vertex partition {V,, Vo,...,Vz} of V(T), where each V; induces a 


maximal strongly connected sub-tournament of T’, and in which vertex u; dominates 
u; whenever all of the vertices in V; dominate all of the vertices in V; in T. 

Condorcet paradox: the possibility that the voters may be consistent in their pref- 
erences (i.e., each of their rankings of the n candidates is a linear order), but the 
amalgamation of voters’ preferences using majority rule can result in inconsistencies 
(i.e., cycles in the majority digraph). 

Condorcet winner — in voting: a candidate (or alternative) x such that for every 
other candidate (or alternative) y, x is preferred over y by a majority of the voters. 

connectivity (or vertex-connectivity) — of a non-complete digraph: the minimum size 
of a vertex subset S such that G— S is neither strongly connected nor the trivial 
digraph. (The connectivity of a complete n-vertex digraph is n— 1.) Denoted k,,(G) 
or «(G). Synonyms for vertex-cut are cut and disconnecting set. 

consistent set of arcs — in a digraph D: aset of arcs that induces an acyclic subdigraph 
of D. 

cost flow network: see network. 

cover graph — of a poset (X,~<): the graph with vertex set X such that x,y are 
adjacent if and only if one of them covers the other. 

covering — in a poset (X, xX): the element y covers the element x if x < y and there is 
no element z such that x ~ z < y. 

CPM: Critical Path Method. 

Critical Path Method: a method for scheduling models where subtasks have fixed 
times and precedence is known. The whole project is modeled as an AoA or AoN 
digraph, and a single-pass iterative algorithm is used to find the longest path from 
start to finish. 

DAG: a directed acyclic graph. 

decision tree — for a sincere decision corresponding to a given majority n-tournament 
T and an agenda (a1, d2,...,@,) of alternatives: the spanning, rooted subtree of T, 
rooted at the sincere decision, induced by the n — 1 arcs of T’ which describe the 
n —1 votes taken in the amendment procedure using T and (aj, a2,...,@n). 

depth (or level) — of a vertex v in a rooted tree: the length (i-e., number of arcs) of the 
unique directed path from the root to v. 

descendant of a vertex v — in a rooted tree: a vertex w such that v is on the unique 
path from the root to w; the vertex v is called an ancestor of w. 
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digraph: a directed graph. 
, acyclic: a digraph with no directed cycles, i.e., a directed acyclic graph, a DAG. 
, connected: a digraph whose underlying graph is connected. The term weakly 
connected is also used. 
, representation of a relation R on a finite set S: the digraph whose vertices 
correspond to the elements of S, and whose arcs correspond to the ordered pairs 
in the relation. 
, weak: short form of weakly connected digraph. 
, weakly connected: a digraph whose underlying graph is connected; synonym 
for connected digraph. 
directed acyclic graph: a digraph without directed cycles. 
Directed Chinese Postman Problem: to find a minimum-weight postman tour in 
a given weighted digraph. 

directed cycle: a closed directed path. 

directed edge (or arc): an edge e, one of whose endpoints is designated as the tail, 
and whose other endpoint is designated as the head. In a line drawing, the arrow 
points toward the head. 

directed path: a path in a digraph or partial digraph in which all edges are oriented 

in the same direction. 

directed tree: a digraph whose underlying graph is a tree. 

directed walk — from vo to vy: an alternating sequence (vo, €1, V1, €2; ---; Un—1; n; Un) 

of vertices and arcs, such that tail(e;) = uj-1 and head(e;) = u;, for 1 = 1,2...,n. 
Also called a vp-v, directed walk. 
division tree — in voting: see $3.3, Definition D36. 
dominating set — in a tournament T: a set S of vertices in T’ such that every vertex 
not in S is dominated by some vertex in S. 
domination (or beating)— a vertex y in a tournament: a property that a vertex x has 
if there is an arc from & to y. 
__, graph — of a tournament 7: an undirected graph G that has the same vertex-set 
as T, and z is adjacent to y in G whenever {x,y} is a dominating set in T. 
__, number — of a tournament: the minimum cardinality of a dominating set in T; 
denoted ¥(T). 
doubly-regular tournament: see regular tournament. 
edge-connectivity — of a non-trivial digraph: the minimum size of an edge subset F’ 
such that G — F is not strongly connected. Denoted «.(G) or X(G). 

edge-cut (or arc-cut) — in a strongly connected digraph: an arc subset whose deletion 
results in a digraph that is not strongly connected. Synonyms are edge-disconnecting 
set, arc-disconnecting set, and cut-set. 

eulerian tour of a digraph G: a closed directed walk that uses each arc exactly once. 

feedback set of arcs — in a tournament T: a set S of arcs such that the digraph T—S 

contains no cycle. 
flow network: see network. 
frontier arc — relative to a rooted tree T in a digraph: an arc whose tail is in T and 
whose head is not in T. 

functional graph: a digraph in which each vertex has outdegree 1. 

hamiltonian cycle (or spanning cycle) — in a digraph D: a cycle that includes all 
vertices of D. 

hamiltonian path (or spanning path) — in a digraph D: a directed path that includes 
all vertices of D. 
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Hasse diagram — of a poset: a straight-line drawing of the cover graph such that the 
lesser element of each adjacent pair is lower in the drawing. 

head: see directed edge. 

height — of a rooted tree: the length of a longest path from the root. 

in-branching (or in-tree) — in a digraph: a rooted spanning tree with all the arcs 
reversed. 

in-score — of a vertex v in a tournament JT: the number of vertices that dominate v 
(i.e., its indegree; denoted d;(v) (or d~(v) when T is understood). 

in-set — of a vertex x in a digraph D: the set of all vertices that dominate x; denoted 
I(x). 

internal vertex — in a tree or rooted tree: a non-leaf. 

in-tree: synonym for in-branching. 

k-strong tournament: see strong tournament. 

king — in a tournament 7: a vertex x such that for every other vertex y, there is a 
1-path or a 2-path from z to y in T. 

leaf — in a rooted tree: a vertex with outdegree 0. 

left subtree — of a vertex v in a binary tree: the binary subtree spanning the left-child 
of v and all of its descendants. 

length of a directed walk: the number of arc-steps in the walk sequence. 

level of a vertex — in a rooted tree: synonym for depth. 

linear extension ordering — of a digraph: a consecutive numbering of the vertices as 
U1, V2,---,Un so that all arcs go from lower-numbered to higher-numbered vertices. 

linear ordering: a consecutive numbering. 

m-ary tree: see rooted tree. 

majority digraph D — of a set of n-tournaments, all with the same vertex-set V: a 
digraph with vertex-set V and such that vertex x dominates vertex y in D if and 
only if x dominates y in a majority of the n-tournaments. 

Markov digraph: a complete digraph with a self-loop at each vertex and whose arcs 
are assigned probabilities such that the out-probabilities at each vertex sum to one; 
models a stationary Markov chain. 

maximum-flow problem: to determine the maximum flow that can be moved through 
an s-t network from source s to sink t such that the flow into each intermediate node 
equals the flow out (conservation of flow) and the flow across any arc does not exceed 
the capacity of that arc. 

minimum-cost-flow problem: to find an assignment of flows on the arcs of the 
flow network that satisfy the supply and demand (negative supply) requirements at 
minimum cost. 

mutually reachable vertices — in a digraph G: vertices that have a directed walk 
from one to the other and vice versa. Every vertex is regarded as mutually reachable 
with itself (via the trivial walk). 

neighborhood: see out-set. 

network: a digraph G = (V,£) used to model a variety of network flow problems; 
vertices might have supply or demand, and arcs might have capacities and or flow 
costs. 

__, s-t flow: a network G = (V,E,cap,s,t) with a nonnegative capacity function 
cap: EF + N, a distinguished vertex s, called the source, with nonzero outdegree, 
and a distinguished vertex t, called the sink, with nonzero indegree. 
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___, capacitated cost flow G = (V, E,cap,c,b): a directed graph with vertex-set V, 
arc-set FE, a nonnegative capacity function cap: E — N, a linear cost function 


c: E> Z, and an integral supply vector b: V > Z that satisfies }> b(w) =0. 
wEeVv 
__, cost flow: a network G = (V,E,cap,c,b) with nonnegative capacity function 
cap: EF —+ N, a linear cost function c: E — Z, and an integral supply vector 
b: V > Z that satisfies S* b(w) = 0. 
weVv 
order of a tournament: the number of vertices it contains. A tournament of order n 


is an n-tournament. 

ordered tree: a rooted tree in which the children of each vertex are assigned a fixed 
ordering. 

ordering: a linear ordering. 

orientation — of a graph: an assignment of directions to its edges, thereby making it a 
digraph. 

oriented graph: a digraph obtained by choosing an orientation for each edge of an 
undirected simple graph. 

out-branching (or out-tree) — in a digraph: synonym for rooted spanning tree. 

out-set (or neighborhood) — of a vertex x in a digraph D: the set of all vertices that x 
dominates; denoted O(x) or N*(a) (or with a subscripted “D” if necessary). 

out-tree: a rooted tree, especially when the arc directions are shown explicitly. 

parent of a vertex w — in a rooted tree: a vertex v that immediately precedes w on 
the path from the root to w; also, w is the child of v. 

partial order: a binary relation < on a set X that is reflexive, antisymmetric, and 
transitive. 

partially ordered set: a pair (X,~<) consisting of a set X and a partial order < on 
Xx. 

path in a digraph: a directed path. 

, k-: a directed path of length k. 

poset: a partially ordered set. 

postman tour (or covering walk): a closed directed walk that uses each arc at least 
once. 

proper arc-coloring — of a digraph: an assignment of colors to the arcs such that any 
two arcs that have an endpoint in common are assigned different colors. 

receiver — in a tournament: a vertex that is dominated by every other vertex in a 
tournament. 

reflexive relation R: one in which, for all 2, rRz. 

regular tournament: a tournament T in which all scores are the same. 

__, almost (or near): a tournament T in which mae 112" (3) —d-(v)|} =1. 
Vv 


__, doubly-: a tournament in which all pairs of vertices jointly dominate the same 
number of vertices (i.e., there is an integer k so that |O(x) N O(y)| = &, for all 
distinct pairs of vertices x and y in T)). 

right child — in a binary tree: a child which is designated to be on the right, whether 
or not there is another child. 

right subtree — of a vertex v in a binary tree: the binary subtree spanning the right- 
child of v and all of its descendants. 

root: see rooted tree. 
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rooted tree: a directed tree having a distinguished vertex r, called the root, such that 
for every other vertex v, there is a directed r-v path. Occasionally encountered 
synonyms for rooted tree are out-tree, branching, and arborescence. 

___, m-ary: a rooted tree in which every vertex has m or fewer children; also called 
an m-ary tree. 

rotational tournament: denoted Rg(S), or simply R(S) if the group G is understood; 
see $3.3, Definition D11. 

s-t flow network: see network. 

score of a vertex v in a tournament T: the number of vertices that v dominates (i.e., 
its outdegree). Denoted d*(v) (or d+(v) when T is understood). 

score sequence (or score vector) — of an n-tournament: the ordered n-tuple 
(S1,52,-+-,5n—1,$n), Where s; is the score of vertex u;, 1 <i <n, and s, < 8s, < 
aa < Sn—-1 < Sn- 

score vector: synonym for score sequence. 

second neighborhood — of a vertex x in a digraph D: the set of all vertices of D 
reachable from x by a 2-path but not a 1-path; denoted Nj*(z). 

serf — in a tournament JT: a vertex x such that for every other vertex y, there is a 
1-path or a 2-path from y to z. 

siblings — in a rooted tree: children of the same parent. 

simple digraph: a digraph with no self-loops and no multi-arcs. 

sincere decision — for a given majority n-tournament T and an agenda (a1, d2,...,@n) 
of alternatives given by the vertices of T: the alternative surviving the last vote (i.e., 
the (n — 1)'® vote) in an amendment procedure of voting using majority voting at 
each stage. 

sink — in a digraph: a vertex of outdegree zero. 

sophisticated decision — in voting: see §3.3, Definition D38. 

source — in a digraph: a vertex of indegree zero. 

spanning subgraph - of a graph or digraph: a subgraph that includes all the vertices 
of the original graph. 

spiked cycle: a connected (undirected) graph with the property that when all vertices 
of degree 1 are removed, a cycle results. 

standard plane representation of an ordered tree: a standard plane drawing of the 
tree such that at each level, the left-to-right order of the vertices agrees with their 
prescribed order. 

strong component — of a digraph G: maximal strongly connected subdigraph of G. 

strong digraph: short form of strongly connected digraph. 

strong orientation — of a graph: an orientation that results in a strong digraph. 

strong tournament: a tournament that is a strongly connected digraph. 

__, k-: a strong tournament such that the removal of any set of k —1 or fewer vertices 
results in a strong digraph. 

strongly connected digraph: a digraph in which every two vertices are mutually 
reachable, i.e., there is a directed path from each of the two vertices to the other. 

strongly orientable graph: a graph for which there exists an assignment of directions 
to the edges such that the resulting digraph is strongly connected. 

symbol set — for a rotational tournament: see §3.3, Definition D11. 

tail: see directed edge. 

topological sort or topsort: any algorithm that assigns a linear extension ordering to 
a digraph when it has one. 

topsort: short form of topological sort. 
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tournament matrix: a square matrix M = (m,,) of 0’s and 1’s, with 0’s on the main 
diagonal and m,; + mj; = 1, for all distinct i and j (i.e., the adjacency matrix of 
some tournament). 
tournament: a simple digraph such that between each pair of vertices there is exactly 
one arc. 
__, irreducible: a tournament that is not a reducible tournament. 
___, quadratic residue: a special rotational tournament; see §3.3, Definition D12. 
__, reducible: a tournament whose vertex-set can be partitioned into two non-empty 
subsets V; and V2 such that every vertex in V; dominates every vertex in V2. 
__, k-stable: a tournament in which every vertex is a king and more than k arcs 
must be reversed in order to reduce the number of kings, where k > 1. 
__, n-: a tournament of order n, i.e., an n-vertex tournament. 
transitive closure — of a graph of digraph D: the smallest supergraph of D that is 
transitive. 
transitive digraph: a digraph in which, if (u,v) and (v, w) are arcs, then so is (u, w). 
transitive orientation — of a graph: an orientation that results in a transitive digraph. 
transitive relation R: a relation in which, for all x,y, z, if Ry and yRz, then cRz. 
transitive tournament: a tournament such that for every set of three distinct vertices 
x,y, and z, if ze dominates y, and y dominates z, then x dominates z. 
transmitter — in a tournament: a vertex that dominates every other vertex in a tour- 
nament. 
unilateral digraph: a digraph in which, for all pairs of vertices u,v, there is a directed 
path between them in at least one direction. 
vertex-cut — in a strongly connected digraph: a vertex subset whose deletion results 
in a digraph that is not strongly connected. 
weights — in a graph or digraph: numbers on the vertices or edges or arcs, often 
representing something that is to be maximized or minimized. 
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Section 4.1 
Connectivity: Properties and Structure 


Camino Balbuena, Universitat Politécnica de Catalunya, Spain 
Josep Fabrega, Universitat Politécnica de Catalunya, Spain 
Miquel Angel Fiol, Universitat Politecnica de Catalunya, Spain 
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INTRODUCTION 


Connectivity is one of the central concepts of graph theory, from both a theoret- 
ical and a practical point of view. Its theoretical implications are mainly based on 
the existence of nice max-min characterization results, such as Menger’s theorems. In 
these theorems, one condition which is clearly necessary also turns out to be sufficient. 
Moreover, these results are closely related to some other key theorems in graph theory: 
Ford and Fulkerson’s theorem about flows and Hall’s theorem on perfect matchings. 
With respect to the applications, the study of connectivity parameters of graphs and 
digraphs is of great interest in the design of reliable and fault-tolerant interconnection 
or communication networks. 


Since graph connectivity has been so widely studied, we limit ourselves here to the 
presentation of some of the key results dealing with finite simple graphs and digraphs. 
For results about infinite graphs and connectivity algorithms the reader can consult, 
for instance, Aharoni and Diestel [AhDi94], Gibbons [Gi85], Halin [Ha00], Henzinger, 
Rao, and Gabow [HeRaGa00], Wigderson [Wi92]. For further details, we refer the 
reader to some of the good textbooks and surveys available on the subject: Berge 
[Be76], Bermond, Homobono, and Peyrat [BeHoPe89], Frank [Fr90, Fr94, Fr95], Gross 
and Yellen [GrYe06], Hellwig and Volkmann [HeVo08], Lovdsz [L093], Mader [Ma79], 
Oellermann [O0e96], Tutte [Tu66]. 
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4.1.1 Connectivity Parameters 
In this first subsection the basic notions of connectivity and edge-connectivity of 
simple graphs and digraphs are reviewed. 


NOTATION: Given a graph or digraph G, the vertex-set and edge-set are denoted V(G) 
and E(G), respectively. Often, when there is no ambiguity, we omit the argument and 
refer to these sets as V and E. 


Preliminaries 
DEFINITIONS 


D1: A graph is connected if there exists a walk between every pair of its vertices. 
A graph that is not connected is called disconnected. 


D2: The subgraphs of G which are maximal with respect to the property of being 
connected are called the components of G. 


D3: Let G=(V,E) bea graph and U CV. The vertex-deletion subgraph G —U 
is the graph obtained from G by deleting from G the vertices in U. That is, G—U is 
the subgraph induced on the vertex subset V — U. If U = {u}, we simply write G — u. 


D4: Let G= (V,£) beagraphand F Cc E. The edge-deletion subgraph G— F is the 
subgraph obtained from G by deleting from G the edges in F. Thus, G—F = (V, E—F). 
As in the case of vertex deletion, if F = {e}, it is customary to write G — e rather than 
G — fe}. 


D5: <A disconnecting (vertex-)set (or vertex-cut) of a connected graph G is a 
vertex subset U such that G — U has at least two different components. 


D6: A vertex v is a cut-vertex of a connected graph G if {v} is a disconnecting set 
of G. 


D7: A disconnecting edge-set (or edge-cut) of a connected graph G is an edge 
subset F’ such that G — F has at least two different components. 


D8: An edge e is a bridge (or cut-edge) of a connected graph G if {e} is a discon- 
necting edge-set of G. 


FACTS 


F1: Every nontrivial connected graph contains at least two vertices that are not 
cut-vertices. 


F2: An edge is a bridge if and only if it lies on no cycle. 
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Vertex- and Edge-Connectivity 


The simplest way of quantifying connectedness of a graph is by means of its parameters 
vertex-connectivity and edge-connectivity. 


DEFINITIONS 


D9: The (vertex-)connectivity «(G) of a graph G is the minimum number of 
vertices whose removal from G' leaves a disconnected or a trivial graph. 


D10: The edge-connectivity \(G) of a nontrivial graph G is the minimum number 
of edges whose removal from G results in a disconnected graph. 


NOTATION: When the context is clear, we suppress the dependence on G and simply use 
«& and X. 


NOTATION: In some other sections of the Handbook, k,(G) and «-(G) are used instead 
of «(G) and X(G). 


EXAMPLE 


E1: Figure 4.1.1 shows an example of a graph with « = 2 and \=3. 


Figure 4.1.1: « =2 and \ =3. 


FACTS 

F3: We have « = 0 if and only if G is disconnected or G = Ky. If G has order n, 
then «& = n — 1 if and only if G is the complete graph K,. In this case, the removal 
of n — 1 vertices results in the trivial graph Ky. Moreover, if G £ K,, is a connected 
graph, then 1 < « <n —2 and there exists a disconnecting set U of « vertices. 


F4; IfG# Kk, we have \ = 0 if G is disconnected. By convention, we set A(1) = 0. 


F5: If G #4 k, is connected, then the removal of A edges results in a disconnected 
graph with precisely two components. 


F6: The parameters « and \ can be computed in polynomial time. 
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Relationships Among the Parameters 


NOTATION: The minimum degree of a graph G is denoted 6(G). When the context is 
clear, we simply write 6. (In some other sections of the Handbook, the notation dmin(G) 
is used.) 


FACTS 

F7: [Wh32] For any graph, k < A < 6. 

F8: [ChHa68] For all integers a, b, c such that 0 < a <b <c, there exists a graph G 
with ke =a, \=b, and d=c. 

DEFINITIONS 


D11: A graph G is maximally connected when & = X = 6, and G is maximally 
edge-connected when X = 6. 


D12: A graph G with connectivity k > k > 1 is called k-connected. Equivalently, 
G is k-connected if the removal of fewer than k vertices leaves neither a disconnected 
graph nor a trivial one. Analogously, if \ > k > 1, G is said to be k-edge-connected. 


D13: A connected graph G without cut-vertices (k > 1 or G = K2) is called a block. 


Some Simple Observations 


The following facts are simply restatements of the definitions. 
FACTS 
F9: A nontrivial graph is 1-connected if and only if it is connected. 


F10: If G is k-connected, either G = K;,41 or it has at least k +2 vertices and G—U 
is still connected for any U C V with |U| < k. 


F11: A graph G is k-edge-connected if the deletion of fewer than k edges does not 
disconnect it. 


F12: Every block with at least three vertices is 2-connected. 


Internally-Disjoint Paths and Whitney’s Theorem 


DEFINITIONS 


D114: An internal vertex of a path is a vertex that is neither the initial nor the 
final vertex of that path. 


D15: The paths P,, Po,..., Py joining the vertices u and v are said to be internally- 
disjoint (or openly-disjoint) u—v paths if no two paths in the collection have an 
internal vertex in common. Thus, V(P;) 1 V(P;) = {u,v} for i F j. 
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FACTS 


F13: [Wh32] A graph G with order n > 3 is 2-connected if and only if any two vertices 
of G are joined by at least two internally-disjoint paths. 


F14: Fact F13 implies that every 2-connected graph is a block. 


F15: A graph G with at least three vertices is a block if and only if every two vertices 
of G lie on a common cycle. 


Strong Connectivity in Digraphs 


For basic concepts on digraphs, see, for example, the textbooks of Bang-Jensen and 
Gutin [BaGu01], Chartrand, Lesniak, and Zhang [ChLeZh11], Harary, Norman, and 
Cartwright [HaNoCa68}. 


DEFINITIONS 


D16: Ina digraph G, vertices u and v are mutually reachable if G contains both 
a directed u—v walk and a directed v—u walk. 


D17: A digraph G is said to be strongly connected if every two vertices u and v 
are mutually reachable. 


D18: For a strongly connected digraph G, the (vertex) connectivity & = «(G) 
is defined as the minimum number of vertices whose removal leaves a non-strongly 
connected or trivial digraph. Analogously, if G is not trivial, its edge-connectivity 
A = (G) is the minimum number of directed edges (or arcs) whose removal results in 
a non-strongly connected digraph. 


D19: Let G be an undirected graph. The associated symmetric digraph G* is the 
digraph obtained from G by replacing each edge uv € E(G) by the two directed edges 
(u,v) and (v,u) forming a digon. 


REMARKS 


R1: In our context, the interest for studying digraphs is that we can deal with 
an undirected graph G by considering G*. In particular, «(G*) = «(G), and, since a 
minimum edge-disconnecting set cannot contain digons, we also have A(G*) = A(G). 


NOTATION: The symbols 5* and 5~ denote the minimum outdegree and indegree among 
the vertices of a digraph G. Then, the minimum degree of G is defined as 6 = 
min{ot, 7}. 


R2: Note that, if G is a strongly connected digraph, then 6 > 1. The following result, 
due to Geller and Harary, is the analogue of (and implies) Fact F7. 


FACT 
F16: [{GeHa70] For any digraph, « < \ <0. 


TERMINOLOGY: A digraph G is said to be maximally connected when k = = 6, 
and G is maximally edge-connected when » = 0. 
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An Application to Interconnection Networks 


The interconnection network of a communication or distributed computer system is 
usually modeled by a (directed) graph in which the vertices represent the switching 
elements or processors, and the communication links are represented by (directed) edges. 
Fault-tolerance is one of the main factors that have to be taken into account in the design 
of an interconnection network. See, for instance, the survey of Bermond, Homobono, 
and Peyrat [BeHoPe89] and the book by Xu [Xu01]. Indeed, it is generally expected 
that the system be able to work even if several of its elements fail. Thus, it is often 
required that the (di)graph associated with the interconnection network be sufficiently 
connected, and, in most cases, a good design requires that this (di)graph has maximum 
connectivity. Communication networks are discussed in $11.4 of the Handbook. 


4.1.2 Characterizations 


When a graph G is k-connected we need to delete at least k vertices to disconnect it. 
Clearly, if any pair u,v of vertices can be joined by k internally-disjoint u—v paths, G is 
k-connected. In fact, it turns out that the converse statement is also true. That is, in a k- 
connected graph any two vertices can be joined by k internally-disjoint paths. We review 
in this subsection some key theorems of this type that characterize k-connectedness. 


Menger’s Theorems 


DEFINITION 


D20: Let u and v be two non-adjacent vertices of a connected graph G #4 K,. A 
(u|v)-disconnecting set X, or simply (u|v)-set, is a disconnecting set X C V — {u,v} 
whose removal from G leaves u and v in different components. 


NOTATION: For any pair of non-adjacent vertices u and v, «(ulv) denotes the minimum 
number of vertices in a (u|v)-set. 


NOTATION: For any two vertices u and v, «(u—v) denotes the maximum number of 
internally-disjoint u—v paths. 


FACTS 

F17: For any graph G, «(G) = min{«(ulv) : u,v € V, nonadjacent }. 

F18: (Menger’s theorem) [Me27] For any pair of non-adjacent vertices u and v, 
K(u—v) = K(ulv) 

F19: Although «(u—v) can be arbitrarily smaller than the minimum of the degrees 

of u and v, Mader proved that every finite graph contains vertices for which equality 


holds: 


F20: [Ma73] Every connected non-trivial graph contains adjacent vertices u and v for 
which K(u—v) = min{deg(u), deg(v)}. 
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NOTATION: For any pair of distinct vertices u and v, A(u|v) denotes the minimum number 
of edges whose removal from G (G non-trivial) leaves u and v in different components 
and A(u—v) denotes the maximum number of edge-disjoint u—v paths. 


F21: For any non-trivial graph G, \(G) = min{A(ulv), u,v € V}. 


F22: (Edge-analogue of Menger’s theorem) [ElFeSh56, FoFu56] For any pair of vertices 
u and v, 


A(u—v) = A(ulv). 


REMARKS 


R3:  Digraph versions of Menger’s theorems are the same except that all paths are 
directed paths. 


RA: The edge form and arc form of Menger’s theorem were proved by Ford and 
Fulkerson [FoFu56] using network-flow methods. Network flow is discussed in Chapter 
11 of this Handbook. 


Other Versions and Generalizations of Menger’s Theorem 


In addition to the ones given below, there exist other versions and generalizations 
of Menger’s theorem; see, for example, Diestel [Di00], Frank [Fr95], and McCuaig 
[McCu84]. A comprehensive survey about variations of Menger’s theorem can be found 
in Oellermann [Oe12]. 


DEFINITIONS 


D21: Given A,B CV, an A-B path is a u—v path P withue A,v Ee BhuFv, 
and any other vertex of P is neither in A nor in B. 


D22: A set X C V separates A from B (or is (A|B)-separating) if every A-B 
path in G contains a vertex of X. 


D23: An A-path is an A—B path with A= B. 


D24: Asubset X Cc V—A totally separates A if each component of G—X contains at 
most one vertex of A (or, equivalently, every A-path between different vertices contains 
some vertex of X). 


D25: A vertex subset is an independent set if no two of its vertices are adjacent. 


NOTATION: The maximum number of (internally-)disjoint A—B paths is denoted 
«&(A—B), and the size of a minimum (A|B)-separating set is denoted «(A|B). 


FACTS 


F23: The minimum number of vertices separating A from B is equal to the maximum 
number of disjoint A—B paths. That is, 
k(A—B) = k(A|B). 
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F24: If A is an independent set, the maximum number of internally-disjoint A-paths 
is at most the minimum number of vertices in a totally A-separating set, that is, 


k(A—A) < K(A|A). 


F25: The corresponding Menger-type result does not hold and inequality can be strict. 
In fact, there exist examples for which «(A—A) = «(A|A)/2. 


F26: Gallai [Ga61] conjectured that Fact F25 corresponds to the “extremal” situation 
and that always «(A—A) > «(A|A)/2, and Lovasz [Lo76] conjectured that A\(A—A) > 
\(A|A)/2. Both conjectures were proved by Mader. 


F27:  [Ma78b, Ma78c] «(A—A) > «(A|A)/2 and A(A—A) > (A A)/2. 


REMARK 


R5: The classical version of Menger’s theorem (Fact F18) is easily derived from 
Fact F23 by taking A and B as the sets of vertices adjacent to u and v, respectively. 


Another Menger-Type Theorem 


NOTATION: For any pair of vertices u and v, k,(u—v) denotes the maximum number 
of internally-disjoint u—v paths of length less than or equal to n. For any pair of non- 
adjacent vertices u and v, Kp(ulv) denotes the minimum number of vertices of a set 
X CV — {u,v} such that every u—v path in G — X has length greater than n. 


FACTS 


F28: There are examples for which we have the strict inequality &,,(u—v) < K,(ulv). 
However, for n = d(u,v) > 2 (ie., for shortest u—v paths), we have &,(u—v) = ky(ulv). 
This Menger-type result is equivalently restated as Fact F29. 


F29: [EnJaS177, LoNeP178] The maximum number of internally-disjoint shortest u—v 
paths is equal to the minimum number of vertices (different from u and v) necessary to 
destroy all shortest u—v paths. 


Whitney’s Theorem 


In a connected graph, there exists a path between any pair of its vertices, and if the 
graph is 2-connected, then there exist at least two internally-disjoint paths between 
two distinct vertices (Fact F13). As a corollary of Menger’s theorem, we have the 
remarkable result that this property can be generalized to k-connected graphs, which was 
independently proved by Whitney. It provides a natural and intrinsic characterization 
of k-connected graphs. 


FACTS 


F30: (Whitney’s theorem) [Wh32] A non-trivial graph G is k-connected if and only if 
for each pair u,v of distinct vertices there are at least k internally-disjoint u—v paths 
(or, alternatively, if and only if every cut-set has at least k vertices). 
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F31: (Edge version of Whitney’s theorem) A nontrivial graph G is k-edge-connected 
if and only if for each pair u,v of distinct vertices there exist at least k edge-disjoint 
u—v paths. 


F32: (The Fan Lemma) Let G be a k-connected graph (k > 1). Let v € V and let 
BCYV, |B| >k,v ¢ B. Then there exist distinct vertices b;, bo,...,b,% in B and a v—b; 
path P; for each i = 1,2...,k, such that the paths P,, Po,..., Py, are internally-disjoint 
(that is, with only vertex v in common) and V(P;) 1 B = {b;} for 1 =1,2,...k. 


Other Characterizations 


Another interesting characterization of k-connected graphs was independently conjec- 
tured by Frank and Maurer. The conjecture was proved by Lovasz and by Gyéri (who 
worked independently), and it appears as Fact F33. Su proved a characterization of 
k-edge-connectivity for digraphs (Fact F34). 


FACTS 
F33: [Lo77, Gy78] A graph G with n > k +1 vertices is k-connected if and only if, 
for any distinct vertices u1,u2,...,ux and any positive integers n1,n2,...,n,% such that 


ny +ng+--- +n, =n, there is a partition Vi,V2,...,V_ of V(G) such that u; € Vi, 
|V;| =n,, and the induced subgraph G(V;) is connected, 1 <i <n. 


F34: [Su97] A digraph G with at least k edges is k-edge-connected if and only if, for 
any k distinct arcs e; = (u;,u;), 1 <i < k, the digraph G — {e1, €2,...,ex} contains k 
edge-disjoint spanning arborescences (rooted trees) T;,7T2,..., 7% such that T; is rooted 
at uj, 1<i<n. 


4.1.3. Structural Connectivity 


Here our purpose is to give results about certain configurations that must be present 
in a k-connected or k-edge-connected graph. 


Cycles Containing Prescribed Vertices 


The first is a classical result by Dirac, which generalizes Fact F15. 
FACTS 


F35: [Di60] Let G be a k-connected graph, k > 2. Then G contains a cycle through 
any given k vertices. 


F36: [WaMec67] Let G be a k-connected graph with k > 3. Then G has a cycle 
containing a given set H with k + 1 vertices if and only if there is no set T C V —H 
with |T| =k vertices whose removal separates the vertices of H from each other. 
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The Lovasz—Woodall Conjecture 


Lovasz [Lo74] and Woodall [Wo77] independently conjectured that every k-connected 
graph has a cycle containing a given set F' of k independent edges (that is, no two edges 
have a vertex in common), if and only if F is not an edge-disconnecting set of odd 
cardinality. Partial results on this conjecture are given in Facts F37 > F39. 


FACTS 


F37: [Lo74, Lo77, ErGy85, Lo90, Sa96] The Lovdsz—Woodall Conjecture is true for 
k = 3,4,5. 


F38: [HaTh82] The Lovdsz—-Woodall Conjecture is true assuming that G is (k + 1)- 
connected (without restriction on the edge set F). 


F39: [Ka02] Under the same assumptions of the conjecture, F' is either contained in 
a cycle or in two disjoint cycles. 


TERMINOLOGY: A subset of independent edges is also called a matching. Matchings 
are discussed in Section 11.3 of this Handbook. 


Paths with Prescribed Initial and Final Vertices 


Given any two subsets A, B C V of k vertices of a k-connected graph, the existence of k 
disjoint paths P; (1 <i < k) connecting A and B is guaranteed by Menger’s theorem. 
Menger’s theorem does not, however, ensure that each of these paths can be so chosen 
to join a fixed u;,v; pair of vertices, u; € A, v; € B, (1 <i <k). Now we consider the 
existence of paths with prescribed end-vertices. 


DEFINITIONS 


D26: A graph G is called k-linked if it has at least 2k vertices, and for every 
sequence U1, U2,...,Uk,U1,V2,---,Uk of 2k different vertices, there exists a u;—v; path 
P,,i=1,2,...,k, such that the k paths are vertex-disjoint. 


D27: A graph is weakly k-linked if it has at least 2k vertices, and for every k pairs 
of vertices (u;,v;), there exists a u;—v; path P;, 1 < i < k, such that the k paths are 
edge-disjoint. 


D28: A graph is said to be k-parity-linked if one can find k disjoint paths with 
prescribed end-vertices and prescribed parities of the lengths. 


D29: The bipartite index of a graph is the smallest number of vertices whose 
deletion creates a bipartite graph. 


FACTS 
F40: A k-linked graph is always (2k — 1)-connected, but the converse is not true. 


F41: [Ju70], [LaMa70] (independently) For each k, there exists an integer f(k) such 
that if « > f(k) then G is k-linked. 
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F42: Thomassen [Th80a] and Seymour independently characterized the graphs that 
are not 2-linked. This is the first problem in the so-called k-paths problem that has 
been solved using the Robertson—Seymour theory [RoSe85]. 


NOTATION: For k > 1, g(k) denotes the smallest integer such that every g(k)-edge- 
connected graph G is weakly k-linked. 


CONJECTURE 
[Th80a] For every integer k > 1, g(2k+ 1) = g(2k) = 2k +1. 
FACTS 


F43: [Ok84, Ok85, Ok87] If k > 3 is odd, wi, ug,..., UK, U1, 02---, UR are (not neces- 
sarily distinct) vertices from a set T with |T| < 6, and A(uw;,v;) >k 1 <i <k&), then 
there exists a u; — v; path for 1 <i < k such that the k paths are edge-disjoint. 


F44: [(Hu91] For every integer k > 1, g(2k+1) < 2k42 and g(2k) < 2k4 2. 


F45: [Ok88, Ok90a] For every integer k > 1, 
(a) g(2k +1) < 3k and g(2k +2) < 3k +2, 
(b) g(3k) < 4k and g(3k +2) < 4k +2. 


F46: [ThO1] Every f(k)-connected graph (defined in Fact F41) with bipartite index 
at least 4k — 3 is k-parity-linked. 


F47: [Su97] Let G be a k-edge-connected digraph, and let (w1, f1,v1), (ue, fo, v2), 

.., (Uk, fk, Uk) be any & triples, where ui, u2...,Uz, V1, V2...,U% are not necessarily 
distinct vertices, and fi, fo,...,f% are k distinct arcs, either of the form f; = (u;, ti), 
i=1,...,k, or fi; = (ti, v;), i= 1,...,k. Then there exist k edge-disjoint u;—v; paths 
P; in G such that f; € E(P;),i=1,...,k. 


Subgraphs 


High connectivity implies a large minimum degree (Fact F7). Conversely, a large min- 
imum degree does not guarantee high connectivity (Fact F8). However, it does ensure 
the existence of a highly connected subgraph. 


FACT 

F48: [Ma72a] Every graph of minimum degree at least 4&4 contains a k-connected 
subgraph. 

REMARK 


R6: In fact, Mader [Ma72a] proved that if the average of the degrees of the vertices 
of G is at least 4k, then G contains a k-connected subgraph. Concerning the proof of 
Fact F48, see also Thomassen [Th88]. 
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4.1.4 Analysis and Synthesis 


An interesting question in the study of graph connectivity is to describe how to 
obtain every k-(edge-)connected graph from a given “simple” one by a succession of 
elementary operations preserving k-connectedness. A classical result on this topic is 
Tutte’s theorem, which states how to construct all 3-connected graphs, starting with a 
wheel graph. We also consider some relevant results dealing with deletion of edges or 
vertices. Finally, some facts concerning minimally and critically k-connected graphs, as 
well as a reference to connectivity augmentation problems, are considered. 


Contractions and Splittings 


DEFINITIONS 


D30: The contraction of an edge uv consists of the identification of its endpoints u 
and v (keeping the old adjacencies but removing the self-loop from u = v to itself). Let 
G be a k-connected graph. An edge of G is said to be k-contractible if its contraction 
results in a k-connected graph. 


D31: The converse operation is called splitting: A vertex w with degree 6 is replaced 
by an edge uv in such a way that some of the vertices adjacent to w are now adjacent 
to u and the rest are adjacent to v. Moreover, if the new vertices u,v have degrees at 
least k = 6/2 +1 we speak about a k-vertex-splitting. 


D32: For any integer n > 4, the wheel graph W,, is the n-vertex graph obtained by 
joining a vertex to each of the n — 1 vertices of the cycle graph Cy_1. 
FACTS 


F49: If G is a k-connected graph, the operations of k-vertex splitting and edge 
addition always produce a graph that is also (at least) k-connected. In fact, as shown 
below, for k = 3 these operations suffice to derive all 3-connected graphs. 


F50: [Th80b] Every 3-connected graph distinct from Ky has a 3-contractible edge. 


F51: [Th81] Every triangle-free (no 3-cycles) k-connected graph has a k-contractible 
edge. 


F52: [Tu61] Every 3-connected graph can be obtained from a wheel by a finite sequence 
of 3-vertex-splittings and edge additions. 


REMARK 


R7: In general, k-connectedness does not ensure the existence of k-contractible edges. 
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EXAMPLE 


E2: = In Figure 4.1.2, the cube graph Q3 is synthesized from the wheel graph Ws; in 
four steps. All but the second step are 3-vertex-splittings. 


~ Py - XP 


Figure 4.1.2: A 4-step Tutte synthesis of the cube graph Q3. 


REMARKS 


R8: Thomassen used Fact F50 to give a short proof of Kuratowski’s theorem on 
planarity. Fact F50 can also be derived from Tutte’s theorem (Fact F52). 


R9: Since Tutte’s paper, the distribution of contractible edges in graphs of given 
connectivity has been extensively studied. For a comprehensive survey of this sub- 
ject, we refer the reader to Kriesell [Kr02], where the author also considers subgraph 
contractions (see below). 


R10: Fact F52 is a reformulation of the following proposition [Tu61]: a 3-connected 
graph is either a wheel, or it contains an edge whose removal leaves a 3-connected 
subgraph, or it contains a 3-contractible edge that is not in a cycle of length 3. 


R11: Slater [S174] gave a similar result for constructing all 4-connected graphs starting 
from Ks, but in this case three more operations are required. For k > 5 the problem 
is still open. However, Lovasz [Lo74] and Mader [Ma78a] managed to construct all k- 
edge-connected pseudographs (loops and multiple edges allowed) for every k even and 
odd, respectively. 


Subgraph Contraction 


The contraction of a subgraph is a natural generalization of edge contraction. 


DEFINITION 


D33: A connected subgraph H of a k-connected graph G is said to be k-contractible 
if the contraction of H into a single vertex results in a k-connected graph. 
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FACTS 


F53: [McOt94] Every 3-connected graph on n > 9 vertices has a 3-contractible path 
of length two. 


F54: [ThTo81] Every 3-connected graph with minimum degree at least four contains 
a 3-contractible cycle. 


F55: [Kr00] Every 3-connected graph of order at least eight has a 3-contractible 
subgraph of order four. 


CONJECTURE 


[McOt94] For every n, a 3-connected graph of sufficiently large order has a 3-contractible 
subgraph of order n. 


Edge Deletion 


DEFINITION 


D34: <A subgraph H of a k-edge-connected graph G is said to be p-reducible if the 
graph obtained from G by removing the edges of H is (k — p)-connected. 


FACTS 


F56: [{Ma74] Every k-connected graph G with minimum degree at least k + 2 contains 
a cycle C such that G — E(C) is k-connected. 


F57: [Ok88] Let G be a k-edge-connected graph with k > 4 even. Let {u,v} Cc V and 
{e1,e2,f} CE,e, 4 f (¢=1,2). Then, 


(a) There exists a 2-reducible cycle containing e; and e2, but not f. 


(b) There exists a 2-reducible u—v path containing e;, but not f. 


F58: [Ok90b] Let G be a k-edge-connected graph with k > 2 even. If {uy, v1, ua, vo} 
are distinct vertices, with edges e9 = v1 V2, e; = ujv; (i = 1,2), and there is no edge-cut 
with k or k +1 elements containing {e9,e1,¢2}, then there exists a 2-reducible cycle 
containing {eo, €1, e2}. 


F59: [HuOk92] For each odd k > 3, there exists a k-edge-connected graph containing 
two vertices u and v such that every cycle passing through wu, v is p-reducible with p > 3. 


REMARK 


R12: For the case of three consecutive edges e1, €2, €3 of a k-connected graph, Okamura 
[Ok95] also found a nontrivial equivalent reformulation of the condition that no cycle 
of G containing e1, e2, and eg is 2-reducible. 
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FACTS 


F60: [ChKaLi72] Every 3-connected graph of minimum degree at least 4 has a vertex 
v such that G — v is 3-connected. 


F61: [Th81] Every (k + 3)-connected graph has an induced (chordless) cycle whose 
deletion results in a k-connected graph. 


F62: [Eg87] Every (k + 2)-connected triangle-free graph has an induced cycle whose 
deletion results in a k-connected graph. 


REMARK 


R13: Fact F61 was conjectured by Lovasz, and Thomassen used Fact F51 to prove 
it. 


Products of Graphs 


DEFINITIONS 


D35: Recall that the cartesian product of two graphs G; = (Vi, Ei), i = 1,2, is the 
graph Gi0G>2 with vertex set V; x V2, and for which vertices (11,22) and (yi, y2) are 
adjacent if x; = y, and ray2 € Fe, or x,y, € Fy and x2 = yo. 


D36: The Kronecker product of two graphs G; = (Vj, E;), i = 1,2, is the graph 
G1 X Gp with vertex set V; x V2, and for which vertices (x1, #2) and (y1, ye) are adjacent 
if T1Y1 € FE, and L2Y2 € E>. 


D37: [BeDeFa84] Let G; = (Vj, E;), 1 = 1,2, be two graphs with the edges of G1 
arbitrarily oriented, in such a way that an oriented edge from x1 to y; is denoted by 
€x,y,- For each arc ez,y,, let Te,,,, be a permutation of V2. Then the twisted product 
G, * Gy has V, x V2 as vertex set, with two vertices (71,72), (yi, y2) being adjacent if 
and only if either 

%=y, and «ry2 € FE 


or 
mye, and y=Te,y, (x2). 


D38: [BaDaFiMi09] Given two graphs G; = (V;, £;), 1 = 1,2, and a non-empty vertex 
subset U, C Vi, the generalized hierarchical product G'(U,) G2 is the graph with 
vertex set V; x V2, and for which vertices (a1, 22) and (y1, y2) are adjacent if riy, € Ey 
and x2 = y2, or 41 = y1 € Uy and xoye € Ep. 
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FACTS 
F63: [XuYa06] For any nontrivial graphs G; and Go, 
K(G, G2) > min{K(G,) + 6(G), K(G2) + 6(G,)} 


and 


\(Gi0G2) 2 min{A(G1)|V2], A(G2)|Vil, 6(Gi) + 6(Ga2)}- 
F64:  [Sp08] For any nontrivial graphs G, and Go, 
«(Gi0G2) = min{K(G1)|V2|, «(G2)|Vi], 6(G1) + 6(G2)}- 


F65:  [We62] If G; and G2 are two connected graphs, then G1 x G2 is connected if 
and only if G; and G2 are not both bipartite graphs. 


F66: 
(a) [MaVu08] «(Kn x Km) = (n—1)(m—1) for any n >m>2 and n> 3. 


(b) [WaWull] «(G x K,) = min{n«(G), (rn — 1)6(G)} for any nontrivial graph G and 
n> 3. 


F67: [BaGVMa06, BaCeDiGVMa07] 
(a) For any nontrivial graphs G; and Go, 
min{«(G1)|Va|, (61 + 1)K(G2), 61 + d2)} < K(G1 * Go) < 61 + bo; 
min{A(G1)|Va|, (61 + 1)A(G2), 61 + d2)} < A(G1 * Ge) < 61 + bo, 


where 6, + 62 is the minimum degree of Gy * Go. 


(b) If G, and G2 are maximally connected, then G'; * Gz is also maximally connected. 
(c) For every connected graph G, the graph G * G is maximally connected. 
F68: [BaDaFiMi09] The connectivity of the generalized hierarchical product satisfies 
K(G (U1) G2) < min{«(G1)|Vo|, «(Ui|U7), 6(G1(01) 1 Ga)}, 
where U; Cc V; — U, and 6(Gi (U1) N G2) = min{6(G, — U1), 6(Gi(U1)) + do}. 


REMARKS 


R14: The graph G, * G2 can be viewed as formed by |V| disjoint copies of G2, each 
oriented edge xy; indicating that some perfect matching between the copies G{1, GY? 
(respectively generated by the vertices x; and y; of G1) is added. Moreover, K2 * G is 
a permutation graph {[ChHa67]. 


R15: If in Definition D37, 7-,,,, is the identity permutation for any oriented edge 
€x,y,, the twisted product G  * Gz is the cartesian product G,UG»2. 


R16: If U; is consits of only one vertex, then Gi (U1) MG is the standard hierarchical 
product [BaCoDaFi09], whereas if U; = V; we obtain the cartesian product GiOG2. 


R17: Fact F66(b) was previously proved for G bipartite in [GuVu09]. 


R18: Regarding Fact F66, the connectivity of Kronecker products by Ke has been 
recently studied in [WaYal2]. 
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Minimality and Criticality 


A standard technique used to study a certain property P is to consider those graphs 
that are edge-minimal or vertex-minimal (critical) with respect to P, in the sense that 
the removal of any vertex or edge produces a graph for which P does not hold. 


DEFINITIONS 


D39: A graph or digraph G is said to be minimally k-connected if «(G) > k but, 
for each edge e € E, K(G—e) < k. Analogously, G is minimally k-edge-connected 
if A(G) > k, but for each e € E, A(G-e) <k. 


D40: A vertex u of a digraph has half degree k if either degt(u) = k or deg” (u) = k. 


FACTS 


F69: [Ma71, Ma72b] Every minimally k-connected (or k-edge-connected) graph con- 
tains at least & + 1 vertices of degree k. 


F70: [Ma72b] Every cycle of a minimally k-connected graph contains a vertex of 
degree k. 


F71: Every cycle in a k-connected graph G contains either a vertex of degree k or an 
edge whose removal does not lower the connectivity of G. 


F72: [Ha81] Every minimally k-connected digraph contains at least k + 1 vertices of 
half degree k. 


F73: [Ma02] Every minimally k-connected digraph contains at least k +1 vertices of 
outdegree k and at least k + 1 vertices of indegree k. 


REMARKS 


R19: Halin [Ha69, Ha00] proved the existence of a vertex of degree k in every 
minimally k-connected graph, and the corresponding theorem for minimally k-edge- 
connected graphs was proved by Lick [Li72]. Both results were then improved by Mader 
(Fact F69). 


R20: Fact F72, a consequence of Mader’s result Fact F73, is due to Hamidoune and is 
the digraph analogue of (and implies) Mader’s theorem (Fact F69) about the existence 
of vertices of degree k. The existence of at least one vertex of half degree k had been 
previously asserted by Kameda [Ka74]. 


Vertex-Minimal Connectivity — Criticality 


Maurer and Slater [MaS177] introduced the general concept of critically connected and 
critically edge-connected graphs, graphs whose connectivity decreases when one or more 
vertices are removed. 
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DEFINITION 


D41: A graph G is called k-critically n-connected, or an (n,k)-graph, if, for each 
vertex subset U with |U| < k, we have «(G —U) = n-—|U|. When k = 1, we simply 
refer to the graph as critically n-connected. 


FACTS 
F74: [MaS177] The only (n,n)-graph is the complete graph Ky,41. 


F75: The “cocktail party graph” (obtained from Ko,+42 by removing a 1-factor [perfect 
matching]) is a (2n,n)-graph but not a (2n,n + 1)-graph. 


F76: (Su88] The complete graph on k+1 vertices is the unique k-critically n-connected 
graph with n < 2k. 


F77: [Ma77] If G is a (n,3)-graph, then its order is at most 6n?. Thus, for each n, 
there are only finitely many of (n,3)-critical graphs. 


REMARKS 
R21: An early survey about (n, k)-graphs can be found in [Ma84]. 


R22: Fact F75 led Slater to conjecture that, apart from K,,11, there is no (n, k)-graph 
with k > n/2, which, after some partial results, was finally proved by Su (Fact F76). 


R23: Fact F77 was generalized by Mader to the class of all finite n-connected graphs. 


Connectivity Augmentation 


We conclude the section by referring the reader to Frank [Fr94] for an in-depth discus- 
sion of connectivity augmentation. In the edge-connectivity augmentation problem, we 
are given a graph G = (V, £) and a positive integer k, and the goal is to find the smallest 
set of edges F' that we can add to G such that G’ = (V, E'U F) is k-connected. Due to 
its applicability to the design of fault-tolerant networks, connectivity augmentation has 
also been widely investigated from an algorithmic point of view. Watanabe and Naka- 
mura [WaNa87] gave the first polynomial-time algorithm solving the edge-connectivity 
augmentation problem. In the same paper, the authors formulated a necessary and 
sufficient condition to decide if a given graph G can be made k-connected by adding at 
most a certain number of edges. The same question for digraphs was solved in [Fr92]. 
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INTRODUCTION 


Eulerian graph theory has its roots in the Konigsberg Bridges Problem: Four land- 
masses are being connected by seven bridges as depicted in Figure 4.2.1. The graph 
theoretical model of this problem is depicted in Figure 4.2.2. 


KONINGSBERGA 


Figure 4.2.1: A map of K6énigsberg as it was in Euler’s days, with highlighted bridges. 


QUESTION: Starting at any of the four landmasses, is it possible to perform a walk 
such that every bridge is crossed once and only once, the walk ending at any of these 
four landmasses? L. Euler wrote an article on this problem in 1736 [Eul736]; hence 
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Figure 4.2.2: Graph model of the Konigsberg Bridges Problem. 


the name eulerian graph. This paper can be viewed as the “birth certificate” for graph 
theory, in general. For an extensive treatment of eulerian graphs and related topics see 
(F190, F191]. The equivalent question asks for a walk in the graph of Figure 4.2.2 such 
that every edge is traversed precisely once. 


4.2.1 Basic Definitions and Characterizations 


NOTATION: Throughout this section, a graph, digraph, or mixed graph is denoted G = 
(V, £), where V is the vertex-set of G and E is the edge-set of G, consisting of undirected 
edges, directed edges (arcs), or both, respectively. 


TERMINOLOGY: Sometimes, for emphasis and to avoid confusion, the adjective “undi- 
rected” is used for “graph” or “edge.” 


DEFINITIONS 


D1: An eulerian tour in a graph (or digraph) G is a closed walk that uses each edge 
(or arc) of G exactly once, and hence, is a closed trail. An eulerian tour in a mixed 
graph is a closed trail that uses each edge and each arc exactly once. An open eulerian 
trail is an open trail that uses each edge and/or arc exactly once. 


D2: A graph, digraph, or mixed graph that has an eulerian tour is called ewlerian. 
D3: An undirected graph is even if every vertex has even degree. 


D4: In a digraph, a vertex v is balanced if the indegree and outdegree of v are equal. 
A digraph is balanced if all of its vertices are balanced. 


TERMINOLOGY NOTE: In §4.3, the term symmetric is used instead of balanced when the 
indegree and outdegree of v are equal. 


D5: A balanced orientation of a graph (or mixed graph) G is an assignment of a 
direction to each edge of the graph (or each undirected edge of the mixed graph) so that 
the resulting digraph is balanced. 


D6: A cycle decomposition of a graph (digraph) G is a partition of the edge-set 
(arc-set) of G such that each partition set forms a cycle (directed cycle). 
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Some Basic Characterizations 


Unless stated otherwise, we assume graphs not to have any self-loops. Note that the 
existence or non-existence of self-loops has no effect on whether a graph is eulerian. 


FACTS 
For details of the following facts, see, e.g., [To73, Mc84, Wo90, F189, F190). 


F1: The Classical Characterization ([Eu1736], [Hi1873], [Vel2, Ve31]) 
Let G be a connected graph. The following are equivalent: 


(a) G is eulerian. 
(b) G is an even graph. 
(c) G has a cycle decomposition. 
F2: A graph is even if and only if it has a balanced orientation. 
F3: A graph is even if and only if it has a decomposition into closed trails. 


F4: A graph is even if and only if every edge belongs to an odd number of cycles. 


F5: A graph is even if and only if it has an odd number of cycle decompositions. 
F6: A connected graph G = (V, £) is eulerian if and only if the number of subsets of 
E (including the empty set) that induce an acyclic subgraph of G is odd ([Sh79, F189, 
F190). 
F7: For a connected digraph D the following are equivalent. 

(a) D is eulerian. 


(b) D is a balanced digraph. 


(c) D has a directed cycle decomposition. 


REMARKS 

R1: For the classical characterization, Euler ([Eu1736]) showed that Fact F1(a) implies 
Fact F1(b), while the converse is due to Hierholzer ({[Hi1873]). The equivalence of Fact 
F1(b) and Fact F1(c) is due to Veblen ([Vel2, Ve31]}). 


R2: By Fact F1, the statements in Facts F2 through F6 can be viewed as alternative 
characterizations of eulerian graphs. 


R3: Note that a connected eulerian digraph is strongly connected. 


R4: There is no digraph or mixed graph analog for the characterization expressed in 
Fact F4. 
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Characterizations Based on Partition Cuts 


DEFINITIONS 


D7: Let G be a graph and let X C V(G). The partition-cut associated with X, 
denoted F(X, X), is the set of edges in G with one endpoint in X and one endpoint in 
X =V(G)-—X. A partition-cut in a digraph or mixed graph is analogously defined. 


D8: An edge-cut, arc-cut, and mixed-cut are partition-cuts in a graph, digraph, 
and mixed graph G, respectively, associated with some X C V(G). 


D9: The out-arcs of an arc-cut (or mixed-cut) E(X, X) is the subset of directed edges 
whose tail is in X and is denoted E+(X,X). The in-arcs of E(X,X) is the subset of 
directed edges whose head is in X and is denoted E~(X,X). 


D10: Let v be a vertex of a graph, digraph, or mixed graph G. The tncidence set of 
v, denoted E,, is the partition-cut E(X, X), where X = {v}. 


NOTATION: In a digraph, the out-arcs and in-arcs of the incidence set of v are denoted 
Ey and E,,, respectively. 


FACTS 
F8: A graph G is even if and only if |E(X, X)| is even for every X C V(G). 


F9: A connected digraph G is eulerian if and only if |E+(X,X)| = |E~(X,X)| for 
every X C V(G). 


F10: Let G be a connected mixed graph. The following are equivalent: 


(a) G is eulerian. 


(b) |E(X,X)| - ||E*(X, X)| — |E~(X, X)|| is nonnegative and even for every X C 
V(G). 


(c) G has a cycle decomposition. 


REMARKS 


R5: While Fact F1(b) and its digraph analogue, Fact F7(b), are (local) degree condi- 
tions guaranteeing that a graph (undirected or directed) is eulerian, for mixed graphs 
one needs the global condition in Fact F10(b) (which reduces to Facts F8 and F9 for 
undirected graphs and digraphs). 


R6: Although the condition in Fact F10 is impractical from an algorithmic point of 
view for producing an eulerian tour in a mixed graph G, such a tour can be obtained 
using network-flow techniques by first getting a balanced orientation Dg of G; then any 
eulerian tour of Dg corresponds to an eulerian tour in G [FoFu62]. 
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4.2.2 Algorithms to Construct Eulerian Tours 


We begin with two classical algorithms for constructing an eulerian tour. All three 
algorithms in this subsection are polynomial-time (see [F190]). 


Algorithm 4.2.1: Hierholzer’s Algorithm [Hi1873] 


Input: a connected graph G whose vertices all have even degree. 
Output: an eulerian tour T. 


Start at any vertex v, and construct a closed trail T in G. 

While there are edges of G' not already in trail T 
Choose any vertex w in T that is incident on an unused edge. 
Starting at vertex w, construct a closed trail D of unused edges. 
Enlarge trail T by splicing trail D into T at vertex w. 

Return T. 


COMPUTATIONAL NOTE: A modified depth-first search (see §10.1), in which every un- 
used edge remains in the stack, can be used to construct the closed trails. 


EXAMPLE 


El: The key step in Algorithm 4.2.1 is enlarging a closed trail by combining it 
with a second closed trail — the detour. ‘To illustrate, consider the closed trails, 
T = (t1,te,t3,t4) and D = (d1,d2,d3), in the graph shown in Figure 4.2.3. The 
closed trail that results when detour D is spliced into trail T at vertex w is given 
by T”’ = (t1, ta, di, do, d3,t3,t4). At the next iteration, the trail (e1, e2, e3) is spliced into 
trail T’, resulting in an eulerian tour of the entire graph. 


Figure 4.2.3: Splicing (di, da, ds) into (ti, ta, t3, t4) results in (ti, ta, dy, do, ds, t3, t4). 


REMARKS 


R7: The splicing operation in Hierholzer’s algorithm is also called a k-absorption and 
is discussed later in this section. 


R8: The strategy in Fleury’s algorithm, shown below, is to avoid, if possible, traversing 
a bridge in the subgraph induced on the set of untraversed edges. Fleury’s algorithm 
also appeared in [Lu1894]. 
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Algorithm 4.2.2: Fleury’s Algorithm [F11883] 


Input: Eulerian graph G with q edges and vp € V(G). 
Output: Eulerian tour T9. 
Choose e€1; = vov1 € Ey, arbitrarily. 
Let T; = (vo, €1, U1) 
Fori=1toq-1 
If degg, (vi) = 1 
Let €:41 = Uivig1 € E(G;). 
Else 
Choose ej41 = U;0i41 € E(G;) that is not a bridge in G,. 
Extend T; to Ty41 = (vo, €1,U1,-++5 Us, Ci4+1, Us41)- 


The Splitting and Detachment Operations 


The splitting and detachment operations can serve as the basis for many of the charac- 
terizations, constructions, and decompositions discussed in this section. 


DEFINITIONS 


D11: Let G be a graph with vertex v such that deg(v) > 3, and let e,, ey be incident 
on v and Wa, wp, respectively. The graph G,» obtained from G by introducing a new 
vertex Ug», adding new edges e/,,€}, joining vg, and wa, wp, respectively, and deleting 
€q, €p is called the a-b split of G at v. The operation that produces G,, is called the 
splitting operation (see Figure 4.2.4.) 


Wy 


G G 


Figure 4.2.4: The splitting operation producing the 1-2 split of G at v. 


D12: Let v be a vertex of a graph G with deg(v) > 2, and let the edge subsets 
E\(v), Eo(v),...,Ex(v), k > 2, be a partition of the incidence set E,. Replace v with 
new vertices v1, U2,...,Uk, and let vj, i = 1,2,...,k, be incident on the edges of E;(v) 
(without altering any other incidence). The graph H thus obtained is called a detach- 
ment of G at v. This action is called a detachment operation at v (see Figure 
4.2.5). 


D13: A graph H is a detachment of G if it results from a sequence of detachment 
operations performed at each of the vertices of some vertex subset W C V(G). For a 
discussion of detachments of graphs, see [Na79, Na85a, Na85b]. 
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Figure 4.2.5: Graph A is a detachment of G at v. 


FACTS 


F1l: Splitting Lemma. Let v be a vertex of a connected, bridgeless graph G with 
deg(v) > 4, and let e1, e2,e3 € Ey. 

(a) If v is not a cut-vertex then at least one of the splits Gi,2 or G1,3 is connected and 
bridgeless. 

(b) If v is a cut-vertex and e; and eg belong to different blocks, then G3 is connected 
and bridgeless. 


F12: Let v be a vertex in a 2-connected graph G (i.e., no cut-vertices) with deg(v) > 4. 
If neither the 1-2 split G12 at v nor the 1-3 split G13 is 2-connected, then G2 and 
Gi,3 have the same cut-vertex x and no other cut-vertices. In this case, both Gj,2 and 
Gj,3 are connected and bridgeless. 


F13: A graph is connected if and only if there is a detachment of G that is a tree. 


F14: A graph is eulerian if and only if it has a detachment that is a cycle. 


REMARKS 


R9: The Splitting Lemma (Fact F11) can serve as the basis for many of the results and 
algorithms mentioned in this section (see, e.g., [F190]). It can also be used to restrict, 
with no loss in generality, various other (solved as well as unsolved) graph theoretical 
problems to 3-regular graphs. For a short proof of the Splitting Lemma, see [F100]. 


R10: Definitions D11 through D13 and Facts F1l through F14 can be formulated for 
graphs with self-loops as well. In this case (but also later on) it makes sense to consider 
an edge e (not just self-loops) as composed of two half-edges incident on the respective 
endpoints of e. Correspondingly, one then considers the splitting operation as involving 
different half-edges and the sets F,, E;(v) as being sets of half-edges. 


R11: The splitting operation can be viewed as a special case of the detachment op- 
eration, where the partition of the incidence set E, has exactly two cells, with at least 
one cell containing exactly two edges. 
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Algorithm 4.2.3: Splitting Algorithm 


Input: Eulerian graph G with q edges and vp € V(G). 
Output: Eulerian tour Tj in the form of a detachment of G. 


Initialize H =G. 
Choose e; = vov1 € Ey, arbitrarily. 
Let T, = (vo, €1, V1) 
For i= 1 to q 
If deg, (vi) = 2 
Let ej41 = vidiga € Eo, (H) = E(T;). 
Else {apply splitting lemma} 
If v; is not a cut-vertex of H 
Choose €;41 = vjUj41 € Ey,(H) — E(T;) arbitrarily. 
Else 
Choose €j41 = vjv;41 in a different block than e;. 
H := Hig41) {the i-(i+ 1) split of H at u,} 


Extend T; to Tj41 = (vo, €1, V1, ++ Vi, 441; Vi41)- 


REMARKS 


R12: Algorithms 4.2.1-4.2.3 can easily be adapted to construct an eulerian tour in a 
digraph: all one needs to do is choose e;;, such that v; is its tail since v; is the head of 
ej. 


R13: The difference between the Splitting Algorithm and Fleury’s Algorithm lies 
exclusively in the fact that the intermediate trails T;, 0 < 7 < q, are stored separately as 
edge sequences, say, by Fleury’s Algorithm, while the Splitting Algorithm retains them 
as part of the graphs considered. In both cases, however, it is the Splitting Lemma 
which guarantees the correctness of these algorithms (see [F190]). Observe that all even 
graphs are necessarily bridgeless. 


4.2.3 Eulerian-Tour Enumeration and Other 
Counting Problems 


The BEST-Theorem gives an explicit, computationally good formula for the number 
of eulerian tours in an eulerian digraph. It rests on the Matrix Tree Theorem (Fact F15) 
and can be applied to (undirected) graphs by summing over all balanced orientations 
of G. The latter, however, grows exponentially large with the number of vertices. We 
also briefly mention deBruijn (di)graphs because of their relevance to DNA-sequencing 
and other questions. DeBruijn digraphs are discussed in §4.4. 


DEFINITIONS 


D14: An out-tree in a digraph is a tree having a root of indegree 0 and all other 
vertices of indegree 1, and an in-tree is an out-tree with edges reversed. 
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D15: Let D be a digraph, A(D) its adjacency matrix with entries a;,;, and let \; be 
the number of self-loops at vj; € V(D) = {v1,...,Un}. The Kirchhoff matriz A*(D) 
with entries aj; is defined by setting 


ai,=—ay if iA, aj, =id(u;)—A ; l<ijgcn 


D16: Let aset A = {a1,...,an} be called an alphabet whose letters are the elements 
of A. A k-letter word over A is an ordered k-tuple whose components are letters. A 
k-deBruijn sequence over A is a cyclic sequence of letters from A such that every 
k-letter word over A appears exactly once in this cyclic sequence. 


D17: Let n> 2,k > 2. The deBruijn graph D,,,x, has as its vertices the (k — 1)-letter 
words over an n-letter alphabet A; thus, there are altogether n*—! vertices. For each 
k-letter word a,;,,...,@;, in the alphabet A, there is an arc of D,,, that joins the vertex 
Qi,,++-,Qi,_, to the vertex aj,,..., Qi,- 


TERMINOLOGY: For a matrix A, A;,; denotes the (i, j)-th minor, i.e., the matrix ob- 
tained by deleting the i-th row and j-th column from A. 


FACTS 


F15: Matrix Tree Theorem. Given a digraph D,V(D) = {v1,...,Un}, let A* = A*(D) 
be its Kirchhoff matrix. The number of spanning out-trees of D rooted at v; is det Aj ;. 


F16: In an eulerian digraph D, the number of spanning in-trees rooted at v; equals 
the number of spanning out-trees rooted at v;. 


F17: For an eulerian digraph D, det Aj; = det Aj,, 


F18: BEST-Theorem. [EhBr51, TuSm41] Let D be an eulerian digraph of order n, 
and let v; € V(D), a € Ex be chosen arbitrarily. The number of eulerian tours starting 
at v; with the traversal of a is 


Ll<ij<n. 


n 


det Aj; | | (od(v;) — 1)! 


p=i 


F19: For an eulerian graph G with p vertices and q edges, and chosen e € E(G), the 
number O(G) of balanced orientations of G containing a fixed orientation of e satisfies 


3 q—Pp 
($)  <ox(6) <2. 
F20: The deBruijn graph D,,;, is an n-regular digraph (id(v) = od(v) = n for every 
v € V(Dn.x)) with n*—! vertices. 


F21: There is a 1-1 correspondence between the set of k-deBruijn sequences over an 
n-letter alphabet and the set of eulerian tours of the deBruijn graph D,,. Consequently, 
and as an application of the BEST-Theorem, the number of k-deBruijn sequences over 
an n-letter alphabet is ; 


(n!)”" 
nk 
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EXAMPLE 
E2: The deBruijn graphs D23 and D2,4 are shown in Figure 4.2.6 (see also [ChOe93, 


p. 220]). 


a 


42 0001 
1001 
Do 4 . 


c—_ 


0100 0010 | 0011 
1100 
4 =1010 0101 iy 


1101 1011 
a nf C 
Pe 0111 
S 1111 


Figure 4.2.6: The deBruijn digraphs Do3 and D2,4. 


REMARK 


R14: DeBruijn graphs are of particular interest in the case n = 2, i.e., when the words 
are binary sequences. The study of the graphs D2, has been applied in biochemistry 
when considering the problem of DNA sequencing. These same graphs have also been 
of interest in telecommunications when one is concerned with the question of network 
reliability. Apart from D2, Kautz graphs and hypercubes also play an important 
role because these graphs perform well with respect to diameter and other parameters, 
although the number of edges is relatively small in comparison to the number of vertices 
(see, e.g., [Xu02]). 


4.2.4 Applications to General Graphs 


In this subsection, we introduce some applications of eulerian graph theory to graph 
theory in general; some of these applications are also relevant in computer science and 
operations research, for example, the Chinese Postman Problem (§4.3). Interestingly, 
while certain analogues of results in eulerian graph theory hold equally well for general 
graphs, there are other quite natural analogues that lead to yet unsolved problems. 
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Covering Walks and Double Tracings 


DEFINITIONS 


D18: A covering walk (or postman tour) in an arbitrary graph G is a closed walk 
containing every edge of G. 


D19: A double tracing is a closed walk that traverses every edge exactly twice. A 
double tracing is bidirectional if every edge is used once in each of its two directions. 


D20: A retract or retracing in a walk W is a section of the form v;_1, ej, U;, €:41, 
v;41 such that e; = e;41 (and thus vj41 = vi-1). W is called retract-free if it has no 
retracts. 


D21: A double tracing is called strong if it is both bidirectional and retract-free. 


D22: The edge-connectivity of a connected graph G, denoted X(G), is the minimum 
number of edges whose removal can disconnect G. G is called k-edge-connected if 
NG) > k. 


FACTS 


F22: Let G be a graph with 2k vertices of odd degree, & > 0. Then G has a de- 
composition into k open trails whose initial and end vertices are of odd degree in G. 
Consequently, G has a decomposition into cycles and k paths; and if k = 1 and G is 
connected, then it has an open eulerian trail. 


F23: Every connected graph has a bidirectional double tracing. In a tree, every double 
tracing is bidirectional. 


F24: [Sa77] A connected graph has a retract-free double tracing if and only if it has 
no end-vertices (vertices of degree 1). 


F25: [Th85] If G is a graph without 1- and 3-valent vertices, then it has a strong double 
tracing. Consequently, every 4-edge-connected graph has a strong double tracing. 


F26: [Tr66], [Th85] A connected 3-regular graph with |V(G)| = 0 mod 4 has no strong 
double tracing. 


F27: [Ve75] Let G be a connected graph and Ey C E(G). G has a double tracing 
using every e € E(G) — Epo twice in the same (not prescribed) direction, and acting 
bidirectional on Eo, if and only if G— Ep is an even graph. Observe that this implies 
Fact F23 (taking Eo = E(G). 


REMARKS 


R15: The condition for a double tracing to be bidirectional (Definition D19) applies 
to the case of self-loops if one views edges as composed of two half-edges, which allows 
a loop to be viewed as being also orientable in two ways. 


R16: The double tracings quoted in Facts F23 and F25 can be obtained in polynomial 
time by reducing the respective problems to problems of finding eulerian tours satisfying 
certain restrictions, in eulerian digraphs derived from the given graphs by replacing every 
edge by two oppositely oriented arcs joining the same pair of vertices. 
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Maze Searching 


In the context of this section, a maze may be viewed as a connected graph for which 
one has at each vertex local information only. Tarry’s algorithm is just one of several 
maze-searching algorithms. (See [F191] for a more extensive study.) 

NOTATION: In the description of Algorithm 4.2.4, ei,(v), v 4 vo, denotes the edge that 
was traversed in visiting vertex v for the first time, {ein(vo)} = 9, and Fier (v) denotes 
the set of edges that have been already traversed in leaving v. 


Algorithm 4.2.4: Tarry’s Algorithm [Tal895] 


Input: a connected graph G. 
Output: a bidirectional double tracing of G. 


Choose vp € V(G). 
Initialize i = 0 and W = (vo). 
While (Eu; og Fvett (v;) # 0) 
While ([Ey, — Eien (vi)] — {em(vs)} #0) 
Choose edge e; = vj0i41 € [Ev, — Frese (vi)] — {ein (vi) }- 
W := W, (ei, Vi41) {Extend W to v;41 via edge e;.} 
t:=14+1 
Let ej = VUjVi41 = €in(U;) 
W := W, (€;, Viti) 


FACTS 


F28: Tremaux’s (maze search) Algorithm also operates with local information only: 
at vertex vu; reached by the walk W, the number Aw(e) of traversals of every e € Ey, by 
W is known. The output of Tremaux’s Algorithm is also a bidirectional double tracing. 


F29: Applying Tarry’s Algorithm to eulerian graphs G, with additionally having the 
local information Ay (e) of Tremaux’s Algorithm and choosing as next edge e; with 
minimal Aw (e;), then the list of the edges according to their second traversal yields an 
eulerian trail of G (for details of Facts F28 and F29, see [F191]). 

REMARK 


R17: A. S. Fraenkel’s Algorithm uses a counter in connection with a modification of 
Tarry’s Algorithm; its outcome is a covering walk using every edge at most twice. 


Covers, Double Covers, and Packings 


DEFINITIONS 


D23: A cycle cover of a graph G is a family S of cycles of G such that every edge of 
G belongs to at least one element of S. 


D24: A cycle cover S is a cycle double cover (CDC) if every edge of G belongs to 
exactly two elements of S. 
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D25: A cycle packing in G is a set of edge disjoint cycles in G. 


D26: A CDC S is called orientable if the elements of S can be cyclically oriented 
in such a way that every edge e is given opposite orientations in the two elements of S' 
containing e. 


CONJECTURES 

Cycle Double Cover Conjecture (CDCC): Every bridgeless graph has a CDC. 
Oriented Cycle Double Cover Conjecture: Every bridgeless graph has an oriented 
CDC. 

Strong Cycle Double Cover Conjecture: Every bridgeless graph has a CDC con- 
taining a prescribed cycle of the graph. 


Three Optimization Problems 


DEFINITIONS 


D27: Let G bea bridgeless edge-weighted graph with weight function w: E(G) > R*. 
The weight of a cycle C in G, denoted w(C), is given by w(C) = >> w/e). The 
e€E(C) 
weight of a cycle cover or cycle packing S is w(S}) = S> w(C). 
CES 
D28: The Minimum-Weight Cycle-Cover Problem (MWCCP) is to find a cycle 
cover S in G such that w(S) is minimum. 


D29: The Maximum-Weight Cycle-Packing Problem (MWCPP) is to find a 
cycle packing S such that w(S) is maximum. 


D30: The Chinese Postman Problem is to find a minimum-weight covering walk 
W in G where w(e) is counted as often as e is traversed by W (see §4.3). 


FACTS 


F30: [F186] Let G be a planar, bridgeless graph. Then G has an oriented CDC, and 
for any given cycle packing S, G has a CDC containing S as a subset. Thus, the Strong 
Cycle Double Cover Conjecture is true for planar graphs. 


F31: A bridgeless graph having a hamiltonian path admits a double cover with at 
most six even subgraphs ([Tar86]). Later on, it was shown in [HuKo95] that five even 
subgraphs suffice to double cover cubic graphs having a hamiltonian path. 


F32: [FlHa09] Every hypohamiltonian graph has a SCDC. 


F33: [FlGu85] The Undirected Chinese Postman Problem and the Maximum-Weight 
Cycle-Packing Problem are both solvable in polynomial time, and for planar, bridgeless 
graphs, the Minimum-Weight Cycle-Cover Problem can be solved in polynomial time. 


F34: [FlGu85] Let G be an edge-weighted graph with weight function w. If W is a 
solution of the Undirected Chinese Postman Problem and S a solution of the Maximum- 
Weight Cycle-Packing Problem, then w(S) = w(W) — 2w(Ea), where Eg C E(G) is the 
set of those edges used twice in W, and w(Ea):= >> w(e). 

e€Ea 
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F35: [FlGu85] For any planar, connected, bridgeless graph, if S is a solution of the 
Minimum-Weight Cycle-Cover Problem and W is a solution of the Undirected Chinese 
Postman Problem, then w(S) = w(W). 


F36: For any connected, bridgeless graph G with weight function w, if W is a solution 
of the Undirected Chinese Postman Problem and S is a solution of the Minimum-Weight 
Cycle-Cover Problem, then w(S) > w(W). The Petersen graph (§1.2) shows that the 
inequality can be strict (w(S) = 21 and w(W) = 20, for w = 1). 


Nowhere-Zero Flows 


DEFINITIONS 


D31: Let f: E(D) > R be given for a digraph D. The function f is called a flow if 
for every vE V(D), dX> f(a)= SYS f(a). 


ac Et ac€Ey 


D32: Let f: E(G) > N be given for a graph G. Let D be an orientation of G with 
ae € E(D) the directed edge corresponding to e € E(G), and define f’(a-) := f(e). 
Then f is an integer flow in the graph G if f’ is a flow in the digraph D. 


D33: An integer flow f in G is nowhere-zero if f(e) 4 0 for each edge e € E(G). 


D34: A k-flow is an integer flow f such that f(e) < k for each edge e € E(G). 


CONJECTURE 


Nowhere-Zero 5-Flow Conjecture (NZ5FC). Every bridgeless graph has a nowhere- 
zero 5-flow. [Tu54] 


FACTS 
F37: [Se8la] Every bridgeless graph has a nowhere-zero 6-flow. 


F38: [Tu54] In a plane graph G, a (proper) k-face coloring of G corresponds to a 
nowhere-zero k-flow, and vice versa. 


F39: A 3-regular graph G has a nowhere-zero 4-flow if and only if it is 3-edge-colorable, 
and it has a nowhere-zero 3-flow if and only if it is bipartite. 


F40: [Ja75, Ja79] Every 4-edge-connected graph has a nowhere-zero 4-flow because 
it contains a spanning eulerian subgraph E*. Likewise, it has a CDC containing the 
elements of a cycle decomposition of E* (in fact, Tutte conjectured that every 4-edge- 
connected graph has a nowhere-zero 3-flow; see §5.2.2). 


F41: ‘To prove or disprove the NZ5FC and CDCC, one can assume without loss of 
generality that the graphs are 3-regular. 
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F42: [Se79| Let G be a bridgeless, planar graph, and let f: E(G) > Z*. Then the 
following two statements are equivalent: 


(a) There exists a cycle cover S such that for every edge e € E(G), e belongs to 
exactly f(e) elements in S. 


(b) For every edge-cut Eg C E(G), 


>> fle) is even and S*> f(e) => max{f(e): e € Epo}. 
ec Eo 2 e€ Eo 


REMARKS 


R18: Double tracings in arbitrary connected graphs are the natural analogue to eu- 
lerian tours — Euler was already aware of that. Correspondingly, cycle double covers 
seem to be the natural analogue to cycle decompositions, yet their existence has been 
guaranteed so far only for certain classes of graphs, apart from the planar case. See 
[AlGoZh94], [Zh97], and [Zh12] for a thorough treatment of integer flows and cycle 
covers. 


R19: Nowhere-zero flows can be viewed as eulerian tours in an eulerian multidigraph 
derived from an appropriate orientation of the given graph G, by replacing every arc a, 
(corresponding to e € E(G)) by f(e) arcs with the same head and tail that a, has. 


4.2.5 Various Types of Eulerian Tours and Cycle 
Decompositions 


DEFINITION 


D35: Let G be an eulerian digraph and Dy a subdigraph of G. If for every v € V(G), 
an eulerian trail T of G traverses every arc of Do incident from v before it traverses any 
other arc incident from v, then T is called Do-favoring. 


FACTS 


F43: [Ko56] Let G be a connected graph with vertex-set V(G) = {v1,...,Un} and 
having an even number of edges. Then G is eulerian if and only if G' is the edge-disjoint 
union of graphs G1, G2 with degg, (vi) = degg, (vi), 1 < i <n; and if G is the union of 
two such graphs, then G has an eulerian tour in which the edges of G; and G2 alternate. 


F44:; [Se81b, FlFr90] A planar even graph G has a decomposition into even cycles if 
and only if every block of G has an even number of edges. 


F45: Let v be an arbitrary vertex of a strongly connected digraph G. Then there exists 
a spanning in-tree of G with root v. 


F46: [EhBr51] Let D’ be a spanning in-tree with root v in the eulerian digraph G, and 
let Do = G— E(D’). Then there exists a Do-favoring eulerian tour of G starting and 
ending at v. 
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F47: [CaF195] Let {e1,...,¢em} C E(G) be an ordered set where G is eulerian. An 
eulerian tour T of the form T = ...,€1,...,€2,..-,@€m,--- exists if the edge-connectivity 
MG) > m— 1; and if A(G) > 2m, then one can even prescribe the direction in which 
these m edges are traversed by T. 


REMARKS 
R20: Fact F43 can be proved using the Splitting Lemma (Fact F11). 


R21: Fact F44 is stated for planar graphs, but it can be extended to a more general 
class of graphs (see [Zh97]). 


R22: Do-favoring eulerian tours are studied in [FlWe89, F190]. However, in-trees are 
a special case of a more general class of digraphs D’ for which there is a (G — E(D’))- 
favoring eulerian tour. We restricted Fact F46 to in-trees because of its relevance to 
enumerating eulerian tours in digraphs (see the BEST-Theorem [Fact F18]). 


Incidence-Partition and Transition Systems 


DEFINITIONS 


D36: For each vertex v in a graph G, let P(v) = {Fi(v),...,Ex,(v)}, kv > 1, be 


a partition of the incidence set E,. Then P(G) = LU P(v) is called an incidence- 
vEeV 
partition system of G. 


D37: A transition system of an even graph G, denoted 7(G), is an incidence- 

partition system 7(G) = LJ P(v) such that for every v € V(G), |E;(v)| = 2 for every 
vEeV 

cell of the partition P(v). Each cell E;(v) is called a transition. 


D38: An eulerian tour T and a cycle decomposition S give rise to transition systems, 
denoted rr and Ts, respectively, in a natural way. Each transition in the eulerian-tour 
transition system Tr is a pair of consecutive edges in the tour 7. Similarly, each 
transition in the cycle-decomposition transition system Tg is a pair of consecutive 
edges ina cycle CE S. 


TERMINOLOGY: A transition in 77 and a transition in Tg are referred to as a transition 
of T and a transition of S, respectively. 


D39: Let P(G) be an incidence-partition system of a graph G. An eulerian tour T 
is P(G)-orthogonal (or orthogonal to P(G)) if no transition of T is a subset of any 
cell E;(v) of P(G). P(G)-orthogonal cycle decompositions are defined analogously. 


D40: A cycle decomposition S and an eulerian tour T are orthogonal if ts N tr = 0. 


TERMINOLOGY: The term orthogonal has been suggested by several authors as describing 
the underlying concept more accurately than the original term compatible. 
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D41: An incidence-partition system P(G) satisfies the cut condition if for every 

ae = 1 = 
vertex subset X, the edge-cut E(X,X) satisfies |E(X,X)M E;(v)| < E(x, *)| for 
every cell E;(v) of P(G). 


FACTS 


F48: [Ko68] A loopless eulerian graph G has an eulerian tour orthogonal to a given 
partition system P(G) if and only if P(G) satisfies the cut condition restricted to the 
edge-cuts Ey, v € V(G). 


F49: [F180] Given a cycle decomposition S of the eulerian graph G with deg(v) > 2 
for every v € V(G), there exists an eulerian tour orthogonal to S. 


F50: [F180] Let T be an eulerian tour of the eulerian graph G. If deg(v) = 0 mod 4 
for every v € V(G), then there exists a cycle decomposition orthogonal to T. 


F51: [FIlFr90] Let G be a planar, even, loopless graph with incidence-partition system 
P(G). Then G has a P(G)-orthogonal cycle decomposition if and only if P(G) satisfies 
the cut condition. 


F52: [F180] Let G be a planar eulerian graph and let T be an eulerian tour of G. If 
deg(v) > 2 for every v € V(G), then G has a cycle decomposition orthogonal to T. 
EXAMPLE 

E3: The complete graph Ks in Figure 4.2.7, with transition system T(Ks) = {{i,i+1}, 


{’,(@@+1)’}: 1<i< 5, setting 6 = 1} has no 7(K;5)-orthogonal cycle decomposition, 
which shows that Fact F51 cannot be generalized to arbitrary non-planar graphs. 


Figure 4.2.7: Ks having no r(K;)-orthogonal cycle decomposition. 
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REMARKS 


R23: To produce a cycle decomposition $ orthogonal to a given eulerian tour in a 
graph with deg(v) = 0 mod 4 for every v € V(G), one can apply a procedure developed 
by J. Petersen in his celebrated paper [Pe1891]: Color the edges of T alternately blue 
and red, and combine a cycle decomposition of the blue even graph with one of the red 
even graph. 


R24: Fact F50 follows from Fact F43 by using the classical characterization (Fact F1). 


R25: Fact F49 is basically a special case of Fact F48. We stated it separately because 
its converse (given an eulerian tour T, there exists a cycle decomposition orthogonal to 
T) is an open problem known as Sabidussi’s Compatibility Conjecture. Its relevance to 
other open problems such as the Cycle Double Cover Conjecture and the Nowhere-Zero 
5-Flow Conjecture is discussed in [F184, F188, F101, F102]. 


R26: Facts F48 and F51 show that the existence of eulerian tours satisfying certain 
restrictions does not necessarily imply the existence of cycle decompositions satisfying 
the same restrictions: Fact F48 relates to arbitrary loopless graphs and uses the cut 
condition only locally, whereas in Fact F51, the full strength of the cut condition is 
invoked. 


R27: While Facts F51 and F52 have been formulated for planar graphs only, they can 
be extended to a somewhat more general class of graphs (see [Zh97]). 


R28: P(G)-orthogonal eulerian tours in digraphs have been studied in [F190]. Natu- 
rally, due to the appearance of arcs instead of edges, somewhat stronger conditions than 
the cut condition of Definition D41 are needed to prove the existence of P(G)-orthogonal 
eulerian tours. 


R29: Fact F51 can be viewed as a generalization of Fact F42 because one obtains 
an eulerian planar graph by replacing every e € E(G) by f(e) parallel edges and by 
defining the incidence-partition system correspondingly. 


Orderings of the Incidence Set, Non-Intersecting Tours, and 
A-Trails 


DEFINITIONS 


D42: Given a graph G and a vertex v, a fixed sequence (€1, €2,..., €deg(v)) of the edges 
in the incidence set EF, is called a positive ordering of E, and is denoted O*(v). If 
G is imbedded in some surface, one such Ot (v) is given by the counterclockwise cyclic 
ordering of the edges incident on v. 


D43: Let G be an even graph and v a vertex with deg(v) > 4 and with a positive 
ordering of its incident set E, given by OT(v) = (1, €2,...,€deg(v))) A transition 
system 7(G) is non-intersecting with respect to Ot (v) if for any e;,e;,e%,e1 € Ey 
withi<j<k<l, {e;,e,} and {e;,e,} cannot both be transitions of r(G). That is, 


{ei,en} € T(G) => {e;,e1} € T(G) 
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D44: Let G be an even graph with a given positive ordering O*(v) for each v EV. A 
transition system 7(G) is non-intersecting if 7(G) is non-intersecting with respect to 
O*(v) for every v € V with deg(v) > 4. An eulerian tour T and a cycle decomposition 
S are non-intersecting if their corresponding transition systems, 77 and Ts, respectively, 
are non-intersecting. 


D45: Let G be an eulerian graph with a given positive ordering O*(v) for each uv € V. 
An eulerian tour T is an A-trail if {e;,e;} € rr implies j =i+1 or 7 =i—1 (modulo 
deg(v)). 


D46: An outerplanar graph is a graph with an imbedding in the plane such that 
every vertex appears on the boundary of the exterior face. 


D47: A graph (imbedding) triangulates a surface if every region is 3-sided. 


EXAMPLE 


E4: An A-trail T in the octahedron, given by the sequence 1, 2,3,...,11,12, is shown 
in Figure 4.2.8 below. A cycle decomposition orthogonal to T is given by the sets 
{2,6,10}, {4,8, 12}, {1, 11,9, 7,5, 3}. 


Figure 4.2.8: An A-trail in the octahedron. 


FACTS 


F53: Given an eulerian graph G and Ot(v) for every v € V(G), a non-intersecting 
eulerian tour exists. 


F54: In an eulerian graph with deg(v) < 4 for every v € V(G), the concepts of 
non-intersecting eulerian tour and A-trail are equivalent. 


F55: [AnF195] The decision problem whether a given simple, planar, 3-connected 
eulerian graph has an A-trail is NP-complete. 


F56: [AnFlRe98] Simple, outerplanar, eulerian graphs have A-trails; they can be 
constructed in polynomial time. 


Section 4.2. Eulerian Graphs 277 


F57: Let G be a simple eulerian graph that triangulates the plane. Suppose that G 
has maximum degree Omax(G) < 8 with at most one 8-valent vertex, which, if it exists, 
is adjacent to a 4-valent vertex. Then G has an A-trail. 


REMARKS 


R30: Facts F48, F49, and F53 can be proved by employing the Splitting Lemma 
(Fact F11). Consequently, algorithms for constructing eulerian tours that are based on 
the Splitting Lemma, can be modified so as to yield P(G)-orthogonal eulerian tours or 
non-intersecting eulerian tours. 


R31: There is a 1-1 correspondence between transition systems 7(G) of the even graph 
G and the decompositions of E(G) into closed trails: traversing edges of G following 
the given transitions results in closed trails, one at a time; together they form a decom- 
position into closed trails. Likewise, each of these trails defines a subset of 7(G) (for a 
given 7(G)), and since these trails are edge-disjoint, the union of the subsets is 7(G). 


4.2.6 Transforming Eulerian Tours 
The Kappa Transformations 


The kappa transformations consist of various combinations of splitting, splicing, and 
reversing closed trails. They form the basis for constructing eulerian tours and for 
transforming one eulerian tour into another. For a detailed discussion, see, e.g., [F190]. 


DEFINITIONS 
D48: The reverse of a trail T = (vo, €1,U1,.--, 1, 1) is the trail 
T 1 = (uy, 1, 1, ---,€1, Vo) 
D49: Let T = (...,€;, Ui, €:41,---,€j, Uy, €j+1,---) be an eulerian tour in a graph G 


such that v; = vj, and consequently, {e;,e:+1,¢€;,€;+41} C Ey,. The closed subtrail 
(Uj, €i41,+-+,€7,U;) is called a segment of tour T and is denoted S;,;. 


D50: A segment reversal (or «-transformation) is the replacement of one of the 
segments in an eulerian tour T’ by its reverse segment. The resulting eulerian tour is 
denoted «(T). Thus, if tour T = (...,e:,54,3,ej41,---), then 


K(T) = (eld peu Beez i i8) 


D51: Let T = (...,e;,5:,3,¢€)41,.--) be an eulerian tour of a graph G with segment 
Si,;. The 2-cell partition of E(G) consisting of the edge set of S;,; and the edge set of the 
(“rest of the way around”) segment $j, = (vj, €;41,---,€:, Vi) is called a k-detachment 
and is denoted «/(T). 
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D52: Given a trail decomposition of L(G) into closed trails T),...,T,, k > 2, choose 
trails T; and T; such that v € V(T;) N V(Z;) for some vertex v. Let €m i, Em+i € Ev 
be consecutive in T;, and €,,;, €n41,7 € Ey, consecutive in T; (i.e., they are transitions 
of their respective trails). Thus, we may write T; = (...,€m,i,U,€m+1,---) and Tj = 
(U,€n,j.+++,€n41,7,U)- A splice at v of trail T; into trail T; (or the k-absorption at v 
of T; by T, ;) is either one of the closed trails: 


splice(T;, Tj, v) = (iy Omaplpvemtie .) 
splice(T;, rr, v) => (. »+5€m,iy die Em4+1i-- .) 


NOTATION: Either one of the closed trails that result from a splice of T; into Tj is 
denoted «/"({T;, T;}). 


D53: Let T be an eulerian tour in a graph G. An eulerian tour T’ is obtained 
from T by a «*-transformation, denoted T’ = «*(T), if there exists a «-detachment 
K/(T) = (Si, Sy at such that P= K'NLSi 5, Saat That is, T’ = k* (T) = kK "(K/(T)). 


D54: Let 7, and T2 be two eulerian tours in a graph G. Tour T> is obtained from Tj 
by a «1-transformation, denoted Tz = «1(T1), if either T2 = «(T1) or T2 = &*(T1). 


D55: Two eulerian trails, JT; and T>, are considered different if their corresponding 
transition systems are different, i.e., if tr, # Tr. 


REMARK 


R32: The various transformations defined above carry over to eulerian digraphs with 
the added restriction that each transition at a vertex v must comprise an arc incident 
to v and an arc incident from v. 


FACTS 


F58: Let T; and T2 be two different eulerian tours of an eulerian graph G (they exist 
unless G is a cycle). T2 can be obtained from T; by a sequence of «-transformations 
(see [AbKo80], [Sk84], [F190]). 


F59: Let G be an eulerian graph with a partition system P(G), and suppose that T; 
and T> are different P(G)-orthogonal eulerian tours. Then T> can be obtained from T; 
by a sequence of «1-transformations in such a way that any eulerian tour and any trail 
decomposition S with |S| = 2 arising in this sequence are P(G)-orthogonal. 


F60: Let G be an eulerian graph with a given positive ordering Ot(v) for every 
v € V(G), and let T; and Ty be different non-intersecting eulerian tours of G. Then T> 
can be obtained from T; by a sequence of «1-transformations in such a way that any 
eulerian tour and any trail decomposition S with |.S| = 2 arising in this sequence are 
non-intersecting. 


F61: Let 7, and 72 be two different eulerian tours in a digraph G. Then tour T> can 
be obtained from JT; by a sequence of «1-transformations. 
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F62: In 4-regular plane graphs, A-trails (which are non-intersecting eulerian tours 
in this case) are in 1-1 correspondence with spanning trees in an (easily constructed) 
auxiliary graph. The «,-transformations correspond to the edge-addition and edge- 
deletion process in transforming one spanning tree into another spanning tree. 


EXAMPLES 


E5: The complete bipartite graph K2 4 with eulerian tour T = (1,2,3,...,8) (written 
as edge sequence) is shown in Figure 4.2.9 (a). The transitions at v and w are marked 
with little arcs. Tour T is transformed into the eulerian tour T’ = (1, 2,3, 4,8, 7,6, 5) 
by a «-transformation (segment reversal) at v (see Figure 4.2.9 (b)). 


(a) T (b) T'=K(T) 


Figure 4.2.9: T and T”’ in Example E5. 


E6: The tour J in Example E5 is a non-intersecting eulerian tour. By a «-detachment 
at v, one obtains the non-intersecting trail decomposition S = {71,72} with T, = 
(1,2,3,4) and Tz = (5,6,7,8) (written as edge sequences; Figure 4.2.10 (a)). A K- 
absorption at w results in T” = (1,2,7,8,5,6,3,4), another non-intersecting eulerian 
tour (Figure 4.2.10 (b)). 


(a) S={F,T2}, S=K'(T) (b) T"=K\(T)=K"(S) 


Figure 4.2.10: S and T” in Example E6. 
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Splicing the Trails in a Trail Decomposition 


We close the section with an eulerian-tour construction by A. Tucker that starts with a 
closed-trail decomposition and iteratively splices pairs of trails together (i.e., performs 
«-absorptions) until there is only one trail left. 


Algorithm 4.2.5: Tucker’s Algorithm [Tuc76] 


Input: eulerian graph G. 
Output: eulerian tour 7. 
Produce a trail decomposition of G by forming an arbitrary 2-regular 
detachment H of G. 
Let W = {T,,...,T),} be the set of components of H. 
While k > 2 
Choose T;,T; with i 4 j such that V(T;) N V(Z;) 4 9. 
Let v € T, 9 Dis, 
Let T;,; =* (Tj,T;) {a «-absorption at v}. 
W = WU {Th 5} — {Ti T5}- 
k:=k-1 
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INTRODUCTION 

The Chinese Postman Problem (CPP) is one of the more celebrated problems in 
graph optimization. It acts as a useful model in an array of practical contexts such as 
refuse collection, snow removal, and mail delivery. The basic problem was first posed 
by the mathematician Guan (or Kwan Mei-Ko) in 1962 [Gu62]; hence the problem was 
dubbed “Chinese” by Jack Edmonds ([Ed65-a]), based on a suggestion by his supervisor 
Alan Goldman at the U.S. National Bureau of Standards. 


4.3.1 The Basic Problem and Its Variations 


DEFINITIONS 


D1: A postman tour in a graph G is a closed walk that uses each edge of G at least 
once. 


D2: Given a finite graph, G = (V,E), with edges weighted as w : E > Rt, the 
Chinese Postman Problem seeks a minimum-weight postman tour. 


TERMINOLOGY NOTE: The basic problem is sometimes simply referred to as a postman 
problem. 


D3: The undirected version of CPP (UCPP) assumes that the instance graph 
G is an undirected graph. 
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D4: The directed version of CPP (DCPP) assumes that the instance graph G 
is a digraph. 


D5: The mized version of CPP (MCPP) assumes that the instance graph G is 
a mized graph, that is, some edges are directed, and some edges are undirected. 


FACTS 
F1: UCPP and DCPP are polynomial-time solvable. (See §4.3.2 and §4.3.3.) 


F2: MCPP is NP-hard. (See §4.3.4.) 


The Eulerian Case 

DEFINITIONS 

D6: An eulerian tour in a graph (or digraph) G is a closed walk that uses each 
edge (or arc) of G exactly once. An eulerian tour in a mixed graph is a closed walk that 
uses each edge and each arc exactly once. 


D7: A graph, digraph, or mixed graph that has an eulerian tour is called eulerian. 


Ds: A digraph G is strongly connected if for every two of its vertices, u and v, 
there is a directed walk from u to v and one from v to u. 


D9: If G = (V,E£) is a digraph, a vertex v € V is symmetric if the indegree and 
outdegree of v are equal. 


TERMINOLOGY NOTE: If every vertex in a digraph G is symmetric, then G is sometimes 
referred to as a symmetric digraph. However, in other contexts, “symmetric digraph” 
is sometimes taken to mean (x,y) € E > (y,x) € E for all z,y eV. 


FACTS 
F3: A connected graph G is eulerian if and only if every vertex of G has even degree. 
F4: A strongly connected digraph G is eulerian if and only if G is symmetric. 


F5: If the instance graph G (undirected, directed, or mixed) is eulerian, then CPP is 
solved by producing an eulerian tour. 


REMARK 


R1: A characterization of eulerian mixed graphs is given later in the subsection Mixed 
Postman Problems. 
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Variations of CPP 


DEFINITIONS 


D10: open postman tour: The postman is required to start and end at distinct 
vertices of the graph (or digraph). 


D11: not requiring a specified edge: A specified edge is not required to be in an 
admissible tour but its inclusion is at least permitted. 


D12: a specified edge cannot be duplicated: A specified edge is required to be 
present in the postman tour but cannot be duplicated (i.e., cannot be traversed more 
than once). 


D13: windy postman problem: Instances of UCPP place no restriction on the 
direction of traversal along an edge. This does not suggest, however, that in a practical 
application, the postman need necessarily experience the same “cost” of traversal in 
both directions (suppose the edge-weight metric that is relevant is not distance but 
rather time). If one allows edge weights to differ depending upon which direction an 
edge is traversed, the problem becomes the windy postman problem. 


D114: rural postman problem: This variant, also motivated by practical settings, 
arises when rather than requiring that all edges or arcs be traversed at least once, only 
a given subset has to be used. This version derives its name from the apparent case of 
postal delivery in non-urban settings where, perhaps, the postman may have to traverse 
every street within a small town or village and then move on to another one but can do 
so by selecting any of a number of connecting roads (edges) that exist to connect the 
towns. 


D15: stacker crane problem: This is the rural postman problem for mixed graphs. 


FACTS 


F6: The open postman problem remains polynomial-time solvable. If vj and v2 are 
the pre-specified source and destination vertices, then simply add to G an artificial edge 
from v2 to v; and assign the new edge a weight of M, where M is sufficiently large. 
Clearly, in the application of the algorithm, this artificial edge would never be part of 
a shortest path and hence, would never be duplicated. In the resultant G, one would 
simply find an optimal postman tour and then remove the artificial edge from this tour 
to obtain the desired open postman tour from v,; to ve. 


F7: For the second and third variations above, where a specified edge is not required 
or cannot be duplicated, the problem remains polynomial-time solvable (cf. [EdJo73}). 


F8: [Wi89] The windy postman problem is VP-hard although solvable in polynomial 
time if instances are eulerian. 


F9: The rural postman problem is VP-hard on both graphs and digraphs even if all 
edges/arcs have the same weight (see [GaJo79]). 
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REMARK 


R2: It is important to note that the list of extensions presented here is not exhaustive. 
Additional variations to these sorts of problems in general are often easy to create 
whether motivated by purely combinatorial interests or ones more pragmatic, stemming 
from a given practical setting. A good starting place for a sense of the breadth of 
cases, degree of analysis, and categorization of results is the rather expansive survey in 
[EiGeLa95-a] and [EiGeLa95-b]. Extensive coverage for the Chinese Postman Problem 
and its variations may also be found in [F191] and [Dr00]. 


4.3.2 Undirected Postman Problems 


The solution posed by Guan, though clever, was not fast in the universally adopted, 
complexity-theoretic sense (i.e., not polynomial time in the size of the input graph). 
This flaw was pointed out by Edmonds ([Ed65-a]), who then proposed a polynomial- 
time algorithm for the problem. 


DEFINITIONS 


D16: <A matching M in a graph G is a subset of edges no two of which have a 
common vertex. (Matchings are discussed in §11.3.) 


D17: A matching is perfect if every vertex in G is incident to some edge in the 
matching. 


Algorithm 4.3.1: Solution to UCPP 


Input: Connected graph G with positive edge weights. 
Output: Minimum-weight postman tour in G. 
Let Vo be the set of vertices with odd degree in G. 
For each pair of vertices z,y € Vo 
Find a shortest path P in G between x and y. 
Form a complete graph K on the vertex set Vo with edges weighted by 
the respective shortest-path lengths. 
Find a minimum-weight perfect matching M in Kk. 
For each edge e € M 
Duplicate the edges in G of the shortest path P corresponding to e. 
Let G be the resulting super(multi)graph. 


Produce an eulerian tour in G. 


COMPUTATIONAL NOTE: The shortest path computation in Algorithm 4.3.1 is straight- 
forward and fast. Producing a minimum-weight perfect matching in a graph, although 
complicated, can be accomplished in polynomial time following the seminal work by 
Edmonds (cf. [Ed65-b],[Ed65-c]). The implementation of the step relative to traversal- 
finding in G is also easy. The following strategy by Fleury (cf. [Ka67]) can be applied 
recursively: Given a position in the walk, select the next edge arbitrarily so long as 
its removal would not disconnect the graph G, unless this is the only choice. (Fleury’s 
algorithm appears in §4.2.) 
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EXAMPLE 


E1: Consider G on the left in Figure 4.3.1; weights are specified directly on the edges. 
The set Vo is given by {v1, v3, v4, Us} and the stated shortest paths along with the path 
lengths result as follows: 


Vertex Pair Path Length 
U1, U3 V1, €1, V2, €2, V3 3 
U1, U4 U1, €1, V2, €7, U5, C4, U4 5 
U1, U5 U1, €1, V2, €7, U5 3 
U3, U4 U3, €3, V4 3 
U3, U5 U3, €2, V2, C7, U5 2 
V4, U5 U4, €4, U5 2 
Uy v2 
2 €2 
1 
v3 Vy a 3 UZ Vy U3 
é . s 
4 1 &8 3 3 i) 
€5 €3 d 3 
€4 
5 e 2 
U5 A U4 Us, U4 Us U4 


Figure 4.3.1: Application of Algorithm 4.3.1. 


An optimal matching in the complete graph K4, shown in the middle of Figure 4.3.1, 
consists of edges a and c, having total weight 5. These edges correspond to the paths 
U1, €1, V2, €2, V3 and v4, e4,U5. The respective edges in these paths are duplicated in G, 
producing the multigraph, G, shown on the right in Figure 4.3.1. The latter is eulerian, 
and an eulerian tour (of total weight 30) is given by the walk below: 


U1, €1, V2, €2, U3, €2, V2, €1, V1, €6, U3, €3, U4, €4, US, E7, V2, €8, U4, €4, U5, €5, V1 


REMARKS 


R3: The stipulation that edges be weighted by positive values cannot be relaxed, 
because otherwise negative weight closed walks will occur in G (by simply going back and 
forth on such an edge), which creates an intractability in the shortest path computation 
(see [GaJo79]). 


Ra: Implementation of the Fleury traversal strategy requires some attention due, 
largely, to the requirement to test the stated connectivity stipulation. An alternative 
that relaxes this complication was proposed by Edmonds and Johnson ([EdJo73]). 


R5:  Trivially, a necessary condition for a perfect matching to exist in a graph is that 
the graph possesses an even number of vertices. Since we may take G, in any interesting 
instance of UCPP, to be connected, there must be a path between every pair of vertices. 
Hence the complete graph specification is clear and since |Vo| is even, it follows that 
the perfect matching step of the procedure is well-defined. 
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R6: Assuming a correct application of Algorithm 4.3.1, a postman would never 
traverse an edge more than twice in an optimal walk. 


COMPUTATIONAL NOTE: _ If double-traversing occurred, the supergraph, G, would have 
an edge from G duplicated more than once. But this would deny that G had been 
constructed correctly since two of the duplicated copies could be eliminated, leaving a 
connected graph with the same (even) degree parity everywhere and with smaller weight 
than G. 


4.3.3 Directed Postman Problems 


The strategy for solving the DCPP is analogous to the one used for the undirected 
case. If the digraph is not symmetric, then a minimum-weight arc duplication produces 
a symmetric super(multi)digraph. The number of copies of each arc is determined by 
solving a circulation problem. 


FACTS 
F10: Since easy to test, we may take G to be strongly connected. 


F11: The multigraph G produced from a correct application of Algorithm 4.3.2 below 
is symmetric; obviously it remains strongly connected. 


COMPUTATIONAL NOTE: The circulation problem in Algorithm 4.3.2 is easily solved 
by standard network flow techniques (Chapter 11). 


Algorithm 4.3.2: Solution to DCPP 


Input: Strongly connected digraph G with positive arc weights. 
Output: Minimum-weight postman tour in G. 
If G is symmetric 
Produce an eulerian tour in G. 
Else 
For each vertex vz, set by = indegree(v,) — outdegree(vz). 
Solve the following circulation problem: 


minimize S- WijLiz 
(vi,vj)€E 
Sit; S- Lei — S- Tip = by for ve EC V 
(vz,vj)EE (vi,0b)EE 
Lig = 0 


For each pair of vertices v;,v;, add x;; copies of arc (v;,v;) to G. 
Call the resulting super(multi)digraph G. 


Produce an eulerian tour in G. 


Producing an Eulerian Tour in a Symmetric (Multi) Digraph 


Algorithm 4.3.2 requires the traversal of an eulerian tour in the original digraph (if it 
is symmetric) or in a symmetric multidigraph. This can be accomplished by applying 
Algorithm 4.3.3 below (cf. [EhBr51)). 
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DEFINITION 


D18: An intree is a connected, acyclic digraph where the outdegree of every vertex 
is at most 1. 


Algorithm 4.3.3: Producing a Tour in an Eulerian Digraph 


Input: Eulerian digraph G. 
Output: Eulerian tour in G. 
Select any vertex in G and denote it by v*. 
Form an intree T' that spans G and that is rooted at v*. 
For each vertex w in G, w #0", 
Label the out-arcs from w randomly with consecutive integers 
subject to the restriction that the last (highest) label is given 
to the arc in intree T. 
Label the out-arcs from v* arbitrarily. 
Starting at vertex u*, trace an eulerian tour in G by always selecting 
the untraversed out-arc with the smallest label. 


COMPUTATIONAL NOTE: An easy way to form an intree T of digraph G is to start with 
T = {v*} and proceed iteratively: select at each iteration an arc in G that is directed 
from a vertex in V(G) — V(T) to a vertex in T; repeat until T spans G. 


EXAMPLE 


E2: Consider the instance digraph in the upper left of Figure 4.3.2. Specified beside 
each vertex is the respective value for b,. Solving the explicit circulation problem defined 
in Algorithm 4.3.2 produces the following outcome: #5; = #34 = 1; @45 = 2; and x; = 0 
elsewhere. Copies of the respective arcs are added, forming the multigraph shown in the 
upper right of the figure. Applying Algorithm 4.3.3 and selecting (arbitrarily) vertex 
v4 aS a root, an intree JT’ is constructed and shown at the bottom of Figure 4.3.2. The 
stated arc-labeling scheme is applied with labels affixed to the arcs in the multidigraph. 
Starting with vertex v4 and proceeding in label order produces an eulerian tour specified 
(unambiguously) by the following vertex sequence: 


U4, U5, U1, U3, U4, U5, U1, V2, U3, U4, U5, V2, V4 


REMARKS 


R7: The network flow formulation in Algorithm 4.3.2 is due to Edmonds and Johnson 
([EdJo73]). 


R8:  Trivially, a correct application of Algorithm 4.3.2 may require that an arc be 
duplicated several times. 


R9: In applying Algorithm 4.3.3, the requirement that a tour be traced beginning 
with vertex v* and proceeding in label order cannot be casually relaxed. For instance, 
if one starts from vertex v3 on the labeled digraph in the upper right in Figure 4.3.2, 
any tour generated will violate the label ordering. 
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. 
i a 
Figure 4.3.2: Applications of Algorithms 4.3.2 and 4.3.3. 


4.3.4 Mixed Postman Problems 


FACTS 


F12: The mixed postman problem, MCPP, is V’P-hard; the reduction is from the 
3-satisfiability problem (cf. [Pa76]). Papadimitriou’s reduction makes use of a widget 
(a special subgraph) that has the following property: any optimal postman tour must 
traverse the widget in one of only two possible ways. The reduction creates one copy 
of the widget for each variable appearance within the 3-satisfiability instance, and the 
two possible ways to traverse the widget correspond to this variable having value 0 or 
1. The remainder of the reduction provides edges that connect the various copies of the 
widget to ensure consistency of the values chosen for all the appearances of the same 
variable. See [Pa76] for further details. 


F13: MCPP remains \VP-hard even on planar graphs with no vertex (total) degree 
exceeding 3 and with all edge weights the same (see [Pa76]). 


Deciding if a Mixed Graph Is Eulerian 


DEFINITIONS 


D19: The total degree of a vertex v in a mixed graph G is the total number of arcs 
and undirected edges incident on v. 


D20: A mixed graph is even if the total degree of each of its vertices is even. 


D21: ~~ A vertex in a mixed graph is symmetric if its indegree and outdegree are 
equal. 


TERMINOLOGY NOTE: A mixed graph is said to be symmetric if all of its vertices are 
symmetric. 
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D22: A mixed graph G satisfies the balance condition if for every S C V(G), the 
difference between the number of arcs from S to V(G) — S and the number of arcs from 
V(G) — S to S is no greater than the number of undirected edges joining vertices in S 
and V(G) — S (cf. [FoFu62]). 


FACTS 


F14: A (strongly) connected, mixed graph G is eulerian if and only if G is even and 
satisfies the balance condition. 


F15: Mixed graphs that are even and symmetric are balanced. 


EXAMPLE 


E3: Clearly, the even-degree condition is necessary for a mixed graph to be eulerian 
while symmetry at each vertex is not. The graph in Figure 4.3.3 illustrates. An eulerian 
tour is specified by the vertex sequence v1, U5, U6, U1, V2, U3; V4, U5, U3; UL- 


U1 v3 


UE Us vA 
Figure 4.3.3: An eulerian mixed graph. 


COMPUTATIONAL NOTE: The (nontrivial) requirement in the mixed-graph case is to 
create a graph that satisfies the symmetry condition at each vertex or show that this 
is not possible. That is, we seek to orient some undirected edges in such a way that 
symmetry is created, albeit artificially. There is an easy network flow formulation that 
will do this or correctly conclude that no such orientation is possible. 


EXAMPLE 


E4: The application of Algorithm 4.3.4 on the (mixed) graph in Figure 4.3.3 is illus- 
trated in Figure 4.3.4. In the upper graph in the figure, the values b; are written beside 
each vertex. The non-zero variables x23 and 73; induce the specified orientation for the 
original, undirected edges (v2, v3) and (v1, v3), as indicated by the lower graph in the 
figure. The existing eulerian tour can be found using Algorithm 4.3.3 (see Remark R11 


below) . U1 U3 


al 0 


vé U5 v4 


Figure 4.3.4: Application of Algorithm 4.3.4 on the graph in Figure 4.3.3. 
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Algorithm 4.3.4: Deciding if a Mixed Graph Is Eulerian 


Input: An even and strongly connected mixed graph G. 
Output: An orientation of some or all of the undirected edges of G that is 
eulerian or a conclusion that no such orientation is possible. 
For each vertex vz, set by = indegree(vz,) — outdegree(v,). 
Replace each undirected edge in G by a pair of oppositely oriented arcs. 
Let U be the set of these new pairs of arcs. 
Solve the following network flow problem P;: 


minimize ) Xij 
(v4,0j)EU 
s.t. y Lj — y Vip = by for vz, € V 
(vz vj )EU (v4,0K)EU 


0< rij <1 for (U;,0;) EU 


If P; has an admissible solution (i.e., G is eulerian) 
For each undirected edge {v;, v;} 
Orient edge {v;,v;} so that it is directed from v; to v;. 
Else if Lyi = 1 
Orient edge {v;,v,;} so that it is directed from v; to 1. 
Else 
Leave edge {v;,v,;} undirected. 
Else (P, has no admissible solution) 
Conclude that G cannot be made eulerian through edge orientation. 


REMARKS 


R10: The second part of the balance condition in Definition D22 asks that for 
every subset of vertices, a lack of symmetry (a difference between total indegree and 
total outdegree of vertices in the subset) must be made up for by some or all of the 
undirected edges joining vertices in the subset to those outside. 


R11: If the orientation produced by Algorithm 4.3.4 results in a graph with all edges 
directed, then an eulerian tour is produced by employing the strategy described pre- 
viously in the case of eulerian digraphs (Algorithm 4.3.3). Alternatively, if Algorithm 
4.3.4 outputs a graph with some undirected edges remaining, proceed as follows: First, 
for each connected component induced by the remaining undirected edges, find an eule- 
rian tour of that component, and then orient each edge in the direction it is traversed. 
Now all the edges in the original mixed graph G have been oriented, so apply Algorithm 
4.3.3 to the resulting digraph (cf. [EdJo73]). 


The Postman Problem for Mixed Graphs 


Since MCPP is NP-hard in general, options are few. We may have to look for special 
cases that do submit to polynomial-time resolution, or we will simply have to be less 
ambitious and settle for approximation algorithms, i.e., fast procedures that cannot 
guarantee optimal solutions but that will produce ones that are, in some well-defined 
sense, reasonably close to optimal. Of course, for instances of manageable size, it might 
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be feasible to resort to exact procedures. However, these approaches are inherently 
enumerative and will require effort that is exponential in the worst case (cf. [EiGeLa95- 
a], [EiGeLa95-b]). Such algorithms typically model the MCPP as an integer linear 
program, and then solve this ILP by adapting standard techniques such as the cutting- 
plane method (cf. [NoPi96], [Dr00}). 


DEFINITION 


D23: An algorithm is an approximation algorithm for a given problem if given 
any instance of the problem, it finds at least a candidate solution for the instance. 


REMARK 


R12: If the instance for MCPP is at least even but perhaps not symmetric, we can 
apply Algorithm 4.3.4 in order to test if symmetry at each vertex can be created. If 
so, the instance is eulerian (by Facts F14 and F15), and we can proceed accordingly. 
Otherwise, it is not eulerian, and we have to determine if it can be made so through 
some duplication of edges and/or arcs. 


FACT 


F16: A mixed graph G has a postman tour if and only if G is strongly connected. 


COMPUTATIONAL NOTE: ‘Testing for strong connectivity in mixed graphs can be done 
in polynomial time, because each undirected edge can be replaced by a pair of oppositely 
directed arcs, and then the (polynomial-time) algorithm for digraphs will apply. 


REFERENCE NOTE: The graphs that are employed in the remaining figures are either 
explicitly drawn from or are alluded to in an important paper by Frederickson ([Fr79]). 


EXAMPLE 


E5: To illustrate the problematical aspect unique to the mixed postman problem, 
consider the mixed graph in Figure 4.3.5, part a; all edges are assumed to have weight 
1. It is easy to see that no orientation exists for undirected edges that would create 
symmetry. Now, duplication of two arcs creates symmetry, as shown in part b of the 
figure; however, the resulting structure is not even so further duplication is required. 
On the other hand, the multigraph in part c also has only two arcs duplicated but is 
both symmetric and even; clearly, this graph is preferred. Unfortunately, it is not easy 
to distinguish, in any general way, its selection over the structure of part b. 


(a) (b) (c) 


Figure 4.3.5: Interaction between symmetry and even-degree. 
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COMPUTATIONAL NOTE: _ It is possible to deal with a few of these complications, albeit 
in somewhat ad hoc fashion, by employing various network flow formulations; relevant 
results are discussed in [EdJo73]. It is important to note that the prime contributor 
to the intractability of the general, mixed postman case is the ambiguity in effectively 
dealing with the interaction between symmetry and even-degree creation and/or preser- 
vation. 


Approximation Algorithm ES 


The following approximation algorithm combines an easy even-degree-creation phase 
followed by a more intricate, joint symmetry-producing/even-degree-preserving phase. 
The details are based on solving a min-cost flow problem which is somewhat complicated 
and beyond the scope of this section (cf. [EdJo73] and [Fr79]), and so in the statement 
of Algorithm 4.3.5, the step is simply referenced as “symmetric/even-parity.” 


Algorithm 4.3.5: Approximation Algorithm ES 


Input: Strongly connected, mixed graph G with positive edge/arc weights. 
Output: Admissible postman tour. 

Apply the even-degree-creation component of UCPP to the underlying graph 

of mixed graph G. 

Restore orientation to edges as specified in G. 

Let G be the resulting super(multi)graph. 

Operating on G, apply the symmetric/even-parity construction. 

Let G be the resulting graph. 


Produce an eulerian tour in G. 


TERMINOLOGY NOTE: The approximation procedure stated by Algorithm 4.3.5 is some- 
times referred to as the even-symmetric strategy, i.e., ES. 


REMARK 


R13: Since it cannot guarantee an optimal solution, it is interesting to consider the 
limit (if any) to how poorly Algorithm ES could perform. In fact, this was answered by 
Frederickson. 


FACT 


F17:  [Fr79] The ratio of the value of a postman solution produced by Algorithm ES 
to an optimal value cannot exceed 2. Importantly, the value of 2 is approachable as 
established by Example E6. 


EXAMPLE 


E6: Consider the mixed graph in Figure 4.3.6, part a, where edge weights are specified 
on the graph. The even-degree-creation phase of Algorithm ES duplicates the directed 
edges (considered undirected for the stated step), yielding the multigraph in part b. 
Operating on this graph to produce symmetry while preserving the even degree condition 
yields the structure in part c having total edge weight 4 + 12e. The eulerian tour in 
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this multigraph is not optimal, however. Had one been less greedy in the even-degree- 
creation application, duplicating instead the edges with weight 2, the structure in part 
d of the figure would have resulted, yielding an optimal multigraph and hence, a correct 
tour directly. Its weight is 2 + 10e. 


1 
[I 
a) () 
( 


(c) 


2€ 2€ 


Figure 4.3.6: Application of Approximation Algorithm ES. 


Approximate Algorithm SE 


A natural alternative approximation is to reverse the strategy proposed by Algorithm 
ES, yielding the following, symmetric-even approach (SE) (cf. [Fr79]). 


Algorithm 4.3.6: Approximation Algorithm SE 


Input: Strongly connected, mixed graph G with positive edge/arc weights. 
Output: Admissible postman tour. 
Create symmetry on mixed graph G, and denote resulting mixed graph G*. 
Let H be the subgraph induced on the undirected edges of G°*. 
Apply the even-degree-creation component of UCPP to H. 
Let G be the resulting even-degree, symmetric super(multi)graph of G°. 
Produce an eulerian tour in G. 


COMPUTATIONAL NOTE: The symmetry-creation construction of step 1 in Algorithm 
4.3.6 takes polynomial time (cf. [Fr79]) and employs the same symmetry-creation com- 
ponent of the symmetry/even-parity step in Algorithm 4.3.5. Recall this step is based 
on solving a min-cost flow problem, but again we omit the details here. 


EXAMPLE 


E7: Consider the graph in Figure 4.3.7, part a. A correct application of Algorithm 
SE produces the multigraph in part b. However, the structure in part c is optimal. 
The tour obtained using Algorithm SE has weight 4 + 2e, whereas the optimal tour has 
weight 2 + 3e. 
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Figure 4.3.7: Application of Approximation Algorithm SE. 


Some Performance Bounds 


FACT 


F18:  [Fr79] The ratio of the length of an outcome from Algorithm SE to an optimal 
tour value will also never exceed 2. Example E7 provides evidence that this value is 
approachable as well. 


EXAMPLE 


E8: Proceeding from left to right in Figure 4.3.8 on the respective input instances, it 
is evident that Algorithm SE solves a worst-case instance for Algorithm ES, while the 
latter achieves the same outcome on a worst-case instance for Algorithm SE. 


eeeye 


Application of Algorithm SE 


AAS 


Application of Algorithm E'S 


Figure 4.3.8: Algorithms ES and SE on each other’s worst-case instances. 


REMARKS 


R14: In the worst-case sense, both approximation procedures Algorithm 4.3.5 (ES) 
and Algorithm 4.3.6 (SE) perform the same. However, Example E8 demonstrates an 
interesting phenomenon: if each approximation is applied to a worst-case instance of 
the other, the outcome is that Algorithm ES solves (to optimality) the worst-case in- 
stance for Algorithm SE, while the latter, when operating on the worst-case instance 
for Algorithm ES, produces the optimal outcome. 


R15: The outcome of Example E8 motivates an obvious question which is stated 
loosely as follows: What if Algorithms ES and SE realize their respective worst-case 
behaviors on different classes of graphs? If this is the case, it is conceivable that they 
could be employed in a “composite” fashion where each strategy is applied separately 
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and the best outcome is then selected. Since each algorithm runs in polynomial time, 
the total time required to separately run both algorithms is also polynomial. Preserving 
polynomial time is meaningful because if this composite strategy is applied, the outcome 
does indeed yield an improvement in guaranteed performance. The first result of this 
sort is also due to Frederickson ({Fr79]) where it was shown that applying the two stated 
heuristics and selecting the best result would never produce a tour having length that 
when compared to an optimal value yielded a ratio in excess of 3. 


R16: In the same paper ([Fr79]), Frederickson also proposed a separate composite 
strategy for planar instances. The bound on its performance was shown to be 3. 


R17: When the 3 result in [Fr79] appeared, attempts to create an instance establishing 
realizability were not fruitful. The closest was a 3-inducing instance shown in Figure 
4.3.9; here the optimal tour has weight 4 + 13, but Algorithm ES yields a tour with 
weight 6 + 15, and Algorithm SE yields a tour with weight 6 + 12. Eventually, 
Raghavachari and Veerasamy (RaVe99]) were able to employ a modification of the stated 
Frederickson approximation to obtain a performance ratio bounded by 3. The instance 
in Figure 4.3.9 establishes tightness. 


Figure 4.3.9: A worst case for composite use of Algorithms ES and SE. 


R18: Following its attendant proof of intractability, MCPP is not likely to submit to 
any fast solution. However, as with any other provably ’P-hard problem, this attribute 
does not preclude the existence of nor the value in pursuing special cases which might 
prove to be quickly solvable. This is certainly the case for MCPP. 


COMPUTATIONAL NOTE: If input instances are confined to the class of recursively 
structured graphs, then it is possible to solve MCPP on members of this class and, in 
fact, by strategies requiring only linear-time effort (cf. [Co90], [ArLaSe91], [BoPaTo91], 
[BoPaTo92], [CoMo93], and §10.4). Typical recursive graph classes include trees, series- 
parallel graphs, Halin graphs, partial k-trees, bandwidth-k graphs, pathwidth-k graphs, 
and treewidth-k graphs. 


4.3.5 Recent Research 


All the results presented in the previous subsections of this section were obtained 
prior to the year 2000. Those results established a strong foundation for what today 
remains an active research area. Next we briefly summarize the current status of research 
in this area. 
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DEFINITIONS 


In addition to the (classical) variations of CPP given previously in Definitions D2 to 
D5 and also in Definitions D10 to D15, many additional variations have been developed 
to more closely match the constraints and objectives of certain real-world applications. 
Several such variations are defined below. As before, each variation can be applied to 
either an undirected or directed or mixed graph. Some of these variations were defined 
fairly recently, while others were defined long ago but have recently attracted greater 
attention. 


D24: generalized CPP: The edges are partitioned into subsets, and at least one 
edge from each subset must be traversed. 


D25: hierarchical CPP: Each edge has both a service time and a cruising time. 
The edges are partitioned into subsets, and a precedence relation (<) is given between 
these subsets. If £; < E; then all edges of £; must be serviced before any edges of E; 
are serviced. (See [EiGeLa95-a], [EiGeLa95-b].) 


D26: time-dependent CPP: The cost to traverse each edge (x,y) depends on the 
time t of departure from x, and is given by a function W;(2, y). 


D27: time-constraint CPP: Each edge (x,y) must be serviced no earlier than a 
specified start time S(x,y) and/or no later than a specified finish time F(z, y). 


D28: k-CPP: Generalizes the CPP to use k > 2 postmen, and designates a particular 
depot vertex. Every postman’s tour must include the depot vertex, and the goal is to 
minimize the total weight of the & tours. 


D29: min-max k-CPP: Same scenario as the k-CPP, except here the goal is to 
minimize the maximum weight incurred by any of the k tours. (See [FrHeKki78].) 


D30: capacitated CPP: Generalizes the k-CPP such that every postman’s vehicle 
has capacity C, each edge has both a weight and a demand, and the sum of the demands 
of each tour is at most C. 


D31: capacitated arc routing problem (CARP): Generalizes the capacitated 
CPP such that only a specified subset of edges must be included (as in the rural postman 
problem). (See [GoWo81], [AsGo95], [Wo08], [CoPr10].) 


D32: generalized routing problem (GRP): Generalizes the CARP such that both 
a required subset of edges and a required subset of vertices may be specified. 


REMARKS 


R19: Most recent papers in this area typically select one or more (A’P-complete) 
problem variations such as those given above, and then provide one or more of the 
following types of results: 


e Describe real-world applications to motivate the choices made in the definition of 
the problem. 


e Formulate the problem using a precise mathematical programming model, for 
example, an integer linear program. 


300 Chapter 4. Connectivity and Traversability 


e Design an optimal algorithm that is guaranteed to produce an exact solution 
(but may take exponential time). Often this is accomplished using one of these 
methods: branch-and-bound, cutting planes, or a hybrid (branch-and-cut). 


e Design an approximation algorithm that is guaranteed to produce a solution whose 
value is always within some known ratio of optimal. 


e Design a heuristic algorithm that has no guarantees, but that is expected to usually 
perform well in practice. This may be accomplished using genetic algorithms, 
simulated annealing, ant-colony algorithms, or other machine learning or ad hoc 
approaches. 


e Implement two or more proposed algorithms for the same variation, and perform 
experiments to empirically compare their performance. This may be done using 
benchmark data or by generating large random instances. 


Analyze the running time or approximation ratio of an algorithm. 


Develop better lower bounds for the value of an optimal solution. This can lead 
to better performance of exact algorithms such as branch-and-bound, and also 
enables better comparisons to determine the quality of non-optimal algorithms. 


R20: The subsequent list of references includes many papers which together encompass 
all the preceding problem variations, and also all the preceding kinds of results. 
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INTRODUCTION 

N. deBruijn solved the problem of finding a minimum-length binary string that 
contains as a (contiguous) substring every binary string of a prescribed length k. For 
this purpose, he prescribed a special directed graph, of in-degree 2 and out-degree 2, 
now called a deBruijn graph. In this section, we cover the basics of deBruijn graphs, 
two methods to generate deBruijn sequences, and applications to the generation of 
pseudorandom numbers and to genetics. 


4.4.1 DeBruijn Graph Basics 

DeBruijn Sequences 

DEFINITIONS 

D1: A deBruijn sequence of order k is a binary string of length n = 2* in which 
e the last bit is considered to be adjacent to the first, and 
e every possible binary k-tuple appears exactly once. 


Two deBruijn sequences are considered to be the “same sequence” if one can be obtained 
from the other by a cyclic permutation. 


D2: Ina string s of length m > k, the successor of a substring t of length k is the 
k-bit substring t’ that begins at the second bit of t. This is understood cyclically within 
8, so that if needed, the last bit of the successor substring t’ is the first bit of string s. 
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D3: Ina string s of length m > k, the k-tour is the sequence of substrings of length 
k, starting with the initial substring. Since this is understood cyclically within s, there 
are m substrings in the tour. 


D4: A k-bit string b is said to be obtained from a k-bit string a = aja2a3---az by a 
(left) shift operation if b; = aj41, fori =1,2,...,k—1. The bit b, may be arbitrary. 


D5: A left shift aja2...ay, —> b1bo...by, is a cycle shift if by = ay. 


D6: A left shift ajag...a, —> b1b2...bp is a deBruijn shift if b, A ay. 


FACTS 


F1: An obvious lower bound on the length of a deBruijn sequence of order k is 2", 
since there are 2" different bitstrings of length k, and since each bit in a sequence starts 
only one é-bitstring. 


F2: The successor of each k-bit substring t in a deBruijn sequence is either a cycle 
shift or a deBruijn shift of t. 


EXAMPLES 
E1: 00010111 is a deBruijn sequence of order 3. Its 3-tour is 
000, 001, 010, 101, 011, 111, 110, 100 


E2: 0000101101001111 is a deBruijn sequence of order 4. 


DeBruijn Graphs 


An intuitive approach to the problem of constructing a deBruijn sequence is to construct 
a graph in which a hamiltonian tour corresponds to such a sequence. 


DEFINITIONS 


D7: A deBruijn graph of order k, denoted by G(k), is a directed graph with 2* 
vertices, each labeled with a unique k-bit string. Vertex a is joined to vertex b by an 
arc if bitstring b is obtainable from bitstring a by either a cycle shift or a deBruijn shift. 
Additionally, each arc of G(k) is designated as a cycle-shift arc or a deBruijn arc, 
according to the shift operation it represents. Each arc is labeled by the first bit of the 
vertex at which it originates, followed by the label of the vertex at which it terminates. 


D8: The cycle-shift 2-factor in a deBruijn graph is the 2-factor formed by all of its 
cycle-shift arcs. 


D9: The deBruzn 2-factor in a deBruijn graph is the 2-factor formed by all of its 
deBruijn arcs. 
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EXAMPLE 


E3: Figure 4.4.1 below illustrates the deBruijn graph of order 3. 


0000 


1111 


Figure 4.4.1: A deBruijn graph of order 3. 


FACTS 


F3: The cycle-shift arcs form a directed 2-factor, because the cycle-shift operation acts 
as a permutation on the bitstrings. Similarly, the deBruijn arcs form a directed 2-factor. 


F4: Every vertex of a deBruijn graph has out-degree 2. The first bit of the label on 
one of the vertices to which it points is 0, and the first bit on the label of the other is 1. 


F5: Every vertex of a deBruijn graph has in-degree 2. 
F6: Every deBruijn graph is strongly connected. 
F7: Every deBruijn graph is hamiltonian. 


F8: The hamiltonian (directed) circuits in the deBruijn graph G(k) are in one-to-one 
correspondence with the deBruijn sequences of order k. The correspondence is realized 
by listing, in sequence, the first bit of each vertex encountered on a hamiltonian tour. 


F9: deBruijn’s Theorem [dB46] For each positive integer k, there are gtk 


deBruijn sequences of order k. 


k 123 4 5 6 
o2""'-k 4-1 «2 «16 2048 67108864 
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REMARKS 


R1: A hamiltonian circuit in a deBruijn graph can be constructed by splicing together 
the components of its deBruijn 2-factors. However, deBruijn’s theorem depends on a 
more elegant way to construct deBruijn sequences. 


R2: Since each component of the deBruijn 2-factor of the deBruijn graph G(k) has 
cardinality at most k, it follows that the number of components of the deBruijn 2-factor 
grows exponentially in k. 


4.4.2 Generating deBruijn Sequences 


An efficient algorithm for constructing a deBruijn sequence of order & is based not on 
finding a hamiltonian circuit in the deBruijn graph of order k, but rather on the easier 
task of constructing an Eulerian tour in the deBruijn graph of order k — 1. Another 
interesting method is strictly lexicographic. 


FACTS 


F10: [Go46]: A strongly connected directed graph in which every vertex has the same 
indegree as outdegree has an Eulerian tour. 


F11: In a deBruijn graph G(k), the k-sequence of arc labels encountered on every 
directed path of length & originating at a vertex v is the binary string that labels 
vertex v. (This is an immediate consequence of the specification of the arc labels in the 
definition of a deBruijn graph.) 


F12: The sequence of arc labels encountered on an Eulerian tour of the deBruijn graph 
of order k is a deBruijn sequence of order k + 1. 


REMARK 


R3: The proof of Fact F12 is not difficult. Since an Eulerian tour of G(k) visits 
each vertex twice, it follows from Fact F11 that each bitstring label occurs twice in the 
sequence of arc labels. By Fact F4, one occurrence is followed by a 0 and the other by 
al. 


ALGORITHM 


Al: To construct a deBruijn sequence of order k, use Fleury’s algorithm (quadratic 
time) to construct an Eulerian tour of the deBruijn graph G(k — 1). Then record the 
sequence of arc labels on the Eulerian tour. (Fleury’s algorithm appears in §4.2.) 


EXAMPLE 


E4: Figure 4.4.2 illustrates the construction of a deBruijn sequence of order 4 from the 
deBruijn graph of order 3. 
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Gor = 1001 = em 


Figure 4.4.2: An Eulerian circuit in G(3). 


Necklaces and Lyndon Words 


Fredricksen and Kessler [FrKe77] have published a remarkable alternative method for 
constructing deBruijn sequences. 


DEFINITIONS 


D10: A rotation of a binary string is the result of an iteration of cycle shifts. 
(Rotation is clearly an equivalence relation.) 


D11: An equivalence class under rotation of the binary strings of length n is called a 
necklace of order n. 


D12: <A Lyndon word of order n is a necklace of order n whose rotation class 


has n binary strings. A Lyndon necklace of length 1 is called trivial. We take the 
lexicographically least element of the equivalence class as representative of the necklace. 


FACTS 


F13: A necklace representative is a Lyndon word if and only if it is aperiodic, i-e., 
cannot be written as the concatenation of two or more identical strings. 


F14: By an elementary application of Burnside-Polya enumeration, the number of 


necklaces of order n is 
1 i 
= k)-2 
— > o(h) 28 


where $(n) is the number of integers in the interval [1,n] that are relatively prime to n. 
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F15: ([FrKe77]: If the (lexicographically least) representatives of all the nontrivial 
Lyndon words whose lengths divide n are arranged into lexicographic order and con- 
catenated, with the terminal string 10 appended at the end, then the result is a deBruijn 
sequence of order n that is lexicographically minimum. 


REMARK 


R4: The number N(n) of necklaces grows exponentially with n. While N(5) = 8, we 
have N(10) = 108, and N(15) = 2192. 


EXAMPLES 


E5: Figure 4.4.3 displays five equivalent strings of length 5. 


01101 
11010 
10101 
0101 1<— 


10110 


Figure 4.4.3: A necklace and its representations. 


E6: The only nontrivial Lyndon word of length 2 is 01. We observe that 0110 is a 
deBruijn sequence of order 2. 


E7: The only nontrivial Lyndon words of length 3 are 001 and 011. We observe that 
00101110 is a deBruijn sequence of order 3. 


E8: We now illustrate Fact F15 for n = 4. In lexicographic order, the nontrivial 
Lyndon words of lengths that divide 4 are 


0001, 0011, 01,0111 


If we now concatenate these words in the order given, we obtain the lexicographically 
least deBruijn sequence of order 4: 


0001001101011110 


4.4.3 Pseudorandom Numbers 


For Monte Carlo applications, the numbers produced by ordinary pseudorandom 
number generators (e.g., congruential generators) are close enough to random not to 
affect the outcome of the study. But for certain applications called precision Monte Carlo 
simulation, special sequences must be used. DeBruijn sequences, which already appear 
somewhat random to the untrained eye, may be made more random by interchanging 
runs of zeros and ones. 
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DEFINITIONS 


D13: A run in a binary sequence is a subsequence of identical bits, and a maximal 
run is a run that is not contained in any longer run. 


D14: [Go67] The Golomb postulates of randomness for a periodic sequence X 
are as follows: 


e The number of 1’s in X differs from the number of 0’s by at most unity. 


e At least half the runs in X have length 1, at least one-quarter have length 2, at 
least one-eighth have length 3, etc. 


e The bit in position 7 is correlated to the same degree with adjacent bits (¢+1 and 
i — 1) as it is with ones further away (i + 100 and i — 100). 


D15: A run-permuted sequence is a pseudorandom sequence obtained by the fol- 
lowing procedure: 


0. Generate a random deBruijn sequence of order n. 
1. Randomly permute the maximal runs of 0’s. 
2. Randomly permute the maximal runs of 1’s. 
D16: [Ch87] The randomness of an infinite binary sequence S is defined to be 


s(m) 


lim 
mo mM 
where s(m) is the minimum number of states in a 2-symbol Turing machine that pro- 
duces the first m bits of the sequence S. 


FACTS 
F16: Obviously, every deBruijn sequence can be generated as a run-permuted sequence. 


F17: Any deBruijn sequence X of order n satisfies Golomb’s first two postulates. First, 
the number of 1’s exactly equals the number of 0’s. Second, it is easily shown that over 
all possible binary subsequences of each length n, exactly half of the runs have that 
length. 


F18: Interchanging (maximal) runs in permuting a deBruijn sequence does not change 
the number of runs of any length or kind. One therefore obtains a much larger class 
of sequences that are, by Golombs measure, just as random as the original deBruijn 
sequence from which the new sequences are generated. 


F19: [Je01] The class C;, of run-permuted sequences of order n contains a vanishingly 
small proportion of deBruijn sequences of order n as n increases. 
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4.4.4 A Genetics Application 


Typically, the short DNA fragments observed in experiments are not sufficient to 
reconstruct the genome of an organism completely. Because of the time and expense of 
such experiments, it is desirable to minimize the remaining work. To this end, biologists 
algorithmically assemble as much of the genome as they can, thereby obtaining longer 
DNA fragments that are fewer in number. They then perform additional experiments 
at specific locales in the resulting sequences, in order to extend the reconstruction. 

A phenomenon that complicates the stepwise reconstruction of the genome is the 
natural occurrence of multiple copies of the same substring in a number of DNA se- 
quences acquired by experiment. To help resolve this difficulty, Pvezner, Tang and 
Waterman [PvTaWa01] have applied modified de Bruijn graphs, in which a repeated 
k-string in a given sequence s; results in multiple vertices, and consequently, in multi- 
ple paths connecting certain pairs of vertices in the graph. Such a graph need not be 
connected. 


DEFINITIONS 


D17: A DNA sequence is any finite sequence of the letters A, C, G, T. 


D18: For any set S = {5,52,...,5n} of DNA sequences, we define the S-relative 
deBruijn graph of order k to have vertices corresponding to all k-substrings from 
the elements of S', one for each occurrence of a substring. Two such vertices u and v 
are adjacent if their substrings belong to the same DNA sequence s; and the last k — 1 
letters of wu coincide with the first k — 1 letters of v. 


REMARK 


R5: Since Eulerian paths can be found very quickly in connected portions of the S- 
relative deBruijn graph, partial paths can be produced efficiently for the graph as a 
whole. These not only (in most cases) recapture the original sequences, but suggest 
where additional experiments need to be performed to choose between different possible 
paths through the S-relative deBruijn graph. 
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INTRODUCTION 


Named for Sir William Rowan Hamilton, the hamiltonian problem traces its origins 
to the 1850s. 


4.5.1 History 


Characterizing hamiltonian graphs is an NP-complete problem (see [GaJo79]); thus 
the hamiltonian problem is generally considered to be determining conditions under 
which a graph contains a hamiltonian cycle. Hamilton exhibited his [cosian Game at 
a meeting in Dublin in 1857. The game involved finding various paths and cycles, 
including spanning cycles, of the regular dodecahedron. The game was marketed by a 
wholesale dealer in 1859, but apparently was not a big hit. Perhaps the only profit was 
Hamilton’s, as he sold the game to the dealer for 25 pounds. 

Hamilton does not appear to be the first to have considered the question of spanning 
cycles. In a paper [Ki56] submitted in 1855, Thomas Penyngton Kirkman posed the 
question: Given the graph of a polyhedron, can one always find a circuit (cycle) that 
passes through each vertex once and only once. Thus, Kirkman actually asked a more 
general question than Hamilton. Unfortunately for Kirkman, the term hamiltonian 
cycle is much too ingrained to be changed now. For a more detailed account of this 
history see [BiL]Wi86]. 


DEFINITIONS 
D1: A graph G is hamiltonian if it contains a spanning cycle (hamiltonian cycle). 


D2: A graph G is traceable if it contains a spanning path. 
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D3: Further, G is hamiltonian connected if any pair of vertices are the ends of a 
spanning path. 


4.5.2 The Classic Attacks 


There are certain fundamental results that deserve attention, both for their contri- 
bution to the overall theory and for their affect on the later development of the area. 


The approach taken to developing sufficient conditions for a graph to be hamiltonian 
usually involved some sort of edge density condition, providing enough edges to ensure 
the existence of a hamiltonian cycle. 


TERMINOLOGY NOTE: The order of a graph is the cardinality of its vertex set, and the 
size of a graph is the cardinality of its edge set. 


Degrees 


NOTATION: The minimum degree of the vertices of a graph G is denoted dnin(G), and 
the maximum degree is denoted Odmaz(G). 


DEFINITIONS 


D4: We say aset X C V is independent if there are no edges between vertices in X. 
The largest cardinality of an independent set in G is called the independence number 
of G and is denoted ind(G). 


D5: The k-degree closure of G, denoted C;,(G), is the graph obtained by recursively 
joining pairs of nonadjacent vertices whose degree sum is at least k, until no such pair 
remains. 


D6: For a balanced bipartite graph G = (X UY, E£), the bipartite degree closure is 
that graph obtained by joining any nonadjacent pair x € X and y € Y whose degree 
sum is at least n+ 1. 
NOTATION: The following notation has become standard in the area: 
k 
o;4,(G) = min{S > deg x; |1,...,%,% are independent}. 

i=1 
FACTS 
F1: [Di52] If G is a graph of order n such that dmin(G) > n/2, then G is hamiltonian. 
F2: Let G be a graph of order n. 


i. [Or60] If o2(G) > n, then G is hamiltonian, 
ii. [Or63] If c2(G) > n+ 1, then G is hamiltonian connected. 
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EXAMPLE 


E1: Consider two K(p+1)/2 with one vertex from each identified (graph on left in 
Figure 4.5.1). This graph is not hamiltonian, but has order p, dmin(G) = (p— 1)/2 and 
o2(G) = p—1, illustrating the sharpness of Dirac’s Theorem and Ore’s Theorem (Fact 
F2 i). The graph obtained by identifying a pair of vertices from two copies of K(p+2)/2 
is not hamiltonian connected, has dmin(G) = p/2 and o2(G) = p, showing Fact F2ii is 
sharp. 


Figure 4.5.1: Illustrating the sharpness of Dirac’s and Ore’s results. 


FACTS 


F3: [Ja80] If G is a d-regular 2-connected graph of order n with d > n/3, then G is 
hamiltonian. 


F4: [MoMo63] If G = (X UY, £) is a balanced bipartite graph of order 2n (n > 2) 
with deg u+deg v > n+ 1 for each nonadjacent pair u € X and v € Y, then G is 
hamiltonian. 


F5: [BoCh76] Let G have order n. Then 

i. C;,(G) is well defined, 

ii. G is hamiltonian if, and only if, C,,(G) is hamiltonian, 

iii. if C,41(G) is complete, then G is hamiltonian connected. 


F6: [He91] A balanced bipartite graph is hamiltonian if, and only if, its bipartite 
closure is hamiltonian. 


REMARK 


R1: These closure results provide an interesting relaxation of the degree conditions. 
The closure is (hopefully) a denser graph, making it easier to find a hamiltonian cycle. 
However, the number of edges actually added in forming these closures can vary widely. 
It is easy to construct examples for all possible values from 0 to all the missing edges. 
Thus, we might receive no help in deciding if the original graph is hamiltonian, or we 
might conclude trivially that it is (when the closure is the complete graph). 


Other Counts 


DEFINITIONS 


D7: The neighborhood of a vertex x, denoted Ng(x), is the set of all vertices adjacent 
to x in G. Similarly, Ng(S) denotes the neighborhood of the set S and is the 
collection of all vertices adjacent to some vertex in S. 
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NOTATION: When the graph in which the neighborhood is defined is clear, the subscript 


is omitted. 


D8: The (vertex-) connectivity of a connected graph, denoted &,,(G), is the minimum 
number of vertices whose removal either disconnects G or reduces it to a 1-vertex graph. 


D9: A graph G is k-connected if k,(G) > k. 


EXAMPLE 


E2: The graph G(r,p) is that graph with order p and vertex set SUTUU where 
|S| = |Z| =r and |U| = p— 2r and where two vertices are adjacent if either belongs 


to S or both belong to U. Hence, the graphs induced by S, T, and U are K,, K,., and 
Ky—2, respectively. Figure 4.5.2 below shows the graphs G(1,6) and G(2,5). 


T S U T S U 


Figure 4.5.2: Graphs G(1,6) and G(2,5). 


FACTS 


F7: [Or63] If G is a graph of order n and size greater than Co) +1, then G is 
hamiltonian. Furthermore, the only nonhamiltonian graphs with exactly size ie) +1 
are G(1,n) and G(2,5). In addition, if G has size at least Coal then G is hamiltonian 


connected. 

F8: [Fa84] If G is a 2-connected graph of order n such that 
min{max(deg u, deg v) | d(u,v) = 2} > n/2, 

then G is hamiltonian. 


F9: [BaBrVeLi89] If G is a 2-connected graph of order n and connectivity ,)(G) such 
that o3(G) > n+ ,(G), then G is hamiltonian. 


F10: [ChEr72] Let G be a graph of order at least 3. 

i. If k,(G) > ind(G), then G is hamiltonian. 

ii. If k,(G) > ind(G) — 1, then G is traceable. 

iii If Ky(G) > ind(G) + 1, then G is hamiltonian connected. 


F11: [Wo78] If for any nonempty S C V, |N(S)| > SEAR then G is hamiltonian. 


F12: [Fr86] Let G be a k-connected graph of order n. Suppose there exists some 
t < k, such that for every independent set S of vertices with cardinality t we have 


|N(S)| > ed then G is hamiltonian. 
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F13: [BrVe91], [FaGoJaLe92] If G is a 2-connected graph of sufficiently large order 
n such that |N(S)| > n/2 for every set S of two distinct vertices of G, then G is 
hamiltonian. 


REMARK 


R2: The Petersen graph is the only counterexample for n < 10 (see [BrVe91]}). 


Powers and Line Graphs 


TERMINOLOGY: A circuit is a closed walk having no repeated edges (also called a closed 
trail). 


DEFINITIONS 


D10: The line graph L(G) of a graph G is that graph whose vertices can be put into 
1-1 correspondence with the edges of G in such a way that two vertices of L(G) are 
adjacent if, and only if, the corresponding edges of G are incident (have an endpoint in 
common). 


D11: A circuit C such that every edge of G is incident to a vertex of C is called a 
dominating circuit. 


D12: We say that G contains a k-system that dominates if G contains a collection 


of k edge disjoint circuits and stars (here stars are Ky ,,n; > 3), such that each edge of 
G is either contained in one of the circuits or stars, or is adjacent to one of the circuits. 


D13: The k-th power, G*, of a connected graph G is that graph with V(G") = V(G) 
for which uv € E(G) if, and only if, 1 < dg(u,v) < k. 


D14: A k-factor of a graph G is a k-regular spanning subgraph of G. In particular, 
a 2-factor is a (vertex-disjoint) union of cycles that covers V(G). 


FACTS 


F14: [HaNW65] Let G be a graph without isolated vertices. Then L(G) is hamiltonian 
if and only if G~ Ky, for some n > 3, or G contains a dominating circuit. 


F15: [GoHy99] Let G be a graph with no isolated vertices. The graph L(G) contains 
a 2-factor with k (k > 1) cycles if, and only if, G contains a k-system that dominates. 


F16: [ChWa73] If G is connected with 6min(G) > 3, then L?(G) = L(L(G)) is hamil- 
tonian. 


F17: [F174] If G is a 2-connected graph, then G? is hamiltonian. 


F18: If G is connected then G® is hamiltonian (in fact, hamiltonian-connected) (see 
[Be78]). 
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Planar Graphs 


FACTS 


F19: [Th83] Every 4-connected planar graph is hamiltonian connected (and hence 
hamiltonian [Tu56}). 


F20: [Gr68] Let G be a plane graph of order n with hamiltonian cycle C. If r; denotes 
the number of i sided regions interior to C and r’; the number of 7 sided regions exterior 
to C, then 


2 


(¢ = 2) (rj = r';) = 0. 
=3 


4.5.3. Extending the Classics 


Adding Toughness 


DEFINITION 


D15: If every vertex cut-set S of G satisfies t- c(S) < |.S|, where c(S') is the number of 
components of G — S$, we say that G is t-tough. The toughness of G is the maximum 
t such that G is t-tough. 


FACTS 


F21: [Ju78] Let G be a 1-tough graph of order n > 11 such that o2(G) > n—4. Then 
G is hamiltonian and this bound is sharp. 


F22: [BaMoScVe90] Let G be a 2-tough graph of order n such that o03(G) > n. Then 
G is hamiltonian. 


F23: [BrVe90| Let G be a 1-tough graph of order n > 3 with dmin(G) > ae 
Then G is hamiltonian. 


REMARK 


R3: Chvatal conjectured that there is a to such that all to-tough graphs are hamil- 
tonian. For years to = 2 seemed possible. However, in [BaBrLiVe00], examples of 
(9/4 — €)-tough nonhamiltonian graphs, for arbitrary « > 0, were presented. 


More Than Hamiltonian 
DEFINITIONS 
D16: A graph G of order n is pancyclic if it contains cycles of all lengths 1, 3 <1 <n. 


D17: A bipartite graph G of order n is bipancyclic if it contains cycles of all possible 
even lengths from 4 to n. 
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D18: A graph of order n is cycle extendable if any cycle C' of length m < n can 
be extended to a cycle of length m+ 1 containing all of V(C). Further, if G is cycle 
extendable and every vertex is on a triangle, then G is called fully cycle extendable. 


D19: A graph is k-ordered (hamiltonian) if for every ordered sequence of k vertices 
there is a cycle (hamiltonian cycle) that encounters the vertices of the sequence in the 
given order. 


FACT 


F24: [BrChFaGoLe97] If G is a graph of order n satisfying 

(1) bmin(G) > n/2 and n > 4k or 

(2) o2(G) >n and n > 4k 

then G contains a 2-factor with k cycles for each k, 1 < k < [n/4], and this result is 
best possible. 


EXAMPLE 


E3: To see this result is best possible we need only consider the complete bipartite 
graph Kp /2,n/2- The smallest cycle in any 2-factor of this graph is a 4-cycle, hence the 
bounds on & are sharp. 


FACTS 
F25: [Bo77] If G is a hamiltonian graph of order n with |E(G)| > ae then either G is 
pancyclic or G ~ Ky /2.n/2- 


F26: [He90] If G has order n > 3 and o2(G) > n, then G is cycle extendable unless 
G belongs to one of two special classes. Also, if o2(G) > (4n — 5)/3, then G is cycle 
extendable. Further, if dmin(G) > (n + 1)/2, then G is fully cycle extendable. 

F27: [He91] If G = (X UY,E) is a balanced bipartite graph of order 2n such that 
for any nonadjacent pair x © X and y € Y we have deg x+ deg y > n+1, then G is 
bipancyclic. 


F28: [He91] Let n > 2m > 2. If G= (X UY, E) is a balanced bipartite graph of order 
2n satisfying dmin(G) > m and |E(G)| > n? — mn+m?, then G is bipancyclic. 


F29: [KoSaSz96], [KoSaSz98] There exists a natural number no such that if G has 
order n and n > no and bmin(G) > kn/(k +1), then G contains the k-th power of a 
hamiltonian cycle. 


F30: [KiSaSe99] Let k > 2 be an integer and let G be a graph of order n > 11k — 3. 
If bmin(G) > [| + | | —1, then G is k-ordered hamiltonian. 

F31: [FaGoKoLeScSa03] Let & be an integer with 3 < k < n/2 and let G be a graph of 
order n. If deg(u) + deg(v) > n+(3k—9)/2 for every pair u,v of nonadjacent vertices 
of G, then G is k-ordered hamiltonian. 
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REMARK 


R4: The bounds in Facts F30 and F31 are both sharp for the respective values of k. 
Unexpectedly, the Dirac type bound does not follow from the Ore type bound. 


FACTS 


F32: [Ha79] Let G be a graph of order n > 4, n even. If o2(G) > n+ 1, then for any 
1-factor F' of G, there is a hamiltonian cycle containing F’. 


F33: [LV72] Let G = (AUB, E) be a bipartite graph with |A| = |B] = n > 2. If for 
each pair of nonadjacent vertices u,v with u € A and v € B, degu+degu>n+1, 
then any 1-factor F’ of G is contained in a hamiltonian cycle of G. 


F34: [Ya99] If G is a graph of order n > 4, n even and dmin(G) > 2 and |E(G)| > 


Yr?) +1, then for any 1-factor F’, there is a hamiltonian cycle of G containing F’. 


F35: [KaYo01] Let G be a graph of order n with b,in(G) > n/2 and let d be a positive 


integer such that d < 7. Then, for any vertex subset A with |A| < there is a 


hamiltonian cycle C such that do(u,v) > d for any u,v € A. 


aL 
2d? 


F36: [SaSe08] There are w,n9 > 0 such that if G is a graph of order n > no with 
dmin(G) > n/2 and d is an arbitrary integer with 3 < d < wn/2 and S Cc V(G) 
with 2 < |S| = k < wn/d, then for every sequence d; of integers with 3 < d; < d, 
1<i<k-1, there is a hamiltonian cycle C of G and an ordering of the vertices of 
S, a1, @2,...,a,% such that the vertices of S are visited in this order on C' and we have 
\dc (ai, ai41) — d;| <1 for all but one 1 <i<k-1. 


F37: [FaGoJaMa] Let t > 3 be an integer and let 0 < ey For n > ieee Cea let 


G be a graph of order n having dmin(G) > 3 and K(G), > 2[§]. For every X = 
{x1,@2,...,21} C V(G), there exists a hamiltonian cycle such that do(a:,2;) > 
(4 —e)n for all 1 < i < j < t. Furthermore, the minimum degree and connectivity 


conditions are sharp. 


F38: Pasa Let t > 3 be an integer and €,71,7%,.-- a7 positive real numbers 
having )_j.y = 1 and 0 <e < min{}}. For n > exo" zo he , let G be a graph 
of order n having dmin(G) > atat ! or bmin(G) > 5 and Ko(G) > a For every 
X = {x 1,%2,..., 41} CV(G), here exists a hamiltonian cycle C containing the vertices 
of X in order such that (y;—6)n < do (ai, ti41) < (Yite)n for alll <i <t. Furthermore, 
the minimum degree and connectivity conditions are sharp. 


F39: [FaGoJa09] Let G be a graph of order n, and let F' be a k-edge forest composed 
of t paths, where 2 <k+t<n. If 

(a) o2(G) > n+k when either n+ k is odd or F = Py41 U (¢ — 1)K4, and 

(b) o2(G) > n+k —1, otherwise, 

then G has a hamiltonian cycle containing F’. 
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4.5.4 More Than One Hamiltonian Cycle 


A Second Hamiltonian Cycle 


FACTS 


F40: Every edge of a 3-regular graph is contained in an even number of hamiltonian 
cycles. Thus, every 3-regular hamiltonian graph contains a second and, in fact, a third 
hamiltonian cycle (C.A.B. Smith, see [Tu46]). 


F41: [Th98] If G is hamiltonian and m-regular with m > 300, then G has a second 
hamiltonian cycle. 


F42: [Th97] Let G be a graph with a hamiltonian cycle C. Let A be a vertex set 
in G such that A contains no two consecutive vertices of C and A is dominating in 
G—E(C) (ie., Ne—xc)(A) 2 V(G— E(C))). Then G has a hamiltonian cycle C’ such 
that C’ - A= C— A and there is a vertex v in A such that one of the two edges of C’ 
incident with v is in C and the other is not in C. 


F43: [HoSt00] For any real number k > 1, there exists f(k) so that every hamiltonian 
graph G with dmaz(G) > f(k) has at least dmin(G)— | cease | +2 hamiltonian cycles. In 
particular, every hamiltonian graph with dmax(G) > f(Omaz(G@)/dmin(G)) has a second 
hamiltonian cycle. 


F44: [Ma76], [GrMa76] There exist 4-regular, 4-connected planar graphs that do not 
have two edge-disjoint hamiltonian cycles. 


F45: [Za76], [Ro89] There exist infinitely many examples of 5-connected planar graphs 


(both regular and nonregular) in which every pair of hamiltonian cycles have common 
edges. 


REMARK 
R5: Thomason [Th78] extended Smith’s result (Fact F40) to all r-regular graphs 


where r is odd (in fact, to all graphs in which all vertices have odd degree). Thomassen 
extended this further (see Fact F41). 


Many Hamiltonian Cycles 


DEFINITION 
D20: A planar triangulation of a planar graph is the process of adding edges 


between pairs of non-adjacent vertices to produce another planar graph, each of whose 
regions is bounded by a triangle. 


FACTS 


F46: [KrZe88] If a planar triangulation (except K3 and K4) is hamiltonian, then it 
contains at least four hamiltonian cycles. 
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F47: [Th96] Let C : 21, y1, 22, y2,---,2n, Yn, 21 be a hamiltonian cycle in a bipartite 
graph G. 

(a) If all the vertices y,,...,Yn have degree at least 3, then G has another hamiltonian 
cycle containing the edge 7141. 

(b) If all the vertices y1,..., Yn have degree d > 3 and if P,, Po,...,P; (0<q<d-—3) 
are paths in C of length 2 of the form y;_12;y;, then G has at least 27+!~4(d — q)! 
hamiltonian cycles containing P, U---U Pg. 


F48: [FaRoSc85] Let k be a positive integer. 

(a) If G is a graph of order n > 60k? such that o2(G) > n+ 2k — 2, then G contains k 
edge-disjoint hamiltonian cycles. 

(b) If G has order n > 6k and size at least Ce + 2k, then G contains k edge-disjoint 
hamiltonian cycles. 


F49: [Eg93] Let n,k > 2 be integers with n > 44(k — 1). If G is a graph of order n 
with 02(G) > n and bmin(G) > 4k — 2, then G contains k edge disjoint hamiltonian 
cycles. 


Uniquely Hamiltonian Graphs 


DEFINITION 


D21: A graph is uniquely hamiltonian if it contains exactly one hamiltonian cycle. 


FACTS 


F50: [EnSw80] There exist infinitely many uniquely hamiltonian graphs with minimum 
degree three. 


F51: [JaWh89] Any uniquely hamiltonian graph contains a vertex of degree at most 
(n+ 9)/4, and if the graph has a unique 2-factor, then it contains a vertex of degree 2. 


F52: [BoJa98] Every uniquely hamiltonian graph of order n has a vertex of degree 
at most clog,(8n) +3 where c = (2 — log 3)~! ~ 2.41. Furthermore, every uniquely 
hamiltonian plane graph has at least two vertices of degree less than four. 


Products and Hamiltonian Decompositions 


DEFINITIONS 


D22: A hamiltonian decomposition is a partitioning of the edge set of G into 
hamiltonian cycles if G is 2d-regular or into hamiltonian cycles and a perfect matching 
if G is (2d + 1)-regular. 


D23: Each of the following four kinds of product graphs has vertex set V(G1) x V(G2). 
The cartesian product G = G x G2 has edge set 


E(G) = {(u1, u2)(v1, v2) | ur = v1 and ugve € E(G2) or ug = ve and uv, € E(G1)}. 
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The direct product (or conjunction) G = G, - G2 has edge set 
E(G) = {(u1, u2)(v1, va) | uur € E(G1) and ugv2 € E(G2)}. 
The strong product G = G; ® G2 has edge set 


E(G) = {(u1, u2)(v1, v2) | wr = v1 and ugve € Ge, or 
Ug = v2 and u,v, € E(G,), or both uyv, € E(G,) and ugve € E(G2)}. 


Finally, the lexicographic product (sometimes called composition, tensor or wreath 
product) G = G,[G2] has edge set 


E(G) = {(u1, u2)(v1, v2) | wiur € E(G1), or ur = v1 and ugv2 € E(G2)}. 


REMARK 


R6: Jackson [Ja79] conjectured that every k-regular graph on at most 2k +1 vertices is 
hamiltonian decomposable. Another natural question is: If G; and G2 are hamiltonian 
decomposable, is the appropriate product of G; and G2 also hamiltonian decomposable? 


FACTS 


F53: [St91] Let G; and G2 be two graphs that are decomposable into s and t hamil- 
tonian cycles, respectively, with t < s. Then G, x G2 is hamiltonian decomposable if 
one of the following holds: 

(1) s < 3¢ 

(2)t>3 

(3) the order of G2 is even, or 

(4) the order of G, is at least 6[s/t] — 3. 


F54: It is easy to see that if G; and G2 are both bipartite, then the direct product 
G1-Gp is disconnected. Hence, the set of hamiltonian decomposable graphs is not closed 
under the direct product. 


F55: [Bo90], [Zh89] Suppose both G; and G2 are hamiltonian decomposable. If at 
least one of them has odd order, then G - Gz is hamiltonian decomposable. 


F56: [FaLi98] The set of hamiltonian decomposable graphs is closed under strong 
products, that is, if G; and Gz are hamiltonian decomposable, then so is G1 ® Go. 


F57: [BaSz81] The lexicographic product of two hamiltonian decomposable graphs is 
hamiltonian decomposable. 


F58: [Kr97] Let H be a subgraph of a graph G. Each of the following conditions is 
sufficient for the lexicographic product G[H] to be hamiltonian. 

(a) G is 1-tough and contains a 2-factor, and |E(H)| > 2. 

) G is 2-tough and |E(H)| > 2. 

c) G is connected and 2k-regular, and |V(H)| > k. 

d) G is (2k + 1)-regular, connected, and has a 1-factor, and |V(H)| >k +1. 

e) G is connected and vertex transitive of degree k, and |V(H)| > k/2. 

f) G is connected and vertex transitive, and |E(H)| > 2. 

g) G is cubic and 2-edge connected, and |V(H)| > 2. 

h) G is 4-regular and connected, and |V(H)| > 2. 


(b 
( 
( 
( 
( 
( 
( 
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4.5.5 Random Graphs 


NOTATION: Throughout this subsection Pr(X) denotes the probability of event X and 
N denotes the quantity rs 


DEFINITIONS 


D24: (The edge density model) Suppose that 0 < p < 1. Let Gn, denote a graph 
on n vertices obtained by inserting any of the N possible edges with probability p. 


D25: (The fixed size model) Suppose that M = M(n) is a prescribed function of n 
which takes on values in the set of positive integers. Then there are s = ea different 
graphs with M edges possible on the vertex set {1,2,...,n}. We let G,,,y¢ denote one 
of these graphs chosen uniformly at random with probability 1/s. 


D26: A somewhat different approach is to consider a graph process as a sequence 
(G,) such that 


1. each G; is a graph with vertex-set V, 
2. G, has t edges fort = 0,1,...,N, 
3. Go CG1C.... 


D27: If Q, is a model of random graphs of order n, we say almost every graph in 
Q,, has property Q if Pr(Q) > 1 as n > co. Note that this is equivalent to saying that 
the proportion of all labeled graphs of order n that have @ tends to 1 as n > co. 


D28: The k-in, l-out random digraph Dy~in,i—out has n vertices and for each vertex 
v, a set of k arcs into v and l arcs out of v are chosen independently and uniformly at 
random. The union of these arc subsets is the arc-set of Dk—int—out- 


FACTS 


F59: [Po76], [Ko76] There exists a constant c such that almost every labeled graph on 
n vertices and at least cnlogn edges is hamiltonian. 


F60: [Ko76], [KoSz83] Suppose w(n) — oo as n — ov, and let 
1 
p = —{logn + log(logn) + w(n)} and M(n) = | F{logn + log(logn) + w(n)}| : 
n 
Then almost every Gp,,p is hamiltonian and almost every Gy, is hamiltonian. 


F61: [KoSz83] For M(n) = n/2 (logn + log(logn) + cn) 


0 :  ifc, 4 —oo 
limn—+oo Pr(Gn.m is hamiltonian)=< e-©° : ife, 9c 
1 : if, > . 


F62: [RoWo92], [RoWo94] For every r > 3, almost all r-regular graphs are hamiltonian. 


F63: [CoFr94] Almost all random digraphs D3_in,3-oue are hamiltonian. 
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F64: [CoFr00] Almost all random digraphs D2_in,2—out are hamiltonian, In particular, 
this implies that G4—our, the underlying graph of D2-in,2—out, is hamiltonian. On the 
other hand, almost all Dy~in,2—oue and Do_in,i—out are not hamiltonian. 


F65: [BoFr09] Almost all random graphs G3_o. are hamiltonian. 


REMARKS 


R7: Considering the probability space of all N! graph processes (with equal probability) 
allows us to consider when a property “appears” (called the hitting time). Erd6és and 
Spencer where the first to conjecture that with probability tending to 1, the very edge 
that increases the minimum degree to 2 also makes the graph hamiltonian. This was 
verified by Bollobas [Bo84]. 


R8: It is natural to ask for a polynomial algorithm which, with probability tending 
to 1, finds a hamiltonian cycle in G,xs(n)- Bollobaés, Fenner, and Frieze [BoFeFr85] 
constructed such an algorithm which is essentially best possible. 


4.5.6 Spectral Attacks 


NOTATION: Let A(G) be the adjacency matrix of the graph G, let D(G) be the degree 
matrix of G, let L(G) = D(G) — A(G) be the Laplacian of G, and let Q(G) = D(G) + 
A(G). 


REMARK 


R9: We extend our concept of a graph by allowing free edges, which are edges with 
only one end vertex. In this case the degree of a vertex counts both the ordinary and free 
edges incident with the vertex. However, the free edges do not appear in the adjacency 
matrix. 


DEFINITIONS 


D29: The subdivision graph of G, denoted S(G), is the graph obtained from G by 
subdividing each edge of G. 


D30: Let Con; denote the cycle C2, with | free edges added to every second vertex of 
Con. 


D31: The ezgenvalues of a graph are the eigenvalues of the adjacency matrix of that 
graph. For a graph G we denote the eigenvalues of G as \1(G) < A2(G) <... < An(G). 


FACTS 


F66: [Mo92] Let G be a k-regular graph of order n. If G is not hamiltonian, then for 
i= 1, 2, sie T, Ai(L(S(G)) < Ai(L(C2n,k-2))- 


F67: [vdH95] Let G be a graph of order n and size m. If G is not hamiltonian, then 
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F68: [KrSu03] If the second largest absolute value of an eigenvalue 2 of the adjacency 
matrix of a d-regular graph satisfies 


(log log n)? 
log n (log log log n) 


for a constant c and n sufficiently large, then G is hamiltonian. 


F69: [BuCh10] Let G be a graph of order n and average degree d, and let 0 = A, < 
Ag <... < An be the eigenvalues of the Laplacian of G. If there is a constant c so that 


=e (log log n)? 
~ log n (log log log n) 


for i £ 1 and n sufficiently large, then G is hamiltonian. 


F70: [FiNil0] Let G be a graph of order n, and let u(G) be the largest eigenvalue of 
the adjacency matrix of G. Then, 

(a) If w(G) > n — 2, then G is traceable unless G is the disjoint union of K,_1 and a 
vertex. 

(b) If w(G) > n - 2, ee G is hamiltonian unless G is K,,_; with a pendant edge. 

(c) If u(G) < /n—1, then G is traceable unless G is the disjoint union of K,_; and a 
vertex. 

(d) If u(G) < Vn — 2, then G is hamiltonian unless G is K,_1 with a pendant edge. 


4.5.7 Forbidden Subgraphs 


DEFINITION 


D32: A graph G is said to be {F, Fo,..., Fi, }-free if G contains no induced subgraph 
isomorphic to any F;, 1<i<k. 


NOTATION: The graph N;,;, is a graph which consists of K3 and vertex-disjoint paths 


of length 2, 7, k with one path rooted at each of its three vertices. The graph L consists 
of two vertex-disjoint copies of K3 and an edge joining them. The graph P; is a path 
with i vertices. The graph Ks is the four vertex star (also called the claw). 


icieeatitns Ses 


Figure 4.5.3: The graphs Nj 23 and L. 


FACTS 


F71: [DuGoJa81] If G is a {K1,3, Ni1,1}-free graph, then 
(a) if G is 2-connected, then G is hamiltonian; 
(b) if G is connected, then G is traceable. 


328 Chapter 4. Connectivity and Traversability 


F72: [BrDrKo00] There exists a linear time algorithm for finding a hamiltonian cycle 
ina {Ki.3, N11,1}-free graph. 


F73: [BrVe90] If G is a 2-connected {Ay,3, Pe}-free graph, then G is hamiltonian. 


F74: [GoJa82] If G is a 2-connected {K1,3, N2.0,0}-free graph, then G is hamiltonian. 


F75: [Be91] If G is a 2-connected {1,3, N2,1,0}-free graph, then G is hamiltonian. 


F76: [FaGoRySc95] If G is a 2-connected {Ky,3, N3,0,0}-free graph of order n > 10, 
then G is hamiltonian. 


REMARK 


R10: A natural question is: Are these the only such pairs? This was investigated in 
[Be91] for all graphs, and in [FaGo97] for graphs of order 10 or more. We now summarize 
these combined results. 


FACTS 


F77: [Be91], [FaGo97] Let R and S be connected graphs (R,S # P3) and G a 2- 
connected graph of order n. Then G is {R,S}-free implies G is hamiltonian if, and 
only if, R= ky3 and S is one of the graphs Nii, Pe, N2,1,0; N2,0,0 (or N3,0,0 when 
n > 10), or a connected induced subgraph of one of these graphs. 


F78: [FaGo97] Let R,S be connected graphs (R,S # P3) and let G(G#C,) bea 
2-connected graph of order n > 10. Then G is {R, S}-free implies G is pancyclic if, and 
only if, R= Ki3 and S is one of P,, Ps, Ps, N1,0,0 or N2,0,0: 


F79: [GoLuPf04] Let X and Y be connected graphs on at least three vertices such 
that X,Y # P3; and Y # K,,3. Then the following statements are equivalent: 

(a) Every 3-connected {X,Y }-free graph G is pancyclic. 

(b) X = Kj1,3 and Y is a subgraph of one of the graphs from the family F = {P;, L, 
N4,0,0, N3,1,0, N2,2,0, N2,1,1}- 


F80: [FaGo97] It is an easy observation that P3 is the only nontrivial single graph that 
when forbidden implies G is hamiltonian. 


REMARK 


R11: (Claw-free graphs) In each of the forbidden-pair results above, the claw Ky.3 
is one of the two forbidden graphs. This led naturally to the question: Is the claw in 
every triple of forbidden subgraphs implying hamiltonicity? This was answered nega- 
tively in [FaGoJaLe02] where all triples containing no Ky, with t > 3 for sufficiently 
large 2-connected graphs were given. Further, in [FaGoJa04] other forbidden triples for 
sufficiently large graphs were investigated. Brousek [Br02] characterized the collection 
of all forbidden triples that include the claw and imply hamiltonicity for all 2-connected 
graphs. In [FaGoJa05], all possible remaining triples implying hamiltonicity for all 
graphs were given. 
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DEFINITIONS 


D33: For a vertex x such that the induced subgraph G[N(«)] is connected, a local 
completion of G at x is the graph obtained by replacing G[N(a)] by a complete 
subgraph on V(N(x)). (Observe that a local completion of a claw-free graph is claw- 
free.) 


D34: The claw-free closure of G, denoted by cl(G), is that graph obtained by 
repeatedly finding the local completion of a vertex x until the resulting graph has 
the property that for every vertex x, G[N(a)] is either non-connected or a complete 
subgraph. 


D35: The circumference of a graph G, denoted circum(G), is the length of a longest 
cycle in G. 


FACTS 


F81: [FaGo97]| Let R,S be connected graphs (R,S 4 P3) and G a 2-connected graph 
of order n > 10. Then G is {R,S}-free implies G is cycle extendable if, and only if, 
R= Ky 3 and S is one of Ks, Py, N4,0,0 or N2,0,0- 


F82: [Sh97] If G is a 3-connected {Ki,3, Ni,1,1}-free graph, then G is hamiltonian- 
connected. 


F83: [Ry97] Let G be a claw-free graph. Then 

(a) the closure cl(G) is well-defined, 

(b) there is a triangle-free graph H such that cl(G) = H, and 
(c) circum(G) = circum(cl(G)). 


REMARKS 


R12: The claw-free closure is different from the degree closure (see [BoCh76]) or any 
of several other closures that have recently been developed. For more information on 
closures, see [BrRySc00]. 


R13: By Fact F83, if G is claw-free, then G is hamiltonian if and only if cl(G) is 
hamiltonian. 


R14: Matthews and Sumner [MaSu84] conjectured that every 4-connected claw-free 
graph is hamiltonian. At about the same time Thomassen [Th86] conjectured that every 
4-connected line graph is hamiltonian. These were shown to be equivalent by Ryjacek 
[Ry97]. Several other statements are known to be equivalent to these (see [KaVra]). 


FACT 


F84: [KaVra] Every 5-connected line graph with minimum degree at least 6 is hamil- 
tonian (in fact, hamiltonian connected). 


REMARK 


R15: This vast area contains far more than can be written here. For more details on 
hamiltonian graphs the reader should see [Be78], [Bo78], [WiGa84], [Bo95], [CuGa96], 
[Go91] and [Go03]. 
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INTRODUCTION 


The Traveling Salesman Problem (TSP) is perhaps the most frequently studied dis- 
crete optimization problem. Its popularity is due to the facts that TSP is easy to 
formulate, difficult to solve, and has a large number of applications. TSP has a number 
of variations and generalizations extensively studied in the literature [Pu02]. In this 
section, we consider TSP, the Generalized TSP and the Vehicle Routing Problem. 


4.6.1 The Traveling Salesman Problem 


K. Menger [Me32] was perhaps the first researcher to consider the Traveling Sales- 
man Problem (TSP). He observed that the problem can be solved by examining all 
permutations one by one. Realizing that the complete enumeration of all permutations 
was not possible for graphs with a large number of vertices, he looked at the most nat- 
ural nearest neighbor strategy and pointed out that this heuristic, in general, does not 
produce the shortest route. In fact, as we will see below, the nearest neighbor heuristic 
will generate the worst possible route for some problem instances. (For an interesting 
overview of TSP history, see [HoWo85].) 

In applications, both the symmetric and asymmetric versions of the TSP are impor- 
tant. 
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Symmetric and Asymmetric TSP 


DEFINITIONS 


D1: Symmetric TSP (STSP): 
Given a complete (undirected) graph K,, with weights on the edges, find a hamiltonian 
cycle in K,, of minimum (total) weight. 


D2: Asymmetric TSP (ATSP): 

oO 
Given a complete directed graph K,, with weights on the arcs, find a hamiltonian cycle 

oO 

in K, of minimum weight. 
D3: The Euclidean TSP is the special case of STSP in which the vertices are points 
in the Euclidean plane and the weight on each edge is the Euclidean distance between 
its endpoints. 


Oo 
D4: A hamiltonian cycle in K, or Ky, is called a tour. 


NOTATION: Throughout this section, the set {1,2,...,} denotes the vertices of K,, or 
oO 
Ky, or any other n-vertex graph under discussion. 


NOTATION: By TSP we refer to both STSP and ATSP simultaneously. 


Matrix Representation of TSP 


Every instance of TSP can be associated with the matrix of edge-weights of the cor- 
responding complete graph. Such a matrix is symmetric for STSP and, in general, 
asymmetric for ATSP. 


DEFINITIONS 
D5: The distance (or weight) matriz of an instance of STSP is the matrix D = [d;,], 
where d,; is the weight of the edge between vertices 7 and 7. The distance matriz of 


an instance of ATSP is the matrix D = [d;;], where d,; is the weight of the arc directed 
from 7 to 7. Accordingly, the diagonal entries d;; are set to zero. 


D6: An instance of TSP is said to satisfy the triangle inequality if dij + djx > dix 
for all distinct vertices i, 7, k. 


D7: Metric TSP is the special case of STSP where every instance satisfies the triangle 
inequality. 


Clearly, the Euclidean TSP is a special case of the Metric TSP. 
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EXAMPLES 


E1: An instance of ATSP with distance matrix 


0 6 5 10 
3.0 3 9 
7 4 0 8 
12 7 5 O 


is shown in Figure 4.6.1. There are 3! = 6 tours of total weight 29, 27, 30, 23, 27, and 
22. The optimal tour is (1,4,3,2,1) of weight 22. 


Figure 4.6.1: An instance of ATSP. 


E2: An instance of STSP with distance matrix 


0 10 7 7 Ii 
10 0 9 6 5 
D=|7 9 0 9 10 
7 6 9 0 6 
11 5 10 6 O 


is shown in Figure 4.6.2. Since this graph has 5 vertices, there are 4!/2 = 12 tours. The 
optimal tour is (1,3, 2,5,4,1) of weight 34. 


2 


Figure 4.6.2: An instance of STSP. 


Section 4.6. Traveling Salesman Problems 339 


Algorithmic Complexity 


FACTS 


F1: The hamiltonian cycle problem on an n-vertex graph G can be transformed into 
STSP by converting G to an edge-weighted K,, as follows: assign weight 0 to each 
edge of G, and assign weight 1 to each edge in the edge-complement of G. A similar 
transformation can be used for digraphs and ATSP. 


F2: The previous fact implies that even the Metric TSP is NP-hard. 


By replacing the weights 0 by 1 and the weights 1 by nr in the transformation above, 
we obtain the following result. 


F3: [SaGo76] For an arbitrary constant r, unless P = NP, there is no polynomial time 
algorithm that always produces a tour of total weight at most r times the optimal. 


Exact and Approximate Algorithms 


DEFINITIONS 
D8: An exact algorithm is an algorithm that always produces an optimal solution. 


D9: An approximate (or approximation) algorithm is an algorithm that typically 
makes use of heuristics in reducing its computation but produces solutions that are not 
necessarily optimal. 


NOTATION: Let A be an approximate algorithm for TSP and J a problem instance. 
Then wmin(1), Wmax(L), wa(1) denote the weights, respectively, of an optimal tour, a 
heaviest tour, and a tour produced by algorithm A, for instance I. 


D10: The Zemel measure [Ze81] of an algorithm A, denoted p,(A), is the supremum 
of (wa(Z) — Wmin(Z))/(wmax(1) — Wmin(Z)), taken over all TSP instances J for which 
Wmax (I) x Wmin (1). 


FACT 


F4: [Hakh01] There is a polynomial time approximate algorithm A for ATSP with 


pz(A) < 5, and one for STSP with p.(A) < 3. 


The Euclidean TSP 


Despite the fact that Euclidean TSP is NP-hard [Pa77,GaGrJo76], there was a feeling 
among some researchers that the Euclidean TSP is somewhat simpler than the gen- 
eral STSP. This was confirmed by Arora [Ar98] in 1996 (see below). Mitchell [Mi99] 
independently made a similar discovery a few months later (see [Ar02}). 
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FACTS 


F5: [Ar02] For every € > 0, there is a polynomial time algorithm A, that, for any 
instance of the Euclidean TSP, finds a tour at most 1+ times longer than the optimal 
one. 


F6: As of this writing, the fastest algorithm A, has time complexity O(nlogn + 
n/poly(e)) [RaSm98]. 


COMPUTATIONAL NOTE: These A, algorithms have been implemented, but, in their 
current form, they are not competitive with other TSP heuristics [Ar02, RoCiFa09]. 


F7: [Tr97] There exists a constant r > 1 such that, for the Euclidean TSP in O(log n)- 
dimensional Euclidean space, the problem of finding a tour that is at most r times longer 
than the optimal tour is NP-hard. 


REMARKS 


R1: Arora’s result above can be generalized to d-dimensional Euclidean space for any 
constant d. However, the previous fact limits the scope of this generalization. 


R2: Exact algorithms cannot be relied on for applications requiring very fast so- 
lutions (online, for example) or ones that involve huge problem instances. Although 
approximate algorithms forfeit the guarantee of optimality, with good heuristics they 
can normally produce solutions close to optimal. For many applications this is good 
enough, since often the data are inexact anyway. 


R3: TSP heuristics can be roughly partitioned into two classes: construction heuris- 
tics and improvement heuristics. Both types are discussed below. Other overviews 
of TSP heuristics can be found in [GoSt85], [JoGuMcYeZhZv02], and [JoMc02]. 


4.6.2 Exact Algorithms 


The NP-hardness results mentioned in the previous subsection indicate that it is 
rather difficult to solve large instances of TSP to optimality. Nevertheless, there are 
computer codes that can solve many instances with thousands of vertices within days 
(on a single-processor computer) [ApBiChCo06]. For a discussion of some TSP software 
implementing both exact algorithms and heuristics, see [LoPu02]. 


FACT 


F8: The brute-force method of explicitly examining all possible TSP tours is impractical 
for even moderately sized problem instances because there are (n — 1)!/2 different tours 
oO 


in K,, and (n — 1)! different tours in Kp. 
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Integer Programming Approaches 


Various methods have been suggested to solve TSP to optimality. They include La- 
grangian relaxation ([BeLu00]), dynamic programming ([PaSt82]), and branch-and- 
bound and branch-and-cut (see [BaTo85], [FiLoTo02], and [Na02]). These are all well- 
known methods in integer programming ([Wo98]). The earliest (and still useful) integer 
programming formulation of ATSP is due to Dantzig, Fulkerson, and Johnson [Da- 
FuJo54]. 


DANTZIG, FULKERSON, AND JOHNSON FORMULATION: Define zero-one variables x;; by 


1, if the tour traverses arc(?, 7) 
Ce — 
~ 0, otherwise 


Let d,;; be the weight on arc (7,7). Then ATSP can be expressed as: 


n n 
i=1j=1 
n 
subject to Mag ed FSD 24s on 


j=l 
YS YS ay <|S|—1 for all 0 < |S] <n 
i€S JES 

tj =Oorl, 1,7 =1,...,n 


FACTS 


F9: The first set of constraints ensures that a tour must come into vertex j exactly 
once, and the second set of constraints indicates that a tour must leave every vertex 7 
exactly once. These two sets of constraints ensure that there are two arcs adjacent to 
each vertex, one in and one out. However, this does not prevent non-hamiltonian cycles. 
Instead of having one tour, the solution can consist of two of more vertex-disjoint cycles 
(called sub-tours). 


F10: The third set of constraints, called sub-tour elimination constraints, requires 
that no proper subset of vertices, S, can have a total of |.S| arcs. 


F11: The formulation without the third set of constraints is an integer programming 
formulation of the Assignment Problem that can be solved in time O(n?) [Wo98]. A 
solution of the Assignment Problem is a minimum-weight collection of vertex-disjoint 


cycles C),...,C, spanning a . If ¢ = 1, then an optimal solution of ATSP has been 
obtained. Otherwise, one can consider two or more subproblems. For example, for a 
particular arc a € C;, one subproblem could add the constraint that arc a be in the 
solution, and a second subproblem could require that a not be in the solution. This 
simple idea gives a basis for branch-and-bound algorithms for ATSP. 
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4.6.3 Construction Heuristics 


Approximate algorithms based on construction heuristics build a tour from scratch 
and stop when one is produced. 


Greedy-Type Algorithms 


The simplest and most obvious construction heuristic is nearest neighbor. The nearest 
neighbor greedy algorithm constructs a tour by always choosing as the next vertex to 
visit one that is nearest to the last one visited. 


Algorithm 4.6.1: Nearest Neighbor (NN) 


Input: n x n distance matrix [d;;] and a fixed vertex 4}. 
Output: TSP tour (i1,72,...,¢n, 741). 
Initialize S := {1,2,...,n}— {a}. 
For k = 2,3,...,n 
Choose ix, such that d;,_. 4, = mint di,_4,s}- 
s 


S:=S- {ix}. 


A second greedy- type algorithm is based on the observation that a vertex-disjoint col- 
lection of paths in Kn (K,,) can be extended to a tour in Ky (Kn). 


Algorithm 4.6.2: Greedy Heuristic (GR) 


Input: n x n distance matrix [d;,]. 
Output: ATSP (STSP) tour as a set S' of arcs (edges). 
Set S = and m= n(n -— 1) (for ATSP) or m = n(n — 1)/2 (for STSP). 
Sort the arcs (edges) a1, @2,...,@m in non-decreasing order of weight. 
For 1 = 1,2,...,m 
If SU {a;} is the arc (edge) set of a collection of vertex-disjoint paths 
or is the arc (edge) set of a tour, 


S:= SU {a;} 


EXAMPLE 


E3: The performance of Nearest Neighbor on STSP or ATSP can be arbitrarily bad. 
For the instance of STSP shown in Figure 4.6.3, starting at vertex 1, Algorithm NN 
moves to vertex 4, then to vertex 3, and on to vertex 2. The resulting tour is (1, 4,3, 2, 1) 
with weight 103, whereas, the optimal tour is (1,3, 2,4,1) with weight 6. 


COMPUTATIONAL NOTE: Computational experiments in [JoGuMcYeZhZv02] indicate 
that, in fact, on most real-world problem instances of ATSP, Algorithm NN performs 
better than Algorithm GR; GR fails completely on one family of instances, where the 
average GR-tour is more than 2000% above the optimum. Computational experiments 
for STSP in [JoMc02] show that both GR and NN perform relatively well on Euclidean 
instances and perform poorly for general STSP. GR appears to perform better than NN 
for STSP. 
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1 100 2 


4 1 2 


Figure 4.6.3: Performance of Nearest Neighbor can be arbitrarily bad. 


Insertion Algorithms 


Vertex insertion, another type of construction heuristic, applies to both Symmetric and 
Asymmetric TSP. For ATSP, the insertion algorithm begins with a cycle of length 2, 
and in each iteration, inserts a new vertex into the cycle. For STSP, the algorithm 
begins with a cycle of length 3. The algorithm descriptions and examples that follow 
are for ATSP, but with the obvious adjustments, they apply equally well to STSP. 


DEFINITION 


Dili: Let C = (t1,%2,...,im,t1) be the vertex sequence of a cycle in Kp, and let v 
be a vertex not on C. For any arc (a,b) on cycle C, the insertion of vertex v at 
arc (a,b) is the operation of replacing arc (a,b) with the arcs (a,v) and (u,b) (see 
Figure 4.6.4). The resulting cycle is denoted C(a,v,b). Thus, if (a,b) = (tx, in41), 
1<k<m-1, then C(a,v,b) = (%,%2,...,a,0,0,..-im,t1), and if (a,b) = (im,%1), 
then C(a,v,b) = (41, 72, ..-%m_—1, @, V, b). 


@v Vv 


Figure 4.6.4: Insertion of vertex v at arc (a, 6). 


Algorithm 4.6.3: Vertex Insertion (VI) 


Input: n x n distance matrix [d;,]. 
Output: TSP tour (i1,7%2,...,¢n,7%1). 


oOo 
Let 7, and iz be two vertices of K,, chosen by some rule. 
Initialize cycle C = (41, i2, 71). 


For s = 3,...,” 
Let 7; be a vertex not on cycle C’, chosen by some rule [*]. 
Insert vertex i, at an arc (a*,b*) of cycle C = (i1, 22,...,¢%s—1, 41) 


such that the weight of C(a*,is,b*) is minimum among the 
cycles C(a,is,b) for all arcs (a,b) in C. 
C := C(a*, is, b*). 


344 Chapter 4. Connectivity and Traversability 


REMARK 


R4: Random vertex insertion, nearest vertex insertion, and farthest vertex insertion, 
which are defined below, are three different versions of Algorithm VI. Each is determined 
by how rule [*] chooses vertex i,. 


oO 
NOTATION: Given a vertex v and a cycle C in Ky, d(v,C) denotes the distance from v 
to C, that is, d(v,C) = min {dy}. 
xEV(C) 


DEFINITIONS 
D12: The random vertex insertion (RVI) chooses vertex i, randomly. 


D13: The nearest vertex insertion (NVI) chooses vertex i, so that its distance to 
cycle C is a minimum. That is, d(is,C) = mo C)}. 
VU 


D14: The farthest vertex insertion (FVI) chooses vertex i, so that its distance to 
cycle C is a maximum. That is, d(t,,C) = ee C)}. 


COMPUTATIONAL NOTE: The vertex insertion heuristics described above perform quite 
well for Euclidean TSP (see [JoMc02]). Computational experiments with Algorithm RVI 
for ATSP in [Gl1GuYeZv01] show that RVI is good only for instances close to Euclidean. 


Minimum Spanning Tree Heuristics 


There are many more construction heuristics for TSP and especially for STSP (see, e.g., 
[JoGuMcYeZhZv02], [JoMc02], [KaRe04]). For STSP, the heuristics that are often given 
in the literature include the Double Minimum Spanning Tree (DMST) and Christofides 
(CH) heuristics (see, e.g., Algorithms 6.4.2 and 6.4.3 in [GrYe06]). Here is pseudocode 
for the DMST heuristic. Following the pseudocode is a remark highlighting the difference 
between the DMST and CH heuristics. 


Algorithm 4.6.4: DMST 


Input: n x n distance matrix [d;,]. 

Output: STSP tour (1, 72,...,%n, 71). 
Find a minimum spanning tree T in Ky. 
Create a eulerian multigraph H by doubling the edge set of T. 
Construct a eulerian tour W of H. 


Let W = (¢1,%2,-.-,%m,t1) (written as a sequence of vertices). 
For s = 3,...,n (look for shortcuts) 
If i, = % for some t < s, then delete i441,...,2; from W. 
REMARK 


R5: The only difference between the algorithms that use the CH and DMST heuristics 
is the way in which the eulerian multigraph H is constructed. In particular, using the 
CH heuristic, E(H) = E(T) UM, where M is a minimum-weight perfect matching in 
the subgraph of K,, induced on the odd-degree vertices of T. 
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DEFINITION 


D15: The operations to eliminate repeated vertices in the eulerian tour W in order to 
obtain an STSP tour are called shortcuts. 


COMPUTATIONAL NOTE: Implementing the ordinary shortcuts when using the CH 
heuristic already produces a relatively good heuristic for the Euclidean TSP [JoMc02]. 
However, so-called greedy shortcuts (see [JoMc02}) result in a modification of CH, which 
seems to be one of the best construction heuristics for the Euclidean TSP. Deineko and 
Tiskin [DeTil0] designed and implemented an algorithm that finds an optimal sequence 
of shortcuts for the multigraph H of DMST. The resulting heuristic appears to be among 
the best construction heuristics for STSP. 


FACTS 


F12: [JoPa85] The algorithm that uses the CH heuristic can be implemented to run 
in time O(n). 


F13: [DeTil0] For the Euclidean TSP, the algorithm of Deineko and Tiskin mentioned 
above runs in time O(n”) and memory O(n). 


Worst Case Analysis of Heuristics 


While computational experiments are important in the evaluation of heuristics, they 
cannot cover all possible families of instances of TSP, and, in particular, they normally 
do not cover the most difficult instances. Moreover, certain applications may produce 
families of instances that are much harder than those normally used in computational 
experiments. For example, such instances can arise when Generalized TSP (discussed 
later) is transformed into TSP. Thus, theoretical analysis of the worst possible cases is 
also important in evaluating and comparing TSP heuristics. 


FACTS 


F14: [GuYeZv02-a] For every n > 3, there is an instance of ATSP and an instance of 
STSP with n vertices satisfying the triangle inequality on which NN outputs the unique 
worst possible tour. 


F15: [GuYeZv02-a, GuYe07] For every n > 3, there is an instance of ATSP and an 
instance of STSP with n vertices satisfying the triangle inequality on which GR outputs 
the unique worst possible tour. 


F16: [Ru73] Let H be a tour produced by RVI for an instance [,, of STSP with n > 3 
vertices. Then H is not worse than at least (n — 2)! tours when n is odd and (n — 2)!/2 
tours when n is even (including H itself). 


F17: [GuYeZv02-a] For every n > 2, n #6 and every instance of ATSP with n vertices, 
RVI computes a tour T that is not worse than at least (n — 2)! tours, including T itself. 


F18: [PuMaKa03] For the Metric TSP, the DMST algorithm always produces a tour 
no more than twice as long as the optimal one, while the CH one produces tours never 
worse than 1.5 times the optimum (see [JoPa85]). However, there are instances for which 
DMST produces the unique worst possible tour, and there are instances for which CH 
produces a tour worse than all but at most [n/2]! tours. 
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F19: [PaVa84] Given the multigraph H created by the CH heuristic, it is NP-hard to 
find an optimal sequence of shortcuts even for the Euclidean TSP. 


F20: For the Metric TSP, given the multigraph H of DMST, we can find an optimal 
sequence of shortcuts (a) in time O(n? + 2¢n?) and memory O(2¢n?) [BuDeWo98] or 
(b) in time O(4%n?) and O(4¢n) [DeTil0], where d equals half the maximum degree of 
Al. 


Part (b) of Fact F20 implies Fact F13 as d < 6 for the Euclidean TSP [DeTi10]. 
REMARKS 


R6: A simplified proof of Fact F14 for ATSP can be found in [GuYeZv02-b]. It is 
based on a proof of a much more general result for the greedy algorithm in combinatorial 
optimization (see [GuYe02-a]). 


R7: The proof of Fact F15 for ATSP in [GuYeZv02-b] cannot be directly used for 
STSP. A proof of Fact F15 for STSP is given in [GuYe07]. 


R8: A proof of Fact F17 is similar to the proof of Fact F16, but is based on a different 
result that was first proved for n odd by Sarvanov [Sa76], and for n even by Gutin and 
oOo 


Yeo [GuYe02-b]. The proofs use decompositions of K,, and K,,, into hamiltonian cycles, 


oO 
which exist for K,, if and only if n is odd (see, e.g., [Ha69]), and for K,, if and only if 
n #4 or 6 [Ti80]. 


4.6.4 Improvement Heuristics 


Approximate algorithms based on improvement heuristics start from a tour (nor- 
mally obtained using a construction heuristic) and iteratively improve it by changing 
some parts of it at each iteration. The best known tour improvement procedures are 
based on edge exchange, in which a tour is improved by replacing k of its edges (arcs) 
with & edges (arcs) not in the solution. 


COMPUTATIONAL NOTE: For many combinatorial optimization problems, well-known 
metaheuristics including tabu search, simulated annealing, and genetic algorithms pro- 
vide the best tools for producing good quality approximate solutions. This has not 
been the case for TSP, for which variations of the edge-exchange algorithms of Lin 
and Kernighan (Lin—Kernighan local search) are still state-of-the-art ({JoMc02], [Ap- 
BiChCo06], [He09]). They are typically much faster than the exact algorithms, yet often 
produce solutions very close to the optimal one. Interested readers can find a detailed 
description of the Lin—Kernighan local search and its generalizations in [ReGl02] and 
[ApBiChCo06]. Although the Lin—Kernighan local search can be applied only to STSP, 
ATSP can be transformed into STSP (see, e.g., [JoGuMcYeZhZv02}). 


DEFINITIONS 


D16: For STSP, the 2-opt algorithm starts from an initial tour T and tries to improve 
T by replacing two of its non-adjacent edges with two other edges to form another 
tour (see Figure 4.6.5). Once an improvement is obtained, it becomes the new T. The 
procedure is repeated as long as an improvement is possible (or a time limit is exceeded). 


Section 4.6. Traveling Salesman Problems 347 


j 
Figure 4.6.5: Edges {i,j} and {k,l} are replaced by {i,/} and {j, k}. 


D17: For k > 3, the k-opt algorithm is the same as 2-opt except that k edges (arcs) 
are replaced at each iteration. 


D18: The best improvement k-opt is similar to the k-opt defined above; the differ- 
ence is that every set of k edges (arcs) is tried for deletion from T and all possibilities 
of adding & edges (arcs) are considered before the best replacement of T is retained (as 
a replacement for T). The procedure is repeated. 


FACT 


F21: [PuMakKa03] Although best improvement 2-opt can take exponential time to find 
a local optimum, any (possibly sub-optimal) tour obtained after O(n? log n) iterations is 
not worse than — of all STSP tours. Similar statements hold for the best improvement 
k-opt, k > 3. 


Exponential Neighborhoods 


Best improvement k-opt considers O(n) tours that can be obtained from a tour T 
by replacing edge-exchanges involving exactly k edges (arcs). Thus, it considers only 
a polynomial number of tours in the neighborhood of T. For TSP, one can construct 
various neighborhoods with an exponential number of tours in which the best tour 
can be found in polynomial time. In particular, there exist TSP neighborhoods of 
size 208") (2°(™)), where the best tour can be found in time O(n?) (O(n)). These 
neighborhoods are discussed in [AhErOrPu02], [DeWo00], and [GuYeZv02-b]. 


COMPUTATIONAL NOTE: While having seemingly strong theoretical properties, TSP 
exponential-neighborhood, local-search algorithms have not shown strong computation 
properties so far. Perhaps this is due to the fact that it is not the size of the neighborhood 
that matters, but the total number of tours of TSP that are worse than the best tour in 
the neighborhood. This may explain why computational experiments show that some 
exponential-neighborhood, local-search heuristics are worse than the (seemingly much 
weaker) 3-opt. 
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4.6.5 The Generalized TSP 


The Generalized TSP (GTSP) has numerous applications and is one of the most 
studied extensions of TSP [FiSaTo02]. 


DEFINITIONS 


D19: The Generalized Asymmetric Traveling Salesman Problem (GATSP): 


<P 
Given a weighted complete digraph K,, and a partition Vi,...,V, of its vertex-set, find 
a minimum-weight cycle containing exactly one (at least one) vertex from each set V;, 
C= Ay nceg he: 


D20: The sets V; are called clusters, and a cycle containing exactly one (at least one) 
vertex from each cluster is called a tour. 


D21: The Generalized Symmetric Traveling Salesman Problem (GSTSP) is 
o> 
formulated similarly with K,, replaced by Kp. 


REMARK 


R9: Observe that the requirements ‘at least one’ and ‘exactly one’ in GATSP and 
GSTSP coincide when the triangle inequality holds. The ‘exactly one’ versions of 
GATSP and GSTSP have received the most attention in the literature, and only these 
versions are discussed here. 


Transforming Generalized TSP to TSP 


One of the ways to solve instances of the Generalized TSP is to transform them into 
TSP instances. The most efficient transformations from GATSP to ATSP and from 
GSTSP to STSP appear to be the ones given in [NoBe93] and [LaSe99], respectively. 


FACTS 


F22: In the transformation of [NoBe93], from GATSP into ATSP, the number of 
vertices remains the same. Weights are modified so that an optimal ATSP tour must 
visit all the vertices that belong to the same cluster in the original problem before 
moving on to the next cluster. This is achieved by adding a large positive constant M 
to the weight of each inter-cluster arc. If the constant is large enough, an optimal tour 
will contain exactly k such heavy arcs, thus ensuring that no cluster is visited more 
than once. 


F23: In the transformation of [LaSe99], from a GSTSP instance into an STSP instance, 
we first add a sufficiently large positive constant to every edge-weight, if needed, to 
ensure that all edge-weights are nonnegative. Then we consider each cluster V; of 
cardinality at least 2. For each vertex v; in such a cluster, we create a copy v;. In each 
such cluster, we form a hamiltonian cycle C = (v1, v},...,Uz,U},U1) and assign weight 
—M to every edge of the form u,v; and weight —2M to the rest of the edges in cycle 
C, where M is any constant larger than the sum of n heaviest edges in the GSTSP 
instance. The weights of the remaining edges within the clusters and between the 
clusters are inherited from the corresponding weights of the GSTSP instance. Clearly, 


Section 4.6. Traveling Salesman Problems 349 


an optimal tour T of the resulting STSP instance will use all edges of weight —2M, all 
but one of the (—/)-weight edges from each cluster of cardinality at least 2, and edges 
between the clusters. By contracting every vertex vu; and its copy vj in T, we obtain an 
optimal tour of the GSTSP instance. 


F24: For the transformations in [LaSe99] and [NoBe93], there is a bijection between 
optimal tours in the original problem and those in the transformed one, making the 
transformations suitable for exact algorithms. 


Exact Algorithms 


FACTS 


F25: Computational experiments ([BeGuPeYeZv03] and [LaSe99]) have shown that 
the transformations given in [NoBe93] and [LaSe99] can be used to solve to optimality 
small to moderate instances of Generalized TSP. However, even small instances require 
substantial computation because of the corresponding TSP instances’ very large weights 
on some of its edges (arcs). 


F26: A successful branch-and-cut algorithm for GSTSP is described and analyzed in 
[FiSaTo02], and a Lagrangian-based approach for GATSP is given in [NoBe91]. 


The next result appears to be a major stumbling block for using a standard branch- 
and-bound for ATSP adapted for GATSP. 


F27: [GuYe03] Let D = (V, A) be a digraph and let V,,V2,...,V, be a partition of V. 
The problem of checking whether D has a 1-regular subdigraph containing exactly one 
vertex from each V,, Vo,...,V;, is NP-complete even if |V;| < 2 for every i =1,2,...,k. 
(A digraph H is 1-regular if the indegree and outdegree of every vertex in H equal 1.) 


Approximate Algorithms 


Researchers designed, implemented, and tested many GTSP heuristics: metaheuristics, 
local search algorithms, and construction heuristics. As of this writing, the most pow- 
erful GTSP heuristic approach is the use of memetic algorithms, which combine genetic 
algorithms and local search. The following is a general scheme of memetic algorithms. 


Algorithm 4.6.5: General Scheme of Memetic Algorithms 


1. Initialize: Construct the first generation of solutions using construction 
heuristics. 

2. Improve: Use a local search procedure to replace each of the first generation 
solutions by the local optimum. Eliminate duplicate solutions. 

3. Produce next generation: Use reproduction, crossover, and mutation genetic 
operators to produce the non-optimized next generation. Each of the genetic 
operators selects parent solutions from the previous generation. The length 
of a solution is used as the evaluation function. 

4. Improve next generation: Use a local search procedure to replace each of 
the current generation solutions except the reproduced ones by the local 
optimum. Eliminate duplicate solutions. 

5. Evolve: Repeat Steps 3-4 until a termination condition is reached. 
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COMPUTATIONAL NOTE: Memetic algorithms differ in their use of local search and 
genetic operators. Computational experiments for GTSP show that memetic algorithms 
with powerful local search provide best results. At the time of this writing, the best 
GTSP memetic algorithm is by Gutin and Karapetyan [GuKa10]. The algorithm applies 
to both GATSP and GSTSP. The algorithm can likely be improved if some local search 
algorithms recently developed in [KaGu] are used. For efficient adaptation of the Lin— 
Kernighan heuristic for GTSP, see [KaGu1]]. 


COMPUTATIONAL NOTE: The following Cluster Optimization heuristic [FiSaTo02] is 
used in many successful GTSP heuristics. It finds a minimum-weight cycle C’ having ex- 
actly one vertex from each cluster and traversing the clusters in the order V;,, Vi,,..., Vix 
by solving |V;,| shortest path problems, each associated with a different vertex vu € Vj,. 
In particular, for each v € Vj,, let v’ and v” be two distinct copies of vertex v, and 
construct an acyclic digraph D, with vertex-set {v’}UV;, U...UV;, U {v”} and whose 
weighted arcs are defined as follows: for each y € V;, and each x € V;,, there are arcs 
(v’,y) and (a, v’’) whose weights equal the weights of the arcs (v,y) and (x, v), respec- 
tively, in the original digraph; for each x € V;, and y € Vj,,,, t = 2,...,k — 1, there is 
an arc (x,y) with the same weight as arc (x,y) in the original digraph. The heuristic 
proceeds by finding a shortest (v’,v”)-path in each D,, and the minimum-weight cycle 
C' will correspond to a smallest one of these shortest paths. 


4.6.6 The Vehicle Routing Problem 


The Vehicle Routing Problem (VRP) was introduced by Dantzig and Ramser [DaRa59]. 
This problem (including its versions with additional constraints) seems to be the most 
applicable of all generalizations of TSP. Vehicle routing is the generic name given to a 
large family of problems involving the distribution of goods, information, services, or 
people. A particularly important special case of VRP is that of minimizing the total 
distance traveled by a fleet of vehicles that deliver goods ordered by customers. The 
vehicles are assumed to have equal capacity Q, and their delivery tours start and end 
at a central depot. 


DEFINITIONS 


D22: Given a weighted complete directed or undirected graph on vertices {0,1,...,n}, 
a demand d; > 0 for i = 1,2,...,n, and two positive integers Q (vehicle capacity) and 
k (number of vehicles), a CVRP tour is a collection of k cycles C1, Co2,...,Cx such 
that 


(i) U V(Gj) S40) Lh 
(ii) V(C;) NV (GC) = {0}, for 7 Al; and 


(iii) So d; < Q for each j =1,2,...,k. 


i€V(C;) 
D23: The Capacitated Vehicle Routing Problem (CVRP): Given a weighted 
complete directed or undirected graph on vertices {0,1,...,n}, a demand d; > 0 for 


i=1,2,...,n, and two positive integers Q and k, find a CVRP tour for which the total 
weight of the cycles is minimum. 
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REMARKS 


R10: Practitioners and researchers often consider additional complicating constraints. 
Some examples are: the total weight of each cycle is limited; each vertex must be visited 
within a prescribed time window, vehicles are allowed to have different capacities; routes 
of different vehicles cannot cross, etc. [PoKaWa99]. 


R11: In most research papers, the symmetric CVRP (on K,,+1) is considered. Never- 
theless, the asymmetric (i.e., ‘directed’) CVRP version is also of interest [Vi96]. 


Exact Algorithms 


FACTS 


F28: The most efficient exact algorithms for symmetric CVRP are those based on 
branch-and-cut ([BIHo00], [NaRi01], [RaKoPuTr03}). 


F29: For the asymmetric version of CVRP, it seems that the state-of-the-art exact 
algorithms still use branch-and-bound [ToVi01,ToVi02]. 


F30: Since CVRP has aspects of both TSP and Bin Packing, set-covering methods 
can sometimes be applied to CVRP and its generalizations with great success [BrSi01]. 


COMPUTATIONAL NOTE: ‘The exact algorithms appear to be less powerful for CVRP 
than they are for TSP. Although they are able to solve some instances with 100 or more 
vertices, the exact algorithms were unable to solve an instance of symmetric CVRP with 
as few as 51 vertices [RaKoPuTr03,ToVi02]. Often, practical versions of CVRP have 
various complicating constraints that cannot be tackled by exact algorithms. Thus, 
heuristics are of great importance for CVRP. 


Heuristics for CVRP 


CVRP heuristics fall roughly into two categories: those that produce a CVRP tour rela- 
tively quickly, and those that try to produce a near-optimal solution, using a substantial 
amount of computing, if necessary. The latter kind are mostly metaheuristic-based al- 
gorithms. Tabu search seems to provide a good tradeoff between the quality of solution 
and running time ({(ErOrSt06], [GeLaPo01], [Ta93], [ToVi02]). [DeFiTo06] uses TSP 
exponential neighborhoods to great effect. 


Fast CVRP heuristics are of great importance, supplying quick and flexible solutions, 
good starting tours for metaheurisic-based algorithms, and upper bounds for exact 
algorithms. We close this section with brief descriptions of three classes of fast CVRP 
heuristics: savings heuristics, insertion heuristics, and two-phase heuristics. 


REMARKS 


R12: An important difference between TSP and CVRP is that a CVRP heuristic may 
not produce a feasible solution even if one exists. We illustrate this fact below for the 
Clarke-Wright savings heuristic. 


R13: The descriptions that follow are for the asymmetric CVRP, but they also apply 
to the symmetric CVRP with digraph replaced by graph and arc replaced by edge. 
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Savings Heuristics 


The Clarke-Wright savings heuristic is perhaps the earliest [ClWr64] and best known 
heuristic for the VRP. Here, we describe a generic savings heuristic, whose concrete 
implementations may be found in [AlGa91], [ClWr64], and [LaSe01]. 


NOTATION: (a) For a vertex subset 5S, t(S) denotes (an approximation of) the weight 
of an optimal TSP tour of the induced subdigraph on S. (b) The total demand of a 


vertex subset S is d(S) = S> dj. 
ies 


DEFINITIONS 


D24: A merge of cycles C; and C2, denoted merge(C1, C2), is a cycle whose vertex 
set equals V(C) U V(C2). The resulting cycle is determined by some prescribed rule. 
Cycles C; and Cy can be merged only if the total demand of their vertices does not 
exceed capacity Q (i.e., d(V(C,) UV(C2)) < Q). 


D25: Given cycles C, and C2, the saving of merge(C1, C2), denoted s(C1, C2), is 
given by s(C1,C2) = t(V(C1)) + (V(C2)) — t(V (C1) UV (C2). 


oO 
D26: Let R = Ci,C2,...,Cm be a collection of m cycles of Ky, whose pairwise in- 
tersections are vertex 0. The savings digraph, D(R), is the weighted digraph on m 
vertices, labeled C1, C2,...,Cm, such that arc (C;, Cj) exists if d(V(C1) UV(C2)) < Q, 
and the weight assigned to arc (C;,C;) is the saving s(Ci, C;). 


REMARKS 


R14: In the Clarke-Wright savings algorithm ([ClWr64], [LaSe01]), the weight of cycle 
C is used as an estimate of t(V(C)). To obtain the exact value of t(V(C)), one would 
have to solve a TSP on the induced subdigraph on V(C), which may be too costly 
computationally. 


R15: The simplest way to merge cycles C; and C2 is the one used in the Clarke—Wright 
algorithm. If (i,0) is the arc in C; that enters vertex 0, and (0,7) is the arc in C2 that 
leaves vertex 0, then these two arcs are deleted and arc (i,j) is added to complete the 
new cycle. (See Figure 4.6.6.) 


j 
Ji 
(0) 


Figure 4.6.6: A Clarke—Wright merge of cycles C) and C2. 


merge (C, , Cy) 


Section 4.6. Traveling Salesman Problems 353 


Algorithm 4.6.6: Savings Heuristic (SH) 


Input: distance matrix [d;;]; demands d;,i=1,...,n; capacity Q; 
and number of vehicles k. 
Output: CVRP tour R = {C1,...,Cg}. 
Initialize cycle count m = n and cycles C; = (0,7,0), i=1,... 
Initialize R = {Ci,...,Cm}. 
While m > k 
Construct savings digraph D(R). 
Construct a matching M in D(R) with |M| <m-—k. 
For each are (C;,C;) in M, 
R= (R = {Ci, C;}) U merge(C;, C;) 


m:=m-—1 


,;m. 


REMARKS 


R16: The easiest way to construct M is to simply choose a pair (C,, C) with maximum 
saving s(C,, Cy) as the only arc in M. In some versions in which cycle-merging occurs 
in parallel, M is built in a greedy manner [LaSe01]. 


R17: Often, after R is produced by Algorithm SH, each of the cycles in R is improved 
by some TSP improvement heuristic. For example, in [LaSe01], a CVRP tour found by 
the Clarke—Wright algorithm is improved by applying the best improvement 3-opt to 
each of its cycles. 


R18: Let n = 12, k = 2, Q = 6, and d; = 1 for each i. Clearly, there is a feasible 
solution. However, the Clarke—-Wright algorithm may first produce six cycles, each 
containing two vertices different from 0, and then construct three cycles, each containing 
four vertices different from 0. Now there is no possibility to obtain a feasible solution. 


Insertion Heuristics 


In CVRP iteration algorithms [LaSe01], we start from k cycles of the form C, = (0, ip, 0). 
The vertices outside of the cycles are inserted one by one in sequential or parallel manner. 
The word ‘parallel’ here means that a vertex is inserted in one of the current cycles Cp 
for which the insertion is most beneficial. In the sequential mode, we start constructing 
a new cycle only when the previous one cannot be increased because of the capacity 
constraints. The cost of insertion of a vertex 7 into a current cycle C, can be measured 


by ((V(C) U {i}) — L(V (C)). 
REMARKS 


R19: An example of such an algorithm is the Christofides—Mingozzi—Toth insertion 
heuristic [ChMiTo79]. 


R20: Fisher and Jaikumar [FiJa81] suggested inserting all vertices at the same time. 
They apply the Generalized Assignment Problem to find ‘optimal’ insertions that do 
not violate the capacity constraints. 
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Two-phase Heuristics 


The basic idea of two-phase heuristics is to partition vertices {1,2,...,n} into k clusters 
Vi,...,V~ and solve TSP for each of the graphs induced by V; U {0}, i = 1,2,...,k. 
Wren and Holliday [WrHo72] suggested a sweeping technique for the Euclidean CVRP, 
in which the depot 0 and vertices 7, 1 <%< n are points on the Euclidean plane, and 
dj; is the Euclidean distance between i and j. 


We introduce a polar coordinate system, in which one of the n vertices, say 7, is chosen 
as the reference point, with polar coordinates (0,do;). Then every vertex j # i has 
coordinates (¢;,do;), where ¢; is the angle between the rays from 0 to i and from 0 to 
j. The sweeping algorithm in its simplest form is as follows. 


Algorithm 4.6.7: Sweeping Heuristic 


Input: distance matrix [d;,;]; polar angles ¢; for 1 <i<n; 
demands d;,i=1,...,n; capacity Q; and number of vehicles k. 
Output: CVRP tour consisting of cycles Cj, j = 1,...,k. 


Sort the vertices 21,%2,...,% such that ¢;, < @j,,,,8=1,...,n—1. 
Initialize S$; =@, 7 =1,...,k. 
Set 7 = 1. 
For s=1,...,” 

If d(S; U {ts}) > Q 

gage 

S; = S; U {i,} 
For j = 1,...,k 

Let C; be a TSP tour for the subgraph induced on S; U {0}. 


REMARK 


R21: An extension of this approach to the general CVRP is described in [BrSi95]. 
Another example of a two-phase heuristic is a truncated branch-and-bound provided in 
[(CHMiTo79]. 
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INTRODUCTION 

Continuing the study of connectivity, initiated in 84.1 of the Handbook, we survey 
here some (sufficient) conditions under which a graph or digraph has a given connectivity 
or edge-connectivity. First, we describe results concerning maximal (vertex- or edge-) 
connectivity. Next, we deal with conditions for having (usually lower) bounds for the 
connectivity parameters. Finally, some other general connectivity measures, such as one 
instance of the so-called “conditional connectivity,” are considered. 

For unexplained terminology concerning connectivity, see $4.1. 


4.7.1 High Connectivity 


Since connectivity has to do with “connection,” intuitively we can expect to find 
high connectivity when the “edge density” of the graph is large. Different situations in 
which this seems to be the case are: 


(a) Large minimum or average degree. 

(b) Small diameter (for given girth). 

(c) Small number of vertices (for given degree and girth). 

(d) Large number of vertices (for given degree and diameter). 


The results in this subsection give several conditions of the above types, under 
which maximum vertex- or edge-connectivity is attained. An extensive collection of 
results about maximally edge-connected and vertex-connected graphs and digraphs can 
be found in the survey by Hellwig and Volkmann [HeVo08b]. 
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Minimum Degree and Diameter 


NOTATION: Let G = (V,£) be a graph with order n, minimum degree 6, maximum 
degree A, edge-connectivity A, and (vertex-)connectivity «. In some other sections of 
the Handbook, the notations dmin, Omax, Ke, aNd Ky are used instead of 6, A, A, and k, 
respectively. 


DEFINITIONS 


D1: The girth g of a graph G with a cycle is the length of its shortest cycle. An 
acyclic graph has infinite girth. 


D2: The diameter D of G is max {diste(u, v)}. 
U,vEe 


D3: The clique number of a graph G, denoted w(G), is the maximum number of 
vertices in a complete subgraph of G. 


D4: A (di)graph G is p-partite if its vertex-set can be partitioned into p independent 
(or stable) sets. 

FACTS 

F1: [Ch66] If 6 > |n/2], then G is maximally edge-connected (i.e., A = 6). 


F2: [Le74] If for any non-adjacent vertices u and v, deg(u) + deg(v) > n — 1, then 
A=. 


F3: [P175] If G is a graph with diameter D = 2, then A = 0. 
F4: [HeVo08a] For any graph G, \(G) = 6(G) or \(G) = 6(G). 
F5: [Vo88] If G is bipartite and 6 > |n/4| +1, then A=. 


F6: [Vo89] If G is p-partite (p > 2) andn <2 | | a2 Shenae: 


F7: [ToVo93] If G is p-partite (p > 2) andéd>n ae then G is maximally connected 
(i.e., K = 0). 


F8: [DaVo95] If G is p-partite (p > 2) with clique number w < pand n < 2 | 9 -1, 
then A = 0. 

REMARKS 

R1: It is easily shown that Fact F3 > Fact F2 => Fact F1. 

R2: Fact F5 is a slight improvement of Fact F6 for p = 2. 


R3: In addition to Fact F8, the authors in [DaVo95] gave other sufficient conditions 
for A = 6 that mostly generalize conditions in [P1Zn89]. 


R4: A consequence of Fact F4 is that A(G) = 6(G) for any self-complementary graph 
(G =). 
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Degree Sequence 


NOTATION: For the next group of results, G is an n-vertex graph with degree sequence 
d, > dy >--->d, = 0. For a vertex u, N(u) denotes the set of vertices adjacent to u. 


FACTS 


F9: [GoWh78] If the vertex set of G can be partitioned into |n/2| pairs of vertices 
(ui, vi) (and, if n is odd, one “unpaired” vertex w) such that deg(u;) + deg(vi) > n, 
¢=1,2,...,|n/2], then \=0. 


F10: [GoEn79] If each vertex u of minimum degree satisfies 


[n/2|? —|n/2], for even n or odd n < 15, 
Dd. deg(v) = ' 
veEN(u) [n/2|* — 7, for odd n > 15, 


then A = 6. 


F11: [Bo79] Let G be a graph with order n > 2. If its degree sequence d; > dz >--- > 
dn = 6 satisfies Sear +dy-;) > kn —1 for all k with 1 < k < min{|n/2] — 1,6}, 
then A = 6. 


F12: [DaVo97] If 5 > |n/2| or if 6 < [n/2] —1 and S0*_, (di + dn4i_s_1) > k(n —2)+ 
26 — 1 for some k with 1 << k <6, then A= 6. 


F13: [Vo03] Suppose that G is p-partite (p > 2) and has order n > 6 with clique 
number w < p. Let vy = 1 when n is even and v = 0 when n is odd. If 6 > |n/2| or if 


o+1 —-ln Z 
5 < |[n/2] —1 and i271 dngi-a > (6 + 1) PS HY — Pte, then A= 6. 


REMARKS 


R5: Note that Fact F9 implies Fact Fl only when n is even. Fact F10 also implies 
Fact Fl. Moreover, as shown by the examples in [P1Zn89], Fact F10 is independent of 
Fact F2 and Fact F3. 


R6: Fact F11 implies Fact F1 when n is even, but in general, as shown in [P1Zn89], it 
is independent of Facts 1, 2, 3 and 10. 


R7: Fact F12 is even valid for digraphs, and a theorem of Xu [Xu94] follows easily (see 
Fact F23). It is easily shown that Fact F12 implies Fact F11. 


R8: Fact F13 generalizes results in [Vo88, Vo89], as well as Fact F8. Furthermore, as 
shown in [HeVo03b], the conditions in Fact F13 also guarantee maximum local edge- 
connectivity for all pairs u and v of vertices in G; that is, A(u—v) = min{deg(u), deg(v)}. 
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Distance 


DEFINITIONS 


D5: The distance distg(U,, U2) between two given subsets U,,U2 C V(G) is the 
minimum of the distances distg(u1, u2) for all vertices u; € Uy and uz € U2. (When 
there is no ambiguity, we omit the subscript G.) 


D6: = The line graph L(G) of a graph G has vertices representing the edges of G, 
and two vertices are adjacent if and only if the corresponding edges are adjacent (that 
is, they have one endpoint in common). 


FACTS 


F14: Let wiv; and ugve be edges in a graph G, and let U; = {u;,u;}, 7 = 1,2. Then, 
the distance between the corresponding vertices of L(G) satisfies dz(g)(uiv1,u2v2) = 
dg(U;, Uz) +1 and thus, the diameters of L(G) and G satisfy D(L(G)) < D(G) +1. 


F15: [P1Zn89] Let G be a connected graph such that every pair of vertex subsets U;, U2 
of cardinality two satisfies dist(U;,U2) < 2. Then \ = 0. 


F16: [BaCaFaFi96] Let G be a graph with minimum degree 6 and line graph L(G). 
Then, 


(a) If L(G) has diameter at most three, then A = 0. 


(b) If L(G) has diameter two, then « = 0. 


REMARKS 


R9: The sufficient condition given in Fact F15 is slightly weaker than the one given 
in Fact F3. Furthermore, it suffices to require such a condition on the 2-element subsets 
that are the endpoints of some edge, as shown in Fact F16(a). 


R10: From the above remark, Fact F16(a) generalizes both Fact F15 and Fact F3 
(Plesnik’s result). 


Super Edge-Connectivity 
Here we consider a stronger measure of edge-connectivity. 
DEFINITION 


D7: A maximally edge-connected graph is super- if every minimum edge-discon- 
necting set is trivial, that is, consists of the edges incident on a vertex of minimum 
degree. 
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EXAMPLE 


E1: Figure 4.7.1 shows a 3-regular maximally edge-connected graph that is not 
super-A. The set {e, f,g} is a non-trivial minimum edge-disconnecting set. 


LES 


Figure 4.7.1: G is maximally edge-connected but not super-.. 


FACTS 


F17: [Le74] Let G # Ky 2 x Ko. If for any non-adjacent vertices u and v, deg(u) + 
deg(v) > n, then G is super-A. 


F18: If for any non-adjacent vertices u and v, deg(u) + deg(v) > n+ 1, then G is 
super-A. 


F19: [Ke72] If 6 > |n/2| +1, then G is super-.. 


F20: [Fi92] If G has diameter two and contains no complete subgraph H on 6 vertices 
with dega(v) = 6 for all v € V(#), then G is super-. 


F21: [S092] Let G be a graph with maximum degree A. If n > 26 + A — 1, then G is 
super-A. 
REMARKS 


R11: Facts F18 and F19, which are analogues of Facts F2 and F1, are direct conse- 
quences of Fact F17. 


R12: Fact F20 can be seen as a refinement of Fact F3 (where only the diameter 
condition is required) and has Fact F21 as a corollary. 


Digraphs 


As mentioned in §4.1, since the connectivity parameters of a graph G equal those of its 
symmetric digraph G* (obtained by replacing each edge of G by a digon), many of the 
previous results can be generalized to the directed case. 


DEFINITIONS 


D8: The vertex-connectivity of a digraph G, denoted «(G), is the minimum size of 
a vertex subset whose deletion results in a non-strongly connected or trivial digraph. 
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D9: The edge-connectivity of a digraph G, denoted \(G), is the minimum size of an 
edge subset whose deletion results in a non-strongly connected digraph. 


NOTATION: (a) For a vertex u € V(G), degt(u) denotes the out-degree, the number 
of vertices adjacent from vertex u, and deg (u) denotes the in-degree, the number of 
vertices adjacent to vertex u. Let 6(u) = min{deg* (u), deg™ (u)}. 

(b) 6* = min,cy{degt(u)} and 6~ = min,cy{deg~ (u)}. 

(c) 6 = minyey 6(u) = min{6T, d- }. 

Similar notations with A stand for maximum degrees. 

NOTATION: For vertices u,v € V(G), A(u — v) denotes the maximum number of edge- 
disjoint directed paths from u to v. 


FACTS 
F22: [Jo72] If G is a digraph with diameter D = 2, then A = 6. 


F23: [Xu94] Let G be a digraph of order n. If there are |n/2| pairs of (different) 
vertices (u;,v;) such that d(u;) + 6(v;) >n,7=1,2,...,|n/2], then A = 6. 


F24: [HeVo03b] Let G be a digraph with diameter at most two. Then, A(u—v) = 
min{degt (u), deg (v)} for all pairs wu and v of vertices in G. 


F25: [HeVo03a] Let G be a strongly connected digraph with edge-connectivity \ and 
minimum degree 6. If for all maximal pairs of vertex sets X and Y at distance 3 there 
exists an isolated vertex in the induced subgraph on X UY, then A = 6. 


F26: [{HeVo03b] Let G be a p-partite digraph of order n and minimum degree 6 with 
p> 2. Ifn < 2|(pd)/(p — 1)| —1, then A(u—v) = min{degt (uw), deg” (v)} for all pairs u 
and v of vertices in G. 


F27: [HeVo03b] Let G be a bipartite digraph of order n and minimum degree 6 > 2 
with the bipartition V’ UV”. If deg(x) + deg(y) > (n+ 1)/2 for each pair of vertices 
x,y € V’ and each pair of vertices x,y € V”, then \(u—v) = min{degt (u), deg” (v)} 
for all pairs u and v of vertices in G. 


REMARKS 


R13: Notice that Plesnfk’s result (Fact F3) is, in fact, a consequence of the older result 
of Jolivet (Fact F22). Similarly, Fact F23 generalizes Fact F9. 


R14: Fact F23 was improved by Dankelmann and Volkmann in two subsequent papers 
[DaVo97, DaVo00], where the bipartite case was also considered. 


R15: A restatement of Fact F24 states that a digraph with diameter two has maximum 
local edge-connectivity. Moreover, this obviously implies Jolivet’s result (Fact F22) and 


the corresponding local connectivity result for undirected graphs, proved in [FrOeSw00]. 


R16: A consequence of Fact F25 is the directed version of Fact F15. 
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Oriented Graphs 


DEFINITIONS 


D10: A digraph is super-A if every minimum edge-disconnecting set consists of the 
edges directed to or from a vertex with minimum degree. A digraph is super-« if every 
minimum disconnecting set consists of the vertices adjacent to or from a vertex with 
minimum degree. 


D11: An oriented graph G (also called an antisymmetric digraph) is a digraph 
such that between any two vertices u,v, there is at most one (directed) edge ((u,v) or 


(v,u)). 


EXAMPLE 


E2: Figure 4.7.2 shows a 2-regular maximally connected digraph G that is not super-k. 
If F = {x,y}, then G — F is not strongly connected (for instance, there is no [directed] 
path in G— F from u to v) and F is non-trivial (it does not consist of the vertices 
adjacent to or from a vertex with minimum degree). 


y 


Figure 4.7.2: G is maximally connected but not super-k. 


FACTS 


F28: [AyFr70] Let G be an oriented graph with n vertices and minimum degree 6. If 
6 > |(n+2)/4], then \=6. 


F29: [Fi92] If G is an oriented graph with n vertices and minimum degree 6 > |n/4|+1, 
then G is super-A. 


F30: [Fi92] If G is an oriented graph with diameter two, then G is super-,. 


REMARKS 


R17: Facts F28 and F29 are analogues of Fact Fl, whereas Fact F30, similar to 
Fact F22, is a consequence of Fact F20. 
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R18: In fact, the sufficient conditions given in [AyFr70] and [Fi92] (Facts F28 and 
F29) were 6+ + 6~ > [n/2| and 6+ +6~ > |n/2| +1, respectively. Furthermore, it is 
easily shown that Facts F29 and F30 do not imply each other. 


R19: Higher connectivity in tournaments, which are oriented complete graphs, is 
discussed in §3.3 of the Handbook. 


Semigirth 


To generalize Jolivet’s result (Fact F22) and give new results on superconnectivity, it is 
relevant to consider a new parameter related to the path structure of the digraph. In 
our context, this parameter plays a role similar (and is tightly related) to the girth of a 
graph. 


DEFINITIONS 


D12: [FaFi89, FiFaEs90] For a given digraph G = (V, E) with diameter D, the semi- 
girth, denoted &(G), is the greatest integer between 1 and D such that for any u,v € V, 


(a) if dist(u,v) < £, the shortest u-v directed walk is unique and there are no u-v 
directed walks of length dist(u,v) +1. 


(b) if dist(u, v) = @, there is only one shortest u-v directed walk. 


D13: A digraph G is a generalized p-cycle when it has its vertex set partitioned in 
p parts cyclically ordered, and vertices in one part are adjacent only to vertices in the 
next part. Thus, a generalized 2-cycle is the same as a bipartite digraph. 


EXAMPLE 


E3: Figure 4.7.3 shows a 2-regular digraph for which the semigirth @ is equal to its 
diameter, namely, = D = 3. 


Figure 4.7.3: Semigirth €= D=3. 


368 Chapter 4. Connectivity and Traversability 


FACTS 


F31: [FaFi89] Let G be a digraph with minimum degree 6 > 1, diameter D, semigirth 
£, and connectivities « and X. 


(a) If D < 2é, then \= 0. 
(b) If D < 2¢—1, then G is super-A and k = 6. 
(c) If D < 2¢— 2, then G is super-k. 
F32: [FaFi96a, PeBaGo01] Let G be a generalized p-cycle (p > 2). 
(a) If D < 2€+ p—1, then \ = 60. 


(b) If D < 2€4 p— 2, then G is super-A and « = 6. 


(c) If D < 20+ p— 3, then G is super-r. 


F33: Any bipartite digraph with diameter three is maximally edge-connected. 


REMARKS 


R20: The main idea in the proof of the results in Fact F31 is that semigirth 2 measures 
how far away one can move from or to a given subset F’ of vertices. For instance, in 
proving (a), it is shown that if |F'| < 6, in any connected component of G — F' there are 
vertices u,v such that dist(u, F),dist(F,v) > @. Hence, any shortest path of length at 
most 2¢—1 cannot contain a vertex of F’. As a conclusion, F’ cannot be a disconnecting 
set. 


R21: Since any digraph G has semigirth @ > 1, Fact F22 is included in Fact F31(a). 
R22: Fact F33 is the analogue for bipartite digraphs of Jolivet’s result (Fact F22). 


In fact, for a bipartite (di)graph, the condition 6 > |n/4| +1 implies D < 3, so that 
Fact F33 can be also seen as a generalization of Fact F5. 


Line Digraphs 

DEFINITION 

D14: The line digraph of a digraph G, denoted L(G), has V(L(G)) = E(G), and a 
vertex (u,v) is adjacent to a vertex (w,z) if v = w (that is, the head of edge (u,v) is 
the tail of edge (w, z) in digraph G). The k-iterated line digraph, L"(G), is defined 
recursively by L*(G) = L(L*-1(G)). 

FACTS 


F34: The order of L(G) equals the size of G, |V(L(G))| = |E(G)|, and their minimum 
degrees coincide, 5(L(G)) = 6(G) = 6. Moreover, «(L(G)) = A(G). 
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F35: If G is d-regular, d > 1, has order n, diameter D, and semigirth @, then L*(G) 
is also d-regular, has d*n vertices, diameter D(L‘(G)) = D(G) + k, and semigirth 
e(L*(G)) = £(G) +k. See the papers [Ai67, ReKuHoLe82, FiYeA184, FaFi89}. 


F36: [FaFi89] Let G be a digraph with minimum degree 6 > 1, diameter D, and 
semigirth @. 


(a) Ifk > D— 20, then L*(G) is maximally edge-connected. 


(b) If k > D—2¢+1, then L*(G) is super-\ and maximally connected. 


(c) Ifk > D—2¢+4 2, then L*(G) is super-k. 


REMARK 


R23: As shown in Fact F36, the interest of considering k-iterated line digraphs stems 
from the fact that if k is large enough, Fact F35 guarantees that the conditions of 
Fact F31 hold. 


Girth 


For a given girth, high density/connectivity graphs occur when they have a reduced 
diameter, and also when they have a small number of vertices. 


DEFINITION 


D15: The same definition for the semigirth (Definition D12) applies for an undirected 
graph G (considering undirected walks). In this case, it turns out that the semigirth 
= ¢(G) = &(G*) equals |(g — 1)/2| where g = g(G) stands for the girth of G. 


FACTS 


F37: Let G be a graph with minimum degree 6 > 1, diameter D, girth g, and connec- 
tivities « and X. 


g-1, godd, 
g—2, g even, 


(a) [SoNaIlm85, SoNaIlmPe87, FaFi89] If D < then A = 6. 
g—2, g odd, 


3 then G is super-A and 
g—3, g even, 


(b) [SoNaIm85, SoNalmPe87, FaFi89] If D < 
K=O. 


g—3, g odd, 


4 then G is super-k. 
g—4, g even, 


(c) [SoNaIm85, SoNalmPe87, FaFi89] If D < 


(d) [BaCeDiGVMa06] If D < g — 3, then G is super-x. 


(e) [BaTaMaLi09] If G is regular and D < g — 2, g odd, then G is super-k. 


(f) [BaMaMol10] If 6 > 3, A < 36/2 —1, and D < g — 2, g odd, then G is super-rt. 
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F38: [BaCaFaFi96, CaFa99] Let G be a graph with minimum degree 6 > 1, girth g, 
and connectivities « and A. Let L(G) be the line graph of G, with diameter D(L(G)). 
Then, 


then A = 6. 


(a) If D(L(G)) < {° ae 
g—1, g even, 


=A 
(b) If D(L(G)) < {! ya aes Cas Sane Gree: 
g—2, g even, 


3 then G is super-K. 
g—3, g even, 


— dd 
(c) If D(L(G)) < © ee BONS, 
F39: [FaFi96a] Any bipartite graph with diameter three is maximally edge-connected. 
F40: [KnNi03] For every graph G there is a number i(G) such that L*(G) is maximally 
connected when k > i(G). 
REMARKS 
R24: Fact F37 is a simple consequence of Definition D15 and Fact F31. 


R25: Fact F39 is the undirected version of Fact F33, which can be seen as Plesnik’s 
analogue for the bipartite case. 


R26: Fact F40 is based on a result of Hartke and Higgins [HaHi99] about the growth 
of minimum degree in iterated line graphs. For regular graphs this result is not needed, 
and in this case i(G) < 5. 


Girth Pair 


DEFINITIONS 


D16: The girth pair (gi, 92) of a graph G gives the length g; of a shortest odd cycle 
and the length gz of a shortest even cycle. 


EXAMPLE 


E4: The Dodecahedron graph is a cubic graph with girth 5 and a shortest even cycle 
has length 8. Hence its girth pair is (5,8). 


FACTS 


F41: [BaCeDiGVMa07, BaGVMol11] Let G be a graph with minimum degree 6 > 3, 
diameter D, girth pair (g,h), odd g and even h with g +3 < h < ov, and connectivities 
kK and X. 


(a) If D<h—83, then A= 0. 
(b) If D<h—4, then k= 0. 
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(c) If D< h—4 and 6 > 4, then G is super-k. 
(d) If D<h—5 and 6 =3, then G is super-k. 
(e) If g > 5, D(L(G)) < h—3, and the maximum degree of G satisfies A < 26 — 3, 
then K = 6. 
REMARKS 


R27: Fact F41 improves Fact F37 for graphs with girth pair (g,h), g odd and h > g+3 
even. 


Cages 


DEFINITIONS 


D17: A (k,g)-cage is a k-regular graph with girth g having the least possible number 
of vertices. 


D18: A 3-connected graph G = (V, E) is said to be quasi 4-connected if for every 
vertex-cut FC V such that |F| = 3, F is the neighborhood of a vertex of degree 3 and 
G — F has exactly two components. 


EXAMPLE 


E5: The Heawood graph, shown in Figure 4.7.4, is a (3,6)-cage with order 14 and 
diameter 3. 


Figure 4.7.4: The Heawood graph. 


FACTS 
F42: [FuHuRo97] All (k, g)-cages are 2-connected. 


F43: [JiMu98, DaRo99] Every (k, g)-cage with k > 3 is 3-connected. 
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F44: [MaPeBa02] Every (3, g)-cage is superconnected, edge-superconnected, and quasi 
4-connected. 


F45: [WaXuWa03] Every (k,g)-cage with k > 3 and odd girth g is maximally edge- 
connected. 


F46: [MaBa04] Every (k, g)-cage with k > 3 and odd girth g is super-y. 
F47: [LiMiBaMa06] Every (k,g)-cage with & > 3 and even girth g is super-,. 
F48: [XuWaWa02] Every (4, g)-cage is 4-connected. 

F49: [MaBaPeFa05] Every (k, g)-cage with k > 4 and g > 10 is 4-connected. 


F50: [LiMiBa05] Every (k, g)-cage with & > 3 and odd girth g > 7 is r-connected with 
r>Vk+1. 


F51: [LiBaMaMi08] Every (k,g)-cage with k > 3 and even girth g > 6 is (r + 1)- 
connected, r being the largest integer such that r? + 2r? < k. 


F52: [MaBaPe07] (k,6)- and (k,8)-cages are maximally connected. 


F53: [ArGoMoSe07] (k, 12)-cages are maximally connected. 
F54: [LuWuLuLil0] Every (k, g)-cage with k > 3 and odd girth g > 9 is [Vk + Vk — 2]- 


connected; and every (k, g)-cage with k > 3 and even girth g > 10 is r-connected, where 
r is the largest integer such that r(r — 1)?/4+1+2r(r—1) <k. 


F55: [BaSal2] Every (k,g)-cage with k > 3 and odd girth g > 7 is r-connected with 
r > [k/2]. 


CONJECTURE 
[FuHuR097] Every (k, g)-cage is maximally connected. 


Large Digraphs 


The following results support the intuitive idea that dense (di)graphs have high con- 
nectedness. 


DEFINITION 

D19: For a digraph with maximum degree A and diameter D, the Moore bound, 
denoted n(A, D), is given by n(A,D) =1+A+A?4+---+A?. 

FACTS 

F56: An n-vertex digraph with maximum degree A and diameter D has n < n(A, D). 


F57: [Wa67] The order of a (di)graph with connectivity « > 1 and diameter D satisfies 
n>n«(D—1)+2. 


F58: [ImSoOk85] 
(a) If A <6, then n < A\(n(A, D—2)+ A+1). 
(b) If kK < 46, then n < w(n(A, D—1) +A). 
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F59: 
(a) If n > (6-1) (n(A, D- 2) + A+1), then \ =. 


(b) Ifn > (6-1) (n(A, D —- 1) + A), then & = 6. 
F60: [Fi93] 
(a) If \ <6, then n < A(n(A, D—-2)+1)+A. 


(b) If« <6, then n < «(n(A, D—1)-1)+A+1. 
F61: [Xu92, Fi93] Let G be d-regular. 
(a) Ifn > d?-! 4 2d— 2, then A= d. 
(b) Ifn >d? +1, then k =d. 
F62: (S092, Fi94] Let G be a d-regular digraph, d > 2, with diameter D. 
(a) If G satisfies either of the following conditions, then G is super-,. 


(i) D=2 and n > 3d. 
(ii) D>3 and n> 2d?-14d?-? +.---4+d? 4 2d. 


(b) If G satisfies either of the following conditions, then G is super-x. 


i) D=3andn> 3d? +1. 
(i) 
(ii) D>4andn>2d?-!4+d?-74.---+.d2 +20 +1. 


EXAMPLE 


E6: Figure 4.7.5 shows a regular digraph for which n = 6, A=6 =d= 2, and D=2. 
Since n > d?-! + 2d—2 and n > d? +1, Fact F61 guarantees that it is maximally 
connected («& = A = d). 


Figure 4.7.5: k=A=d=2. 
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REMARKS 


R28: To our knowledge, Fact F57, due to Watkins, was the first result in which the 
order n, the diameter D, and the connectivity « were related (in the undirected case). 
It follows easily from counting the minimum number of vertices involved in « internally 
disjoint u-v paths between a pair of vertices u,v at distance D, as Menger’s theorem 
guarantees. 


R29: Similar reasoning gives a lower bound for the number of edges m of a (di)graph 
with edge-connectivity A, namely, m > AD. However, it is not difficult to realize that 
this is not a very strong result. (The situation seems to depend heavily on the values 
of X and D: for \ = 3 there are constructions giving a lower bound of the order of LD, 
whereas for \ = 4 we have a bound which is “asymptotically optimal,” that is, of the 
order of 4D.) 


R30: If we take into account the connectivity parameters « or A, the Moore bound 
can be refined. Intuitively, a disconnecting set with few vertices or edges is a kind of 
“bottleneck” that prevents the order from being large, as shown in Fact F58. 


R31: Fact F59 is a direct consequence of Fact F58, and Fact F60 is an improvement 
of Fact F58. Notice that if we set « = A in the upper bound on n of Fact F60(b), we 
obtain the Moore bound n(A, D). 


Large Graphs 


Similar results for graphs were derived independently by Esfahanian [Es85], Fiol [Fi93, 
Fi94], and Soneoka et al. [SoNaImPe87]. 


DEFINITION 

D20: The Moore bound for an undirected graph with maximum degree A and 
diameter D is given by n(A, D) =1+A+A(A—1)4+---+A(A-1)P71. 

FACTS 


F63: [SoNalmPe87]| 
(a) Ifn > (6-1) (n(A-—1, D-—2)+1)+A-—1, then X= 6. 


(b) Ifn > (6—1)(A—1)9-! +2, then « =6. 


F64: [So092, Fi94] 


(a) Let D> 2 and 6 > 2. Ifn > 6(n(A—1,D—-2) +1) + (A—1)?"1, then G is 
super-A. 


(b) Let D> 3,5>3 and g>5. Ifn > (6-1) (n(A, D—1)4+ A), then k = 5. 
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4.7.2 Bounded Connectivity 


The techniques used for proving the results of the preceding subsection can often be 
used to derive bounds on the connectivity or edge-connectivity of a (di)graph. In this 
subsection, we provide some examples. 


m-Semigirth 


The following definition generalizes semigirth (Definition D12). 


DEFINITION 


D21: [FaFi89] Let G = (V, F) be a digraph with minimum degree 6 and diameter D, 
and let 7 be an integer, 0 < 7 <6—2. The 1-semigirth of G, denoted ¢,(G), is the 
greatest integer €, between 1 and D such that, for any u,v € V, 


(a) if dist(u,v) < €,, the shortest u-v path is unique and there are at most 7 distinct 
u-v walks of length dist(u,v) + 1. 


(b) if dist(u,v) = £,, there is only one shortest u-v walk. 


FACT 


F65: [FaFi89, MaBaPe04] Let G be a connected digraph with minimum degree 6 > 1, 
diameter D, 7-semigirth ¢, for 0 < 7 < 6 — 2, and with k-iterated line digraph L*(G). 
Then, 

(a) If D < 2@,, then X\ > 6-7. 

(b) If D < 2@, —1, then k > 6-7. 

(c) If D < 2@, —1 and a < |(6—1)/2], then A = 6. 

(d) If D < 2@, — 2, > 2 and a < |(6—1)/2], then & = 6. 

(e) Ifk > D— 2¢,, then \(L*(G)) > 6-2. 


(f) Ifk > D—2é, +1, then «(L*(G)) > 6 —7. 


REMARKS 


R32: Note that 9 corresponds to the ordinary semigirth @. Moreover, for 7 > 1, ; is 
well defined even for a digraph with self-loops. 


R33: The definition of ¢, is restricted to 7 < 6 — 2 since, otherwise, the above results 
become irrelevant. 
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Imbeddings 


Here we cite one of the earliest results relating the connectivity of a graph to a topological 
property of that graph. Other more recent results of this kind can be found in [PluZh98, 
PluZh02]. 


DEFINITION 


D22: A graph G is said to be imbeddable in a given surface S if G can be drawn on 
S without edge crossings. 


FACT 


F66: [Co73] Let G be any graph embeddable in a oriented surface of genus g > 0 
(where the genus is, informally, the number of handles on its surface [see Chapter 7 of 
this Handbook]). Then, « < | (5+ /T+48g)/2|. 


Adjacency Spectrum 


Given a (di)graph G with some associated matrix A, a natural problem is to study how 
much can be said about the structure of G from the spectrum of A. This is a major 
topic in algebraic graph theory, and has been the object of research (see §6.5 of the 
Handbook or the classic textbooks D. Cvetkovi¢é, M. Dragos, and H. Sach [CvDoSa95], 
Biggs [Bi94]). 


DEFINITIONS 


D23: Given a graph G on n vertices, its adjacency matrix A = (duy) is the n x n 
matrix indexed by the vertices of G with entries ay, = 1 if u and v are adjacent and 
Quy = 0 otherwise. 


D24: The toughness t of a graph G is defined as t = ming{|S|/c(G — S)}, where S 
runs over all vertex-cuts of G and c(G—S) denotes the number of components of G— S. 


FACTS 


F67: [A195, Br95] Let G be a connected, non-complete d-regular graph and let be 
the maximum of the absolute values of the eigenvalues of G distinct from d. Then, 
t > d/rA—2. 


NOTATION: Given a graph G, let Dg denote the maximum distance between vertex 
subsets of G with two vertices. (This parameter is a particular case of the so-called 
conditional diameter, introduced in [BaCaFaFi96].) 


F68: [FiGaYe97]| Let G be a d-regular graph with D2 > 1 and distinct eigenvalues (of 
its adjacency matrix A) A9(= d) > A, > +++ > Ar. Let P(x) := 2(a — d,)/(A1 — Ar) — 1. 


Then, «(G) > min {a, Bott | \. 
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REMARKS 


R34: Besides Fact F67, Brouwer [Br96] gave some other interesting examples of results 
about the connectivity of a graph G in terms of its spectrum. 


R35: For other results concerning the toughness of a graph, mainly used in the study 
of vulnerability of network topologies [BoHaKa81], see, for instance, [ChLi02]. 


R36: Notice that, from Fact F16(b), if D2 = 1 then G is maximally connected. 
Otherwise, Fact F68 applies. 


Laplacian Spectrum 


DEFINITION 


D25: Given a graph G, its Laplacian matrix L is defined as L = D— A, where D is 
the diagonal matrix of the vertex degrees and A is the adjacency matrix of G (see, for 
instance [Bi94]). The Laplacian eigenvalues of G are the eigenvalues of its Laplacian 
matrix. 


TERMINOLOGY: The second smallest Laplacian eigenvalue, 6,, usually denoted by a = 
a(G), is called the algebraic connectivity of G because it has some properties which 
are similar to those satisfied by the connectivity k. 


FACTS 


F69: Since the Laplacian matrix L is positive semidefinite, its eigenvalues are all 
nonnegative, with the first one equal to zero. If G is d-regular with (distinct) eigenvalues 
Ao(= d) > »A1 > ++: > -, then its Laplacian eigenvalues are 09,61,...,0,, where 
6; =d—-;,i= 1, Qos 


F70: [Fi73] Let G be a graph with second smallest Laplacian eigenvalue a. 
(a) « >a> 0, and a = 0 if and only if G is not connected. 
(b) For any spanning subgraph H of G we have a(H) < a(G). 
(c) For any vertex subset U of G we have a(G — U) > a(G) — |U|. 


F71: Let G be a d-regular graph with n vertices, Dp > 1, and Laplacian eigenvalues 


Bo(= 0) <0) < 02 <+++ < Op. Hd < MO HEAD then «= d. 


REMARK 


R37: Fact F71 is just a consequence of Fact F68 in terms of the Laplacian eigenvalues. 
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4.7.3. Symmetry and Regularity 


Boundaries, Fragments, and Atoms 


The concepts of fragment and atom are very useful in the study of connectivity, both 
in the undirected and the directed case, and, in particular, for (di)graphs with strong 
symmetries. For graphs, the concept of an atom was introduced independently by Mader 
[Ma70] and Watkins [Wa70]. The notion of an atom for digraphs was introduced by 
Chaty [Ch76] and first used extensively by Hamidoune [Ha77, Ha80, Ha81]. 

Because of the close relationship between a graph G and its corresponding symmetric 
digraph G*, we only give the definitions for digraphs. (For undirected graphs, the 
corresponding definitions are unsigned.) 


DEFINITIONS 


D26: The positive boundary of a vertex subset F in a digraph G, denoted O* F, is 
the set of vertices that are adjacent from F’, and the negative boundary, O- F, is the 
set of vertices adjacent to F. 


D27: The positive edge-boundary and the negative edge-boundary, denoted wt F 
and w” F, respectively, are given by 


wtF ={(u,v) € BE: ué F and v€ V — F}; 
w F={(u,v)€ E:ueV—F and ve F}. 


D28: Let G be a strongly connected digraph with connectivity «. A vertex subset F' is 
a positive fragment of G if |O7F| =« and V —(FUOtF) £9, and F is a negative 
fragment if |O-F| =« and V —-(0-FUF) 490. 


D29: Let G be a digraph with edge-connectivity 4. A vertex subset F is a positive 
a-fragment of G if |wt F| = \, and F is a negative a-fragment if |w~ F| = 2. 


D30: A vertex u of a positive [negative] a-fragment F is called interior if none of the 
edges adjacent from [to] u belongs to wt F |[w7 F). 


D31: An atom is a (positive or negative) fragment of minimum cardinality. 


EXAMPLE 


E7: For the digraph of Figure 4.7.6, « = 2 and F is a positive (respectively, negative) 
fragment with positive (respectively, negative) boundary {u,v} (respectively, {z,t}). 
Analogously, wt F = {(2,v), (y,u)} and w F = {(z,2),(t,y)}. In this digraph, each 
single vertex is an atom. 


FACT 


F72: If FUO*TF 4V [FUO F#V\], then 0tF [0- F] is a vertex-cut of G. Similarly, 
if F is a proper (nonempty) subset of V, then wtF [w7 F] is an edge-cut. Using these 
concepts, we have the following alternative definitions of the connectivity parameters: 


K=min{|JOtF|: FCV,FUOtTFAV or |F|/=1} 


A=min{|wtF|: F is a nonempty, proper subset of V} 
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Figure 4.7.6: F is a fragment. 


Fragments and Atoms in Undirected Graphs 


FACTS 
F73: [Wa70, Ma71la] In a connected graph, any two distinct atoms are disjoint. 


F74: [Ma7la] Let G be a graph with order n and connectivity «. Let F, and F2 be 
distinct minimal fragments of G, with at most n — 3«/2 vertices. Then FM Fo = 0. 


F75: [{Ma7la] Let G be a graph with connectivity «. If T is a disconnecting set with 
& vertices and A is an atom, then either AC T or ANT = 9. 


REMARKS 


R38: To quote a personal communication from Watkins [Wa02]: “It is an amazing 
coincidence that Prof. Mader and I not only conceived of the notion of ‘atom’ indepen- 
dently and simultaneously, but we also accorded this notion almost identical names.” In 
fact, Watkins [Wa70] used the term “atomic part,” while Mader [Ma70] used the term 
“kleinstes Glie” (that is, “smallest member”). Then, in a subsequent paper, Mader 
[Ma71la] mentioned the main result in [Wa70] on atomic parts, and deduced it from his 
results. Moreover, inspired by Watkins’ terminology, he first used the simpler name 
“atom.” 


R39: Fact F73 was proved by Watkins for transitive graphs, whereas the general case 
is due to Mader. 


R40: As already mentioned, the seminal papers on atoms are those of Mader [Ma70]| 
and Watkins [Wa70]. Notice that Fact F75 is a generalization of Fact F73, and it is 
considered as the more important property of an atom. 


R41: Results on atoms and the connectivity of infinite graphs can be found in [JuWa77| 
and [Ha89]. 
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Fragments and Atoms in Digraphs 


The results above can be seen as consequences of the corresponding directed versions, 
which are due to Hamidoune. 


FACTS 


F76: [Ha77] Let G be a connected digraph with a positive (negative) atom A and a 
positive (negative) fragment F. Then, either A C F or AN F = 9. In particular, two 
distinct positive (negative) atoms are disjoint. 


F77: [Ha80] If G is a connected digraph with A < 6* (A < 67), then every positive 
(negative) a-fragment contains an interior vertex. 


REMARKS 


R42: Contrary to the case of graphs, where the presence of an atom is always assured, 
a digraph does not necessarily have an atom with a prescribed sign. 


R43: Fact F77 implies Jolivet’s theorem (Fact F22). 


Graphs with Symmetry 


Graphs with high symmetry often have “good” properties, and their study has special 
relevance to other areas of mathematics. In particular, the results here show that, for 
connected graphs, high symmetry goes hand in hand with high connectivity. Graph 
automorphisms and symmetry are discussed in §6.1 and 86.2. 


DEFINITION 


D32: A (di)graph G is vertex-transitive (or vertex-symmetric) if for any vertices 
u,v there is an automorphism of G which maps uw into v. Similarly, G is called edge- 
transitive (or edge-symmetric) if for any (possibly oriented) edges uv, wz there is 
an automorphism of G that maps wv into wz. 


FACTS 


F78: [Ma70, Ma71b] Let G be a vertex-symmetric connected graph with degree d > 3. 
Then, A = d, « > 2|d/3| +2 if d> 3, and « = 3 otherwise. Furthermore, if G does not 
contain Ky4, then « = d. 


F79: [Ma70] Let G be an edge-symmetric connected graph with degree d. Then 
K=A=d. 


F80: [Ma70, Wa70] Let G be a vertex-transitive graph with an atom A. Then the 
subgraph G(A) induced on A is also vertex-transitive. Moreover, the set of atoms of G 
constitutes a partition of V(G). 


F81: [Ha77] Let G be a vertex-transitive digraph with a positive (negative) atom A. 
Then, the induced subdigraph G(A) is also vertex-transitive. Furthermore, the set of 
positive (negative) atoms of G constitutes a partition of V(G). 
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F82: [Ha81] Let G be a vertex-symmetric strongly connected digraph with (constant) 
outdegree d*. Then \ = d* and x > $d*. Moreover, if G is an oriented graph, then 
kK > 2dt. 

= 3 


REMARKS 


R44: The inequality in Fact F78, which is best possible, is a consequence of Fact F73 
and, ford #2 mod 3, it is an improvement of a result of Watkins [Wa70], who showed 
that « > (2/3)d. 


R45: From Facts F80 and F81, the order of a (positive or negative) atom of G divides 
the order of G. Consequently, every connected vertex-transitive (di)graph with a prime 
number of vertices is maximally connected (K = A = 0). In fact, it is known that such 
(di)graphs must be Cayley (di)graphs of cyclic groups. 


R46: By Fact F76, Hamidoune [Ha77] proved Kameda’s result stating that every 
minimal k-connected digraph has one vertex of out-degree or in-degree k [Ka74], and 
Hamidoune also proved that every edge-transitive digraph is maximally connected. 


Cayley Graphs 


The Cayley graphs are among the most interesting vertex-symmetric (di)graphs, mainly 
because of their relationship with group theory (see §6.1 and §6.2). In particular, the 
study of the connectivity of Cayley graphs has striking connections with some key results 
in additive number theory, such as the well-known Cauchy—Davenport theorem: If p is 
a prime number and A, B are two nonempty subsets of the cyclic group Zp, then either 
A+B=Z, or |A+B|>|A|4+ |B] —-1. 


DEFINITIONS 


D33: Let I be a finite group with identity element e and generating set S C T — {e}. 
The Cayley digraph G = (T, S) has vertices labeled with the elements of I’, and edges 
of the form (u,ug) where g € 9. In particular, when S~' = S$ (where S~! = {271 : ave 
S}) we obtain a symmetric Cayley digraph or, simply, a Cayley graph. 


D34: IfT is a cyclic group, then the Cayley graph is called a circulant graph. 


D35: A generating set S of a group [ is called minimal when any proper subset 
S’ C S does not generate I. 


D36: The symmetric group on n elements, denoted %,,, is the group of all permu- 
tations of the set {1,2,...,n}. 


D37: Let H be a subgroup of a group G, and let x € G. Then the set eH = {xh|h € H} 
is a left coset of G with respect to H. 
FACTS 


F83: [Im79] Let S be a generating set of the symmetric group T = ©, with n > 5, 
such that Sx! = S for every x € T. Then, the Cayley digraph (I, S) is maximally 
connected (that is, « = |.S|). 
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F84: [Ha84] Let I be a finite group with identity e and generating set S. Let A bea 
positive (respectively, negative) atom of ([, S) containing e. Then A is the subgroup of 
I’ generated by $1 A, and the positive (respectively, negative) atoms of (I, S) are the 
left cosets of with respect to A. 


F85: [Ha84] Let [ be a finite group with a minimal generating set S. Let S’ c S71. 
Then, the Cayley digraph (TS US’) is maximally connected. 


F86: [HaSe96] Let [ be an Abelian group of order n and let S be a generating subset 
of T such that |S U {0}| <n-—1. Let D be the diameter of G = ([,S). Then there is a 
vertex-cut of size less than (4n In(n/2))/D whose deletion separates G into a negative 
fragment B and a positive fragment B such that |B] = |B|. Moreover, G can be 
separated into two equal parts of size |B| by deleting less than (8e/|S|)n!~!/!5! In(n/2) 
vertices. 


REMARKS 
R47: Fact F84, due to Hamidoune, provides a very short proof of Fact F83. 


R48: For the case of Cayley graphs, Fact F85 was previously proved by Godsil 
[Go81]. Subsequently, Akers and Krishnamurthy [AkKr87], Hamidoune, Lladé, and 
Serra [HaLlSe92], and Alspach [Als92] improved these results by considering Cayley 
(di)graphs with a hierarchical generating set (that is, when the group generated by 
the first k generators is a proper subgroup of the group generated by the first k + 1 for 
each k). 


Circulant Graphs 


Because of their circular symmetry, circulant graphs have been proposed as good models 
for local area network topologies, where they are called loop networks. In this context, 
other good topologies are provided by Cayley graphs of Abelian groups, also called loop 
networks (see [BeCoHs95], [BoTi84]). 


FACTS 


F87: [Ha84] Let [ be the cyclic group Z,. Let S be the strictly increasing sequence 
of s integers (1 =)b1 < by < +--+ < bs(< n) such that bj41 — b; > min{2,b; — b;-1} for 
4 = 2,3,...,s—1. Then the circulant digraph (I, S') is maximally connected (kK = s). 


F88: [Ha84] Let [ be the cyclic group Z,. Let S be the strictly increasing sequence 
of s integers (1 =)b1 < by < ++: < bs(< n/2) such that bj41 — b; > min{2,b; — b;-1} 
for 7 = 2,3,...,s—1, and let S’ C —S, where —S denotes the set of (additive) inverses 
of the elements in S. Then the circulant digraph (I, .S U S’) is maximally connected 
(«x =|SUS"). 


F89: The Cauchy—Davenport theorem is equivalent to stating that, for any generating 
set S C Zp, p prime, the Cayley digraph (Z,,5'}) is maximally connected (that is, 
k= |S|). 
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REMARKS 


R49: The case S’ = —S in Fact F88 (that is, for circulant graphs) was proved in 
[BoFe70] using the “convexity conditions” bj41 — b; > b; — bj-1 (see also [BoTi84]). 


R50: Fact F89, noted by Hamidoune, is a bridge between additive number theory 
and graph theory. (For a comprehensive survey on the subject, we refer the reader to 
[Ha96].) 


Distance-Regular Graphs 


The concept of distance-regularity was introduced by Biggs [Bi71] in the early 1970s, by 
changing a symmetry-type requirement, that of distance-transitivity, to a regularity- 
type condition concerning the cardinality of some vertex subsets. Distance-regular 
graphs have important connections with other branches of mathematics, such as ge- 
ometry, coding theory, and group theory, as well as with other areas of graph theory. In 
our context, their high regularity seems also to induce a high degree of connectedness. 


DEFINITIONS 


D38: Let G be a regular graph with diameter D and let k be an integer between 1 
and D. Graph G is said to be distance-regular if, for any two vertices u and v with 
dist(u,v) = k, the numbers cx, ax, and by of vertices that are adjacent to v and whose 
distance from u is k —1, k, and k + 1, respectively, depend only on k. 


D39: An n-vertex k-regular graph G is called (n, k; a,c)-strongly-regular if any two 
adjacent vertices have a common neighbors and any two non-adjacent vertices have c 
common neighbors. 


FACTS 


F90: Let G be a connected graph. Then G is strongly-regular if and only if G is 
distance-regular of diameter two. 


F91: Every strongly regular graph is maximally edge-connected. 

F92: [BrMe85] Every strongly regular graph is maximally connected and super-r. 
F93: [BrKo09] Every distance-regular graph is maximally connected. If the degree is 
at least 3, it is also super-k. 

REMARKS 

R51: Fact F91 is a consequence of Facts F3 and F90. 


R52: Fact F93 was a long standing conjecture of Brouwer [Br96]. The result was 
previously proved for some families of distance-regular graphs, such as the so-called odd 
graphs Oj, (having the k-subsets of a (2k — 1)-set as its vertices and adjacencies defined 
by void intersection); see [Gh92]. 


R53: Fact F93 implies a previous conjecture of Godsil [Go81], stated in the context 
of association schemes, that every distance-regular graph is maximally edge-connected. 
In fact, this last result was proved by Brouwer and Haemers in [BrHa05]. 
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4.7.4 Generalizations of Connectivity Parameters 


The standard connectivity parameters have been generalized in different ways, giving 
rise to numerous articles; see, for instance, [BaBeLiPi87], [BeOePi02], [Ha83], [Wo73]. 
Here we will consider several examples, some of which have special relevance to the 
study of network vulnerability. 


Conditional Connectivity 
The next two definitions generalize the concept of superconnectivity. 


DEFINITIONS 


D40: Given a graph G = (V, £) and a nonnegative integer s, a vertex subset V’ C V 
is said to be n-trivial if it contains the boundary O(#) of some subgraph H C G with 
s’ vertices, 1 < s’ < s. Similarly, an edge subset E’ C E is said to be s-trivial if it 
contains the edge-boundary w(H) of some subgraph H C G with s’ vertices, 1 < s’ < s. 


D41: The conditional connectivity «, of a graph G is the minimum cardinality of 
a disconnecting set that is not s-trivial. The conditional edge-connectivity , of G 
is the minimum cardinality of a disconnecting edge set that is not s-trivial. 


FACT 


F94: [FaFi89, FiFaEs90, FaFi94] Let G be a graph with minimum degree 6 > 1, 
diameter D and girth g. Let = | +]. 


(a) If D < 2@, then Ao = 6. 

(b) If D < 2€—1, then kp = 6 and A, > 26 — 2. 

(c) If D < 2¢—2, then Ky > 25 —2 and \» > 36-4. 
(d) If D < 2¢—3, then Ky > 35 —4. 


CONJECTURE 
Cl: [FaFi94] 
(a) If D < 2€—s, then A, > (s +1)d — 2s. 
(b) If D < 2@—s—-1, then Kk, > (s+ 1)d — 2s. 


REMARKS 


R54: Harary [Ha83] introduced the general concept of conditional connectivity. In our 
context, the graphs are assumed to be those for which &, and A, are well-defined. 


R55: Note that the conditional connectivities Ko and Ap correspond to the standard 
connectivities « and » (thus, Fact F94 generalizes Fact F37). If «1 > 6, then G is 
super-&, and if A; > 6, then G is super-X. 


R56: The conjecture above was proved to be true for even s (provided that 6 > 
2 and ¢ > (s + 1)/2) [FaFi96b]. Moreover, if s is large enough in comparison with 
the minimum degree 6, further improvements of the sufficient conditions were given in 
[BaCaFaFi97b, Ba99]. 
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Restricted Connectivity 


D42: The restricted edge-connectivity \’ = \'(G), introduced by Esfahanian and 
Hakimi [Es88], is the minimum cardinality over all restricted edge-cuts, i.e., those 
edge-cuts S such that there are no isolated vertices in G — S. 


D43: The minimum edge-degree of G is € = €(G) = min{deg(u) +deg(v) —2 : uv 


D44: A graph is said to be \’-optimal if ’ = €. 


D45: Given a graph G = (V, £), an edge set S C E is called a k-restricted-edge-cut 
if G—S is disconnected and every component of G — Shas at least k vertices. 


D46: The k-restricted-edge-connectivity of G, denoted by \‘*) = \“)(G), is defined 
as the cardinality of a minimum k-restricted-edge-cut. 


D47: The edge-boundary w(F) is called a \\")-cut if |w(F)| = \, and F C V is called 
a k-fragment of G. A minimum k-fragment is called a k-atom, and its cardinality is 
denoted a;,(G) = ag. 


D48: The minimum k-edge-degree of G is &§ = &(G) = min{|w(U)|: 0 AUC 
V(G),|U| = k and G[U] is connected}. 


D49: A graph with k-restricted edge cuts is said to be \“)-optimal if A = €;. 


D50: A graph G is super k-restricted edge connected, or super-\“), if G is \‘*)- 
optimal and the deletion of every \‘*)-cut isolates a component with k vertices; that is, 
if every k-fragment X has cardinality |X| € {k,n —k}. 

REMARKS 


R57: Esfahanian and Hakimi [Es88] showed that each connected graph of order n > 4 
except a star has restricted edge-cuts and satisfies X’ < €. 


R58: The restricted edge-connectivity \’ corresponds to the 2-restricted-edge-connectivity 
A) and also to the conditional connectivity \; defined in Definition D41. 


R59: Furthermore, the k-restricted-edge-connectivity \“) corresponds to the condi- 
tional connectivity Ax-1 given in Definition D41, for any k > 1. 


R60: If G is super-\), then az = k. 


FACTS 


F95: [BaGVMa06] Every graph with girth g and 6 > 2 is super-\ and has X’ = € if its 
diameter D < g — 2. 


F96: [BaGVMa06] Every graph G with girth g odd and 6 > 2 is super-\ and has 
N = € if D=g-—1 and either of the following assertions holds. 


(i) All pairs wu, v of vertices at distance d(u,v) = g—1 are such that neither vertex u 
nor v lies on a cycle of length g. 
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(ii) |N(g—1)/2(u) A N(g—1)/2(v)| = 3 for all pairs u, v of vertices at distance d(u,v) = 
g —1 where Np(u) = {w € V(G) : d(u,w) = h}. 


F97: [BaCeDiGVMa05] Every graph with girth g even and 6 > 2 is super-A and has 
N = € if D=g-—1 and only 6 — 1 vertices are mutually at distance g — 1 apart. 


F98: [BaLiMi08] Every graph G with girth g, minimum degree 6 > 3, and diameter D 
is super-\’ if D < g — 3 or if the diameter of the line graph satisfies D(L(G)) < g — 3. 


F99: [ZhYu05] Let G be a connected graph with order at least 2(6 +1) not isomorphic 
to any Gf ;, where Gf ; is the graph obtained from s copies of A's by adding a new 
vertex u and joining wu to every other vertex. Then for any k < 6+1, G has k-restricted 
edge cuts and A“) < &. 


F100: [ZhYu07] Let G a connected graph on n > 2k vertices. If deg(u) + deg(v) > 
n+ 2k —3, for every pair of nonadjacent vertices u and v, then G is \“)-optimal. 


F101: [BoUeVo02, XuXu02] For k = 2,3, a graph with \“)-cuts has A“) = &, if and 
only if a, =k. 


F102: [BaGMMa09] Let G be a graph with \*)-cuts and such that \\) < &. Then 
G is \)-optimal if a, = k. Moreover, a, = k follows if G is \“)-optimal and either of 
the following conditions holds. 

(i) 6 > 2k-1. 

(ii) 6>k+1landg>k+1. 


F103: [BaGMMa09] Let G be a graph with \)-cuts such that \%) < & and \@+) 
exists. Then G is super-A“) if and only if APT) > &. 


F104: [ZhLil0] Let G be a graph with \“)-cuts, k > 3, girth g > 5, minimum degree 
6 > k, and diameter D. Then G is A*)-optimal if either of the following conditions 
holds. 

(i) D < g—4 when g is even, or D < g — 3 when g is odd; 

(ii) D<g—3 and 6 > 2k—3. 


F105:  [BaGV10] Let G be a graph with A“)-cuts, girth g, minimum degree 5 > 
max{3,k}, and diameter D. Then G is super-A“) if either of the following conditions 
holds. 


(i) D< g—4 when g is even, or D < g— 4 when g is odd and 6 > k +1; 


(ii) the diameter of the line graph is D(L(G)) < g —4. 
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Distance Connectivity 


Here we consider a generalization of the concepts of connectivity and edge-connectivity 
of a (di)graph, introduced in [FiFa94] and [BaCaFi96], which takes into account the 
distance between vertices. 


DEFINITIONS 


D51: Let G = (V,E) bea digraph. Given u,v € V such that (u,v) ¢ F, recall (from 
$4.1) that a set S C V — {u,v} is called a (u|v)-set if there is no u-v path in G— S, 
and «(u|v) is the minimum cardinality of a (u|v)-set. Similarly, a given edge-set T C E 
is called a (u|v)-edge-set for some u,v € V if there is no u-v path in G—T, and A(ulv) 
is the minimum cardinality of a (u|v)-edge-set. 


D52: Let G = (V, £) bea digraph with diameter D. Given t, 1 <t < D, the t-distance 
connectivity of G, denoted by «(t;G) = K(t), is defined as «(t) = min{xK(ulv) : u,v € 
V, dist(u,v) > t} ift > 2, and «(1) = «, where « is the standard connectivity of G. 
Analogously, the t-distance edge-connectivity is \(t;G) = A(t) = min{A(ulv) : u,v € 
V, dist(u,v) >t} fort > 1. 


FACTS 

F106: 
(a) = w(1) = «(2) < K(3) 
(b) A= A(1) < AQ) S++ SAD). 


F107: Let G be a digraph with minimum degree 6 > 1 and semigirth @ (see Defini- 
tion D12). 


(a) If \< 6 then D > 2€+1 and A= A(20 + 1). 
(b) If« <6 then D > 2¢ and k = K(20). 
F108: 
(a) \ = 0 if and only if D < 2¢ or A(204-1) > 6. 
(b) « = 0 if and only if D < 20-1 or K(20) > 0. 
F109: Every digraph with distance connectivity (3) > 6 has maximum edge-connectivity. 


F110: Let G be an undirected graph with associated symmetric digraph G*. Since a 
minimum t-distance disconnecting set of G* cannot contain digons, K(t;G*) = «(t;G) 
and A(t;G*) = A(t; G). 


F111: Let G be an undirected graph with girth g and 6 > 1. 


D>gand A= X(q), g odd, 


(a) If A < 6 then 
D>g-—1andrX=X(g-1), g even. 


D>g-landkt=x«(g-1), g odd, 


(b) If Kk <6 then 
D>g-2andk=x(g-—2), g even. 
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F112: 
D>qgq-1 > d 
(a) R= 5atund only 9 1 Aw) Bond, 
D>g-—2orX(g—1)>6, g even. 
D>g-2 —1)> d 
eee 2 Oe ea 
D>g-3o0rk(g—2)>6, g even. 


F113: Any graph with distance connectivity \(3) > 6 has maximum edge-connectivity. 


REMARKS 


R61: In Fact F108, since «(t) and X(t) are defined only for t < D, the two sufficient 
conditions on the diameter and the distance connectivity are complementary to one 
another. 


R62: Since the semigirth of any digraph is at least one, Fact F107(a) implies Fact F109, 
which complements Jolivet’s result (Fact F22). 


R63: Fact F111 follows from Fact F107 by considering Fact F110 and ¢(G*) = |(g — 
1)/2|. 


High Distance Connectivity 


DEFINITIONS 


D53: Given a vertex u of a digraph G, the out- and in-eccentricity of u are ecct(u) = 
maxyey {dist(u,v)} and ecc” (u) = maxyey {dist(v, u)}, respectively. 

D54: For any integer t, 1 < t < D, the minimum t-degree of a digraph G is 
6(t) = min{dt(t),6~ (t)}, where 6+ (t) = minyey{degt(u) : ecct(u) > t} and 6 (t) = 
minyev {deg (uw) : ecc” (u) > th. 


D55: A connected digraph G with diameter D is said to be s-geodetic, for some 
1<s< _D, if any two vertices of G are joined by at most one path of length less than 
or equal to s. If s = D, the digraph is called strongly geodetic (see [BoKoZn68}], 
[P1Zn74]). 

FACTS 

F114; 6 =6(1) =---=d0(r) < d(r4+1) <--- < d(D). 

F115: For any t, 1 <t< D, K(t) < A(t) < d(t). 


TERMINOLOGY: A digraph G is called maximally t-distance connected when «(t) = 
A(t) = 6(t), and maximally t-distance edge connected when X(t) = d(t). 


F116: Ifa digraph G is maximally connected, then G is maximally t-distance connected 
for any l <t<r. 
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F117: [BaCaFaFi97a] Let G be an s-geodetic digraph. Then, 
(a) A(t) = min{d(t), A(20+ 1)}, for any t < 2541. 
(b) «(t) = min{d(t), «(2€)}, for any t < 2s. 
F118: [BaCaFaFi97a] Let G be an s-geodetic digraph. 
(a) G is maximally t-distance connected for any t < 2s if D < 2@-1. 


(b) G is maximally t-distance edge connected for any t < 2s+1 if D < 2¢. 


F119: Let G be a graph with girth g and diameter D. Then, for any 1 <t < D, 


D<g-1, godd, 


(a) G is maximally t-distance edge connected if 
D<g-2, geven. 


D<g-2, godd, 


(b) G is maximally t-distance connected if 
D<g-3, geven. 


Maximal Connectivity 
Instead of looking for minimum disconnecting sets, we can consider those (minimal) 


disconnecting sets with maximum cardinalities. This leads to considering the following 
connectivity parameters. 


NOTATION: Denote by kmax and Amax the maximum cardinality of a minimal discon- 
necting (vertex) set and a minimal disconnecting edge set, respectively. 


FACTS 
F120: kKmax > K and Amax > ». 


F121: [PeLaHe86] For any non-trivial graph G with order n and maximum degree 
A#n-—1 we have kmax < Amax- Furthermore, if G is 2-connected, then Amax > A. 


F122: [PeLaHe86] Let G be an n-vertex graph with minimum degree 6. 
(a) If 6 > |[n/2], then Amax > 0. 
(b) If 6 > |[(n+7%)/2] for some i with 1 <i < n/2, then Amax > i (mn — 7+ 2)/2|. 


(c) If 6 > |(n+%)/2] for some i with n/2 <i <n—2, then Amax > [n/2]-|(¢+1)/2]. 
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Hamiltonian Connectivity 


DEFINITIONS 


D56: A graph G is hamiltonian connected if between any pair of vertices u,v there 
is a hamiltonian u-v path in G. 


D57: A graph G k-leaf-connected if |V(G)| > k and for each subset S of V(G) with 
k, = |S| there exists a spanning tree T with precisely S as the set of endvertices (vertices 
of degree 1). 


FACTS 


F123: [(GuWa86] Let wu and v be non-adjacent vertices of G with d(u) + d(v) > 
|V(G)| +k-1. If G+ uv is k-leaf-connected, then G is k-leaf-connected. 


F124: [GuWa86] For all natural numbers n,k, 2 < k < n—2, there are k-leaf-connected 
graphs with [(& + 1)n/2] edges (the minimum number of edges that a k-leaf-connected 
graph on n vertices can have). 


REMARK 


R64: The generalization of the concept of hamiltonian connectivity (Definition D57) 
is due to Murty. Notice that G is hamiltonian-connected if and only if G is 2-leaf- 
connected. 
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Glossary for Chapter 4 


A-B path: see path. 

A|B set: see separating set. 

adjacency matrix — of a graph (or digraph) G: the matrix A = (a,,,) indexed by 
the vertices of G with entries a,, = 1 if wv is an edge (or arc) of G and ay, = 0 
otherwise. 

algebraic connectivity — of a graph G: the smallest non-zero eigenvalue of the Lapla- 
cian matrix of G. 

antisymmetric digraph (or oriented graph): a digraph such that between any pair 
of vertices there is at most one directed edge. 

approximate (or approximation) algorithm: an algorithm that typically makes 
use of heuristics in reducing its computation but produces solutions that are not 
necessarily optimal. 

arborescence: synonym for rooted tree. 

associated symmetric digraph: see digraph. 

asymmetric TSP (ATSP): see TSP. 

atom — of a graph G: each minimum component obtained by removing a minimum 
disconnecting set from G. 

balance condition — for a mixed graph G: for every S C V(G), the difference between 
the number of arcs from S' to V(G) —S' and the number of arcs from V(G) — S' to S is 
no greater than the number of undirected edges joining vertices in S and V(G) — S. 
deBruijn graph of order k: a directed graph with 2" vertices, each labeled with a 
unique k-bit string; vertex a is joined to vertex b by an arc if bitstring b is obtainable 
from bitstring a by either a cycle shift or a deBruijn shift; each arc is labeled by the 
first bit of the vertex at which it originates, followed by the label of the vertex at 
which it terminates. 

balanced digraph: see digraph. 

balanced orientation — of a graph (or mixed graph): an assignment of a direction 
to each edge of the graph (or each undirected edge of the mixed graph) so that the 
resulting digraph is balanced. 

balanced vertex — in a digraph: a vertex whose indegree and outdegree are equal. 

bipartite degree closure: bipartite graph of order 2n obtained by recursively joining 
pairs of non-adjacent vertices x € X and y € Y whose degree sum is at least n+ 1, 
until no such pair remains. 

bipartite graph: a graph G with two independent vertex subsets that partition V(G). 

bipartite index — of a graph: the smallest number of vertices whose removal leaves a 
bipartite graph. 

block — in a connected graph: a maximal 2-connected subgraph. 

boundary — of a vertex subset U: the set of vertices which are at distance one from U. 

branching: synonym for rooted tree. 
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bridge — of a connected graph G: an edge whose deletion disconnects G; synonym for 
cut-edge. 
(k,g)-cage: a regular graph of degree k and girth g with the minimum number of 
vertices. 
Cayley (di)graph — of a group I’ with generating set S: a (di)graph whose vertices 
are identified with the elements of I, and there is an edge uv when u~tv ET. 
Chinese Postman Problem: finding a postman tour of minimum length in a graph 
where all edges are undirected; see postman. 
claw-free closure: the graph obtained by repeatedly applying the local completion 
until it is no longer possible to do. 
clique number —- of a graph G: the maximum number of vertices in a complete sub- 
graph of G. 
component, — of a graph: a maximal connected subgraph. 
component, — of a digraph: a maximal strongly connected subdigraph. 
connected graph: a graph in which there exists a walk between any pair of vertices. 
__, critically k-: a graph G such that «(G) > & but, for each vertex v € V, 
K(G-—v) <k. 
__, hamiltonian: containing a spanning path between any two vertices. 
__, k-: a graph with connectivity « > k > 1. 
__, k-edge-: a graph with edge-connectivity \ > k > 1. 
contraction: an operation involving the identification (amalgamation) of vertices. 
__, edge: given an edge wv, identification of its endpoints u and v (keeping the old 
adjacencies but removing the self-loop from u = v to itself). 
___, subgraph: identification of all the vertices of a given subgraph H by a succession 
of elementary contractions of the edges of H. 
k-contractible edge: an edge of a k-connected graph whose contraction results in a 
k-connected graph. 
k-contractible subgraph: a subgraph of a k-connected graph whose contraction re- 
sults in a k-connected graph. 
covering walk (or postman tour): — in an arbitrary graph G: a closed walk containing 
every edge of G. 
critically k-connected graph: see connected graph. 
cut-edge: synonym for bridge. 
cut-vertex — of a connected graph: a vertex whose deletion disconnects it. 
CVRP tour — in a weighted directed or undirected complete graph with vertex set 
{0,1,...,n}, with a demand d; > 0 for 1 = 1,2,...,n, and two parameters Q and 


k: a collection of k cycles C),C2,...,Cz, which contain all the vertices, pairwise 
intersect only in vertex 0, and satisfy S> d; <Q for each j =1,2,...,k. 
tEV(C;) 


cycle cover — of a graph G: a family S of cycles of G such that every edge of G belongs 
to at least one element of S. 

cycle decomposition — of a graph (or digraph) G: a partition of the edge-set (or 
arc-set) of G such that each partition set forms a cycle (or directed cycle). 

cycle double cover (CDC): a cycle cover S such that every edge of G belongs to 
exactly two elements of S. 

cycle extendable: any cycle C of length m < |V(G)| can be extended to a cycle of 
length m+ 1 containing all of V(C). 

__, fully: a graph that is cycle extendable and having any vertex on a triangle. 
cycle packing — in a graph G: a set of edge disjoint cycles in G. 
cycle shift — a,a2...a, —> b,b2...b,: a left shift such that 6, = ay. 
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deBruijn shift — aja2...a, —> b,b2...b%: a left shift such that b, 4 a1. bipancyclic: 
a bipartite graph of order 2n containing cycles of all even lengths from 4 to 2n. 

k-degree closure: graph obtained by recursively joining pairs of non-adjacent vertices 
whose degree sum is at least k, until no such pair remains. 

degree sequence — of a graph G: the degrees of the vertices of G ordered in non- 
increasing (or non-decreasing) order. 

detachment operation: see $4.2, Definition D??. 

detachment — of a graph G: a graph that results from a sequence of detachment 
operations performed at each of the vertices of some vertex subset W C V(G); used 
to transform or produce eulerian tours. 

diameter — of a (di)graph G: the maximum distance between vertices of G. 

digraph: a graph all of whose edges are directed; a directed graph. 


associated symmetric — of a graph G: the digraph G obtained from G by 
replacing each edge wv by the two directed edges (u,v) and (v,u) forming a 
digon. 
balanced: a digraph whose vertices are all balanced; in §4.3, this is called a 
symmetric digraph. 
connected: a digraph whose underlying graph is connected; also called weakly 
connected. 
critically k-connected: a digraph G such that «(G) > k but, for each vertex 
v EV, K(G—v) <k. 
strongly connected: a digraph with a u—v walk for any pair of vertices u,v; 
also called a strong digraph. 
k-(strongly) connected: a digraph with connectivity at least k; also called 
k-strong or k-strongly connected. 
symmetric): a digraph such that between each pair of distinct vertices, either 
both (oppositely directed) arcs exist or neither does. 
, symmetric: (used in §4.3) a digraph such that indegree equals outdegree for 
each vertex. 
disconnecting edge-set, — of a graph G: a subset of edges whose removal from G 
results in a non-connected graph. 
disconnecting edge-set2 — of a digraph G: a subset of arcs whose removal from G 
results in a non-strongly-connected digraph. 
disconnecting (vertex-)set — of a graph G: a subset of vertices whose removal from 
G results in a non-connected graph. 
__, (ulv)- — in a graph G: a disconnecting (vertex-)set whose removal from G leaves 
u and v in different components. 
disconnecting (vertex-)set — of a digraph G: a subset of vertices whose removal from 
G results in a non-strongly-connected digraph. 
t-distance connectivity — of a graph (or digraph) G: the minimum cardinality of a 
(u|v)-set with dist(u,v) > t. 
distance matrix of an instance of TSP: the matrix D = [d;,;], where d;; is the weight 
of the edge between vertices 7 and 7; analogously defined for digraphs. 
distance, — between vertices u,v in a graph (or digraph) G: the length of a shortest 
path joining wu to v. 
distancez — between vertex sets U,W in a graph G: the minimum of the distances 
between vertices of U and W. 
distance-regular graph: a graph whose number of /-walks between any pair of vertices 
u,v only depends on dist(u,v) and 1 > 0. 
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dominating circuit: A circuit C such that every edge of G is incident to a vertex of 
C. 
double tracing: a closed walk that traverses every edge exactly twice. 

__, bidirectional: a double tracing that uses every edge once in each of its two 
directions. 

__, strong: a double-tracing that is both bidirectional and retract-free. 

edge addition: given two non-adjacent vertices u,v of a graph G = (V,E), G+e= 
(V, EU {uv}) is the graph obtained from G by addition of the edge e = wv. 

edge contraction: see contraction. 

edge-boundary — of a vertex subset U: the set of edges having exactly one endpoint 
in U. 

edge-connectivity, — of a graph: the minimum number of edges whose removal leaves 
a connected graph; denoted 2 (or Ke). 

edge-connectivity2 — of a non-trivial digraph: the minimum number of arcs whose 
removal leaves a non-strongly connected graph; denoted A or Ke. 

edge-cut: a disconnecting edge-set. 

edge-disjoint paths: paths that have no edge in common. 

edge-symmetric graph: a graph whose automorphism group acts transitively on its 
edge set. 

Euclidean TSP: see TSP. 

eulerian graph (or digraph or mixed graph): a graph that has an eulerian tour. 

eulerian tour — in a graph (or digraph): a closed walk that uses each edge (or arc) 
exactly once. An eulerian tour in a mixed graph is a closed walk that uses each edge 
and each arc exactly once. 

even graph: an undirected graph whose vertices all have even degree. 

exact algorithm: an algorithm that solves a certain optimization problem to optimal- 
ity. 

{F\, Fo,..., F,}+free: containing no induced subgraph isomorphic to any F;, 1 <i <k. 

1-factor — of a graph G: a 1-regular subgraph H such that V(H) = V(G) (ie., spans 
G); thus, a 1-factor is a spanning matching. 

2-factor of a graph G: the (vertex-disjoint) union of cycles of G that covers V(G). 

fragment — of a graph G: a component obtained from G by removing a minimum 
disconnecting set. 

generalized p-cycle: a graph (or digraph) whose vertex set can be partitioned into p 
subsets, say Vo, Vi,..., Vp—1, in such a way that every edge wv is of the form u € Vj, 
v € Vi41 (arithmetic modulo p). 

generalized TSP: see TSP. 

s-geodetic digraph: see §4.7, Definition D55. 

girth — of a graph G: the length of a shortest cycle. 

half degree — of a vertex u in a digraph G: the indegree or outdegree of u in G. 

hamiltonian connected: containing a spanning path between any two vertices. 

hamiltonian cycle: a spanning cycle. 

hamiltonian decomposition: a partitioning of the edge set of G into hamiltonian 
cycles if G is 2d-regular or hamiltonian cycles and a perfect matching if G is (2d+1)- 
regular. 

hamiltonian graph: a graph containing a hamiltonian cycle. 

__, k-ordered: a hamiltonian graph such that for every ordered sequence of k vertices 
there is a hamiltonian cycle that encounters the vertices of the sequence in the 
given order. 

__, uniquely: containing exactly one hamiltonian cycle. 
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imbedding — of a graph G on a surface S: a drawing of G on S with no crossing edges. 

in-arcs — of a partition-cut E(X,X): the subset of arcs whose head is in X; denoted 
E~(X,X). 

incidence set — of a vertex v: the partition-cut E(X,X), where X = {v}; denoted E, 
(the out-arcs and in-arcs of F,, are denoted E;* and E;, , respectively). 

incidence-partition system: see §4.2, Definition D??. 

indegree — of a vertex u in a digraph G: the number of vertices in G adjacent to u. 

__, t-: the indegree of a vertex v with in-eccentricity at least t. 

independence number: cardinality of a largest set of independent vertices. 

independent set — of vertices: a subset of pairwise non-adjacent vertices. 

independent vertices: vertices with no edges between them. 

induced subgraph (or subdigraph) on a set U C V: the maximal subgraph (or 
subdigraph) of G with vertex set U; denoted G(U) or G[U]. 

in-eccentricity — of a vertex u in a digraph G: the maximum of the distances dist(v, u) 
for all vertices v in G. 

interior vertex — of a fragment F’: a vertex with no adjacent vertices outside of F’. 

internally-disjoint uw—v paths: u—v paths that have pairwise exactly vertices u and 
v in common. 

in-tree: a rooted tree with all arcs reversed. 

k-connected graph (or digraph): a graph (or digraph) with connectivity at least k. 

k-edge-connected graph (or digraph): a graph (or digraph) with edge-connectivity 
at least k. 


k-paths problem: given 2k distinct vertices u,, u2,--- , uz and v1, v2,°-: , Uk ina graph 
G, determining whether there exist k mutually edge-disjoint paths P,, P2,--- , Py in 
G such that P; connects u; and v; fori = 1,2,...,k. 


k-system that dominates: a collection of k edge-disjoint circuits and stars (with at 
least 3 endvertices), such that each edge of G is either contained in one of the circuits 
or stars, or is adjacent to one of the circuits. 

kappa transformations: various combinations of splitting, splicing, and reversing 
closed trails; they form the basis for constructing eulerian tours and for transforming 
one tour into another; see §4.2.6. 

Laplacian matrix — of a graph G: the matrix L = (l,,) with entries l,,, = deg(u) — 1 
if u,v are adjacent vertices of G, and ly, = 0 otherwise. 

line digraph — of a digraph G: the digraph whose vertices are the directed edges of G 
and vertex (u,v) is adjacent to vertex (v, w). 

__, k-iterated — of a digraph G: the digraph obtained by applying recursively k 
times the line digraph operation on G. 

line graph — of a graph G: the graph whose vertices can be put into 1—1 correspondence 
with the edges of G in such a way that two vertices of L(G) are adjacent if and only 
if the corresponding edges of G are incident. 

k-linked graph: a graph that has at least 2k vertices, and for every sequence of 
2k different vertices, u,,U2,...,Uk, U1, V2;---, UK, there exists a uj—v; path P;, i = 
1,2,...,k, such that the k paths are vertex-disjoint. 

k-parity-linked graph: a graph in which one can find k disjoint paths with prescribed 
endvertices and prescribed parities of the lengths. 

local completion — at a vertex x of a graph G such that the induced subgraph G[N (2)| 
is connected: the graph obtained by replacing G[N()] by a complete subgraph on 
V(N(a)). 

matching: a subset of edges no two of which have a common vertex. 
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__, perfect — in a graph G: a matching such that every vertex in G is incident to 
some edge in the matching. 
maximally connected graph (or digraph): a graph (or digraph) whose connectivity 
equals its minimum degree. 
maximally edge-connected graph (or digraph): a graph (or digraph) whose edge- 
connectivity equals its minimum degree. 
maximum degree - of a (di)graph G: the maximum of the (positive and negative) 
degrees of the vertices of G. 
minimally k-connected graph (or digraph): a graph (or digraph) G such that 
k(G) > k but, for each edge e € E, K(G—e) < k. 
minimally k-edge-connected graph (or digraph): a graph (or digraph) G such 
that A(G) > k, but for each e € EF, A\G-—e) <k. 
minimum (t-)degree — of a (di)graph G: the minimum among all t-(out and in)degrees 
of the vertices of G. 
minimum degree - of a (di)graph G: the minimum of the (out and in) degrees of the 
vertices of G. 
minimum degree — of a (di)graph G: the minimum of the (positive and negative) 
degrees of the vertices of G. 
mixed graph: a graph that has both undirected and directed edges. 
Moore bound — of a graph G: an upper bound for the maximum number of vertices, 
given its maximum degree and its diameter. 
negative a-fragment — of a digraph G: the subset of edges whose negative edge- 
boundary is a minimum disconnecting edge set of G. 
negative atom — of a digraph G: a negative fragment with minimum cardinality. 
negative boundary — of a vertex subset F in a digraph G: the set of vertices in G 
which are at distance one to F. 
negative edge-boundary — of a vertex subset F' in a digraph G: the set of edges in 
G which have only its final vertex in F’. 
negative fragment — of a digraph G: the subset of vertices whose negative boundary 
is a minimum disconnecting set of G. 
neighborhood, — of a vertex x: the set of all vertices adjacent to x. 
neighborhood), — of the set S: vertices adjacent to some vertex in S. 
out-arcs — of a partition-cut E(X,X): the subset of arcs whose tail is in X; denoted 
E*(X,X). 
outdegree — of a vertex u in a digraph: the number of vertices adjacent from wu. 
__, t+: the outdegree of a vertex v with out-eccentricity at least t. 
out-eccentricity — of a vertex u in a digraph G: the maximum of the distances 
dist(u,v) for all vertices v in G. 
out-tree: synonym for rooted tree. 
pancyclic: containing cycles of all lengths from 3 to |V(G)|. 
p-partite graph: a graph whose vertex-set can be partitioned into p independent vertex 
subsets. 
partition-cut — in a graph G = (V, E£) associated with X C V(G): the set of edges in 
G with one endpoint in X and one endpoint in X = V(G) — X; denoted E(X, X). 
path: a simple walk, that is, a walk in which all defining terms are distinct. 
__, A-B: given A, B C V, a u-—v path P such that u is the only vertex of P belonging 
to A and v is the only vertex of P that belongs to B. 
__,u-u —ina graph G: a path in G joining wu to v; analogously defined for digraphs. 
positive a-fragment — of a digraph G: the subset of edges whose positive edge- 
boundary is a minimum disconnecting edge set of G. 
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positive atom — of a digraph G: a positive fragment with minimum cardinality. 
positive boundary — of a vertex subset F in a digraph G: the set of vertices in G that 
are at distance one from F’. 
positive edge-boundary — of a vertex subset F' in a digraph G: the set of edges in G 
which have only its initial vertex in F’. 
positive fragment — of a digraph G: the set of vertices whose positive boundary is a 
minimum disconnecting set of G. 
postman problem: the class of problems of finding a minimum-length (or minimum- 
weight) postman tour in a graph under various conditions. 
, directed: a postman tour of minimum length in a digraph. 
, mixed: a postman tour of minimum length in a mixed graph (with both directed 
and undirected edges). 
, rural: relaxation of the basic postman version (undirected, directed, mixed) 
where only a subset of edges has to be included at least once. 
, Stacker crane: a rural postman version of a mixed postman problem but where 
each directed edge is traversed at least once. 
, undirected: a postman tour of minimum length in a graph where all edges are 
undirected. 
___, windy: an undirected postman problem where the cost of edge traversal depends 
on the direction the (undirected ) edge is traversed. 
postman tour (or covering walk): — in an arbitrary graph G: a closed walk containing 
every edge of G. 
k-th power — of a connected graph G: the graph with V(G*) = V(G) for which 
uv © E(G) if and only if 1 < dg(u,v) < k. 
quasi 4-connected graph: a 3-connected graph G such that, for each vertex-cut 
F CV with |F| = 3, F is the neighborhood of a vertex of degree 3 and G — F has 
exactly two components. 
retract or retracing — in a walk W: a section of the walk of the form v;_1, e;, U;, €:-41, 
Vi41 such that ey = €j41 (and Vi41 = U;-1)- 
retract-free walk: a walk that has no retracts. 
rooted tree: a directed tree having a distinguished vertex r, called the root, such that 
for every other vertex v, there is a directed r-v path. Occasionally encountered 
synonyms for rooted tree are out-tree, branching, and arborescence. 
semigirth ¢ — of a digraph G: the parameter @, for 7 = 0. 
semigirth, 7- — of a digraph G: the greatest integer such that, for any pair of vertices 
u,v: (a) if dist(u,v) < ¢, the shortest u-v directed walk is unique and there are at 
most 7 u-v directed walks of length dist(u,v) +1; (b) if dist(u, v) = @, there is only 
one shortest u-v directed walk; denoted ¢,. 
separating set — of sets A,B C V: a set X C V such that every A—B path in G 
contains a vertex of X. 
(left) shift operation — converting a k-bit string a = a a2a3---a, to a k-bit string 
b,b2...bg: an operation such that b; = aj41, fori1 = 1,2,...,k—1. 
spanning subgraph (or subdigraph) — of a graph G: a subgraph (or subdigraph) 
that contains all the vertices of G. 
splitting operation: see $4.2, Definition D11. 
stacker crane problem: a rural postman version of a mixed postman problem but 
where each directed edge is traversed at least once. 
strongly connected digraph: see digraph. 
strongly regular graph: a connected distance-regular graph with diameter two. 
subgraph contraction: see contraction. 
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super-« graph: a maximally connected graph whose minimum disconnecting sets are 
the vertices adjacent to some vertex of minimum degree; analogously defined for 
digraphs. 

super-\ graph: a maximally edge-connected graph whose minimum disconnecting edge 
sets are the edges incident on some vertex of minimum degree; analogously defined 
for digraphs. 

symmetric digraph: see digraph. 

symmetric digraphg: see digraph. 

symmetric digraphs: a digraph which is both vertex- and edge-transitive; used in 
84.7. 

symmetric graph: a graph which is both vertex- and edge-transitive. 

symmetric TSP (STSP) — for a complete (undirected) weighted graph: finding a 
minimum-weight hamiltonian cycle in K,,. 

total degree — of a vertex v in a mixed graph G: the total number of arcs and undirected 
edges incident on v. 

totally separating set: Given A C V, a subset X C V—A totally separates A if each 
component of G— X has at most one vertex of A. 

toughness — of a graph G: maximum t such that t c(.S) < |S|, where c(S) is the number 
of components of G— S. 

traceable: containing a spanning path. 

transition system: see §4.2, Definition D??. 

traveling salesman problem: finding a minimum-weight hamiltonian cycle in a 
weighted complete graph. A non-complete graph is made complete by adding the 
missing edges and assigning to them a prohibitively large weight; also referred to as 
the symmetric TSP. 

triangle inequality — on a weighted digraph (or graph): the condition dj; + dj; > dix 
for all distinct vertices 7,7,k, where dj; is the weight of the arc from vertex i to 
vertex j (of the edge between 7 and J). 

TSP: traveling salesman problem. 

__, asymmetric (ATSP): finding a minimum-weight hamiltonian cycle in a weighted 


complete digraph ne (every pair of vertices has both oppositely directed arcs). 

__, Euclidean: the special case of TSP in which the vertices are points in the 
Euclidean plane and the weight on each edge is the Euclidean distance between 
its endpoints. 


__, generalized — for a weighted complete digraph ra and a partition Vj,..., Vx of 
its vertices: finding a minimum-weight cycle containing exactly one (at least one) 
vertex from each set V;,i=1,...,k. 

(u|v)-(disconnecting) set: see disconnecting set. 

u—v path: a path from vertex u to vertex v. 

vehicle routing problem, capacitated (CVRP) -— in a weighted directed or undi- 
rected complete graph on n+ 1 vertices with a demand d; > 0 for = 1,2,...,n and 
with two parameters Q and k: finding a CVRP tour for which the total weight of 
the cycles is minimum. 

(vertex-)connectivity, — of a graph: the minimum number of vertices whose removal 
leaves a non-connected or trivial graph; denoted & or Ky. 

(vertex-)connectivity: — of a strongly connected digraph: the minimum number of 
vertices whose removal leaves a non-strongly connected or trivial digraph; denoted 
K OF Ky. 
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vertex splitting: operation in which a vertex w is replaced by an edge wv in such a 
way that some of the vertices adjacent to w are now adjacent to u and the rest are 
adjacent to v. 

vertex-symmetric graph: a graph whose automorphism group acts transitively on 
its vertex set; that is, for any pair of vertices u,v, there exists an automorphism of 
G mapping u to v. 

walk: an alternating sequence of vertices and edges such that for each edge, one end- 
point precedes and the other succeeds that edge in the sequence. 

weakly k-linked graph: a graph that has at least 2k vertices, and for every k pairs of 
vertices (u;,v;), there exists a u;—v; path P;, 1 <i<k, such that the k paths are 
edge-disjoint. 

weakly connected digraph: a digraph whose underlying graph is connected; also 
called connected digraph. 

wheel: a graph consisting of a cycle C’ and an additional vertex that is adjacent to 
every vertex of C. 
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INTRODUCTION 


85.1 concentrates on the classical concept of chromatic number and on the more 
recent but closely related concept of choice number, mostly in connection with other 
important graph invariants. Further developments of graph colorings appear in §5.2. 

Various problems, some of which are equivalent to colorings (e.g., 1-factorizations) 
are dealt with in the other sections of this chapter. The book [JeTo95] is a rich source 
of additional information, where results are organized around more than 200 open prob- 
lems. On list coloring and related topics, a comprehensive survey can be found in [Tu97] 
and its update [KrTuVo99]. The recent book [StScToF'al2] is devoted to the theory of 
edge colorings. 

TERMINOLOGY NOTE: We consider a graph to be without self-loops, to have at least one 
vertex, and, except within the last subsection, to be finite. 


5.1.1 General Concepts 


Graph coloring deals with the general and widely applicable concept of partitioning 
the underlying set of a structure into parts, each of which satisfies a given requirement 
(e.g., to be an independent set). One of the most famous problems in this area, and 
even within graph theory, what is now known as the Four Color Theorem (see 85.2.2), 
has been a driving force of research on graphs for nearly a century. 
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Proper Vertex-Coloring and Chromatic Number 


DEFINITIONS 
D1: A vertex-coloring of a graph G = (V, E) is a function 
p:VoCc 
from the set V of vertices to a set C of colors. 
D2: The coloring y is proper if no two adjacent vertices are assigned the same color. 


D3: A k-coloring is a vertex-coloring with at most k colors. 


NOTATION: If & is a positive integer, we assume (unless specified otherwise explicitly) 
that C = {1,2,...,k}. 


D4: A proper k-coloring y may also be viewed as a vertex partition 
ViU---UYy=V 


where the disjoint subsets V; = y~'(i) are called the color classes. (Thus, a k-coloring 
y is proper if and only if y~!(i) is an independent set for each i.) 


D5: A graph is k-colorable if it admits a proper vertex-coloring with at most k colors. 
D6: A graph is k-chromatic if it is k-colorable but not (k — 1)-colorable. 

D7: The chromatic number of a graph G, denoted ,(G), is the smallest nonnegative 
integer k such that G is k-colorable. 

EXAMPLES 

E1: The complete graph K,, on n vertices has chromatic number n. 

E2: A graph G (other than the null graph) has chromatic number 


x(G) = 1 if and only if G is edgeless, 
x(G) <2 if and only if G is bipartite. 


In particular, cycles of even length are 2-chromatic, while cycles of odd length are 
3-chromatic. 


DEFINITION 


D8: The (vertex) independence number of a graph G, denoted a(G), is the size of 
a maximum independent set in G. 


FACTS 
F1: An immediate consequence of the definitions is that for every graph G, 


AVL 
x(G) = a) 
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F2: [Bo88] The lower bound y(G) > 
the random graph Gn» satisfies 


X(Gnp) = (5 $3 o(t)) (tos i +) ee 


for every fixed p (0 < p < 1) with probability 1 — 0(1) as n > oo. 


(a 


a is almost tight for almost all graphs, since 


F3: Let f : R=° > R* be a nondecreasing, continuous function with f(0) = 1. If 
G = (V, E) is a graph such that, for every 1 < i < |V|, every subgraph of G on # vertices 
has independence number at least f (7), then 


Vly 


x(G) < ; Far 


List Coloring and Choice Number 


Many results on the chromatic number can be discussed in the following more general 
setting. 


DEFINITIONS 


D9: A (vertex) list assignment L on a graph G associates a set L, of colors with 
each vertex v of G. Each L, is interpreted as the set of allowed colors for vertex v. 


D10: The graph G is L-colorable (or list colorable, when L is understood from 
context) if it admits a proper vertex-coloring y such that y(v) € L, for all v. 


TERMINOLOGY NOTE: The term “list” is used in the literature for historical reasons 
only. No particular ordering on the color set L, is assumed here. 


D11: If |L.| =k for all v € V, then the list assignment L is called a k-assignment. 
D12: A graph G is k-choosable if it is L-colorable for every k-assignment L. 


D13: The choice number of G, denoted ch(G), is the smallest nonnegative integer 
k such that G is k-choosable. (In part of the literature, the choice number is called list 
chromatic number, and also the notation x¢(G) is commonly used for ch(G).) 


EXAMPLES 
E3: The complete graph K,, on n vertices has choice number n. 


E4: The choice number may be strictly larger than the chromatic number. In partic- 
ular, the complete bipartite graphs K3,3 and K2,4 are 2-colorable but not 2-choosable 
(the former with the lists (1,2), (1,3), (2,3) in both bipartition subsets, and the latter 
with lists (1,2), (3,4) and (1,3), (1,4), (2,3), (2,4) in its 2- and 4-element class, respec- 
tively). It is true, however, that cycles of even length are 2-choosable, while cycles of 
odd length have choice number 3 (although the former is not immediate to verify). 
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FACTS 


F4: [ErRuTa79] The complete bipartite graph K,,,, has choice number (1+ 0(1)) logy n 
as 1 —> 00. 


F5: (J. Kahn in [A193]; [TuVo94]) Asymptotically ch(G,,) = (1 + 0(1)) x(Gn,p) for 
every constant p (0 < p < 1) with probability 1 — 0(1) as n > oo. 


F6: [Kr00] If the edge probability p = p(n) is such that p(n) > 0 as n > oo and 
p(n) > n-/4+* for some ¢ > 0, then ch(Gp») = (1 + 0(1)) x(Gnp) with probability 
1— (1). 


F7: [AlkKrSu99, Vu99] The bound ch(Gn,p) = O(x(Gn,p)) holds also for p(n) > 2/n, 
and upper bounds of the form O(n p(n)/log(n p(n))) follow deterministically, too, from 
conditions (analogous to the expected values of parameters in G,,,) on degrees, pair- 
degrees, and complementary degrees. 


The Hajés Construction 


FACTS 


F8: [Ha61] Every graph of chromatic number at least k can be constructed from the 
complete graph K, by a sequence of operations of the following three types: 


(1) Insert new vertices and/or edges. 


(2) Having constructed vertex-disjoint graphs G; and Go, select edges u,v; in G; 
(i = 1,2), remove wiv, and ugve, identify wu, with ue, and insert the new edge 
U1 U2. 


(3) Identify nonadjacent vertices. 


F9: [Gr96] Every graph of choice number at least k can be constructed from any one 
complete bipartite graph of choice number k by a sequence of the operations (1) and (2) 
above, and the following third type: 


(3’) Having constructed a graph G = (V, F) that has an uncolorable list assignment D 
where |L,| > & for all v € V and two nonadjacent vertices u,v have the same list 
in L, then identify u with v. 


Lovasz’s Topological Lower Bound 


DEFINITION 


D14: The neighborhood complex of a graph G = (V, F) is the simplicial complex 
N(G) whose vertices are the vertices of G, and such that the set X C V is a simplex if 
all the « € X have a common neighbor in G. 


FACT 


F10: [Lo78] If the neighborhood complex M(G) of a graph G is a k-connected topo- 
logical space, then y(G) > k + 3. 
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Alon and Tarsi’s Graph Polynomial Characterization 


DEFINITION 


D15: The graph polynomial, also called the edge difference polynomial, of a 
graph G = (V,F) with E #Q@ and vertex set V = {v1,..., Un} is 
Pa = Pe(a1,.--,2n) = Il (xi — 25) 
i<j 
ViVG CE 
Given a list assignment L on G, we also define 


Q: = Qi(zi) = II (req) 


gel; 
fori=1,...,n. 
FACT 
F11: [AlTa92] A graph G admits a list L-coloring if and only if its graph polynomial 
Pg does not belong to the ideal generated by the polynomials Q;. 
List Reduction 
FACT 


F12: [TuVo97] Let G = (V, E) be a graph, let L be a k-assignment on G (k > 2), let 
X CV be any subset, and let F C E be the set of those edges which have at least one 
vertex in X. If (V,F) is a 2-choosable graph, then one can properly color the set X 
with colors p(x) € Lz, for all x € X, in such a way that at most one color occurs in the 
neighborhood of each vertex v € V \ X. In particular, ch(G) < ch(G-— X)+1. 


5.1.2 Vertex Degrees 


DEFINITIONS 


D16: The (Erdés—-Hajnal) coloring number of G, denoted col(G) (and also called 
the Szekeres—Wilf number) is the smallest positive integer k for which there exists 
an order v1 v2:--Un of the vertices, such that every vu; has fewer than k neighbors v; 
with j > 2. 


D17: A graph is d-degenerate if none of its subgraphs has minimum degree larger 
than d. (col(G) is the smallest integer k such that G is (k — 1)-degenerate.) 


D18: The clique number of a graph G, denoted w(G), is the size of a maximum 
clique in G. 


FACTS 
F13: For every graph G, 


w(G) < x(G) < ch(G) < col(G) < AG) +1 
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F14: (Nordhaus-Gaddum Theorem) [NoGa56] For any graph G = (V,£) and its 
edge-complement G, 


MG) AxX(G) = |Vie1 


F15: [ErRuTa79] The Nordhaus—Gaddum upper bound also holds for list colorings: 


ch(G) + ch(G) <|V| +1 and col(G) + col(G) < |V|+1 
F16: [HaSz70] Every graph G = (V,&) with maximum degree A admits a proper 
(A + 1)-coloring such that each color class has cardinality 4 or 4] (called 


AFI AFI 
equitable coloring). See [KiKoYu09] for a short proof and further references. 


F17: [Kiko08] If d(x)+d(y) < 2D +1 for every edge xy in G, then G has an equitable 
(D + 1)-coloring. 


F18: (Brooks’s Theorem) [Br41] If the graph G is connected, then 
x(G) < A(G) 
unless G is complete or A(G) = 2 and G is an odd cycle. 


F19: [Vi76, ErRuTa79] (analogue to Brooks’s theorem for list colorings) If the graph 
G is connected, then 


ch(G) < A(G) 


unless G is a complete graph or A(G) = 2 and G is an odd cycle. (See also §5.2.1 for 
generalizations. ) 


REMARK 


R1: Concerning the sequence of inequalities w < yx < ch < col < A+1, Brooks’s 
theorem and its analogue characterize equality in x = A+1 and ch = A+1. The other 
end, w = x, is studied in the theory of perfect graphs (see §5.5). So far, the problems 
[Tu97a] of finding tight conditions for ensuring w = ch or x = ch are open, except for 
ch = 2. 


FACTS 


F20: [ErRuTa79] A connected graph is 2-choosable if and only if the sequential removal 
of degree-1 vertices yields the trivial graph K1, or an even cycle, or an even cycle plus 
a degree-2 vertex whose two neighbors are at distance two along the cycle. 


F21: [A100] There exists a sequence of real numbers €q with eg + 0 as d > co, such 
that the inequality ch(G) > ($ — €a) log, d holds whenever the graph G on n vertices 
contains a subgraph of minimum degree d. Equivalently, ch(G) > ($ — 0(1)) log, col(G) 
as col(G) gets large. 
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5.1.3. Critical Graphs and Uniquely Colorable Graphs 


The two extremes are considered here: k-chromatic graphs that are almost (k — 1)- 
colorable, and k-chromatic graphs with just one proper k-coloring. List critical graphs 
are mentioned only briefly, to indicate that they behave quite differently. 


DEFINITIONS 

D19: For k > 2, a k-chromatic graph G = (V, E) is k-critical if 
x(G-e)=k-1 

for every edge e € E; and G is k-vertex-critical if 
x(G-v)=k-1 

for every vertex v € V. 


D20: A graph G = (V,E£) is k-list critical if there is a (k — 1)-assignment L for G 
such that every proper subgraph of G is L-colorable, but G itself is not L-colorable. 


FACTS 


F22: Every k-vertex-critical graph is connected, without any vertex cutsets inducing 
a complete subgraph, and contains a k-critical spanning subgraph. 


F23: [To78] A graph G is vertex-critical if and only if the complement of each block 
in the complementary graph G is vertex-critical. 


F24: For k = 2 and k = 3, a graph is k-critical if and only if it is k-vertex-critical. 
The unique 2-critical graph is K2, and the 3-critical graphs are the odd cycles. 


F25: (same as Brooks’s Theorem) Every k-critical graph G has minimum degree at 
least k — 1, and if G is (k — 1)-regular, then either G = K;, or k = 3 and G is an odd 
cycle. 


F26: [Mi92] Every k-critical graph G # K; (k > 4) contains all trees of k edges as 
subgraphs. 


F27: [Ga63] If G is k-critical, then its vertices of degree k — 1 induce a subgraph in 
which every block is a complete graph or an odd cycle. Conversely, if H is a K,-free 
graph of maximum degree at most k — 1 > 3 where each block is a complete graph or 
an odd cycle, then there exists a k-critical graph G in which the vertices of degree k — 1 
induce a subgraph isomorphic to H. (This graph H may also be the null graph.) 


F28: [St85] For k > 4, let F be a connected graph of maximum degree at most k — 1, 
such that each of its blocks is a complete graph or an odd cycle. There exist only finitely 
many k-critical graphs in which the vertices of degree k — 1 induce F' and in which the 
other vertices induce either K,_, or a (k — 2)-colorable graph. 


F29: [GrLo74] A graph H is a proper subgraph of some k-critical graph if and only if 
HT and each of its edge contractions H/e (for e € E(H)) is (k — 1)-colorable. 
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F30: [Si72, To72] There exists an infinite sequence of 4-critical graphs on n vertices 
with minimum degree at least }n!/%. (It is not known whether the minimum degree 
can be as large as cn in 4-critical or 5-critical graphs.) 


F31: [KoYal2] The number of edges of a k-critical graph, k > 4, on n > k vertices is 


at least [Sen oe This bound is tight for alln > 6 ifk =4, foralln=1 


(mod (k — 1)) if & > 5, and also for all n = 2 (mod 4) with n > 10 if k = 5. (Classical 


lower bounds are “+n + £53 in [Di57] and 454n + Wea? in [Ga63].) 


F32: [Lo73] For the largest possible independence number a,,;, in k-critical graphs on 
n vertices, the difference n — ap, is at least gkn'/ (k—2) and, for infinitely many values 
of n, at most 2kn!/(F-2), 

F33: [MoRe01] Let ka be the largest integer k’ such that (k’+1)(k’+2) < A. There is 
an absolute constant Ag such that for any A > Ag and any k < kag, if G has maximum 
degree A and if y(G) > A—k then G contains a (A — k + 1)-chromatic subgraph of 
maximum degree at most A+1. On the other hand, for every A > 2 and k > kag, there 
exist arbitrarily large (A — k + 1)-critical graphs with maximum degree A. Viewing it 
from the algorithmic side, on graphs with A large (fixed), (A — k)-colorability can be 
tested in linear time if k < ka, but it is NP-hard if ka < k < A—2. (It is conjectured 
that the condition A > Ao can be omitted.) 


F34: [R6Tu85] Let t(k,¢) denote the minimum number of edges in graphs with k 
vertices and independence number less than ¢ (i.e., the complement of the Turan number, 
cf. §8.1). If G is (kK + 1)-critical and has at least 2t(k, 2) — 1 vertices, then in order to 
obtain an ¢-colorable subgraph, one must delete at least t(k,2) edges. This bound is 
tight. Moreover, for unrestricted (k+1)-chromatic graphs, the minimum is t(k+1, +1). 


F35: [R6Tu85] Let k > 2, and G be a graph with n vertices and m edges. Suppose 
that the automorphism group of G acts transitively, either 


(a) on the vertices and x(G) > 2k +1, or 
(b) on the edges and y(G) > k? +1. 


Then, to obtain a k-colorable subgraph of G, one has to delete at least ,/n vertices in 
Case (a) and ,/m edges in Case (b). (It is not known whether vertex/edge transitive 
4-chromatic graphs can always be made bipartite by removing a bounded number of 
vertices /edges.) 


F36: (V. Rédl in [To85]) The number of nonisomorphic 4-critical graphs is at least 
cn for some constant c > 1. 


F37: Every k-critical graph G is k-list critical, by taking the list assignment DL, = 
{1,...,4—1} for all vu € V(G). 


F38: [StTuVo09] For every & > 5, every complete graph on at least k vertices is k-list 
critical, whereas there are only finitely many 4-list critical complete graphs. In partic- 
ular, a list critical graph may contain other list critical graphs as (induced) subgraphs, 
with the same sizes of ‘critical lists’ (whereas k-critical graphs contain no k-critical 
proper subgraphs); moreover, a graph may be critical with respect to many different 
values of list sizes. 
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OPEN PROBLEMS 


P1: Determine tight asymptotic bounds on the maximum number of edges in k-critical 
graphs on n vertices (k > 4 fixed, n > oo). 


P2: (J. NeSetiil and V. Rédl) For & > 3, does there exist a function f,(n) such that 
every (& + 1)-critical graph with at least f;,(m) vertices contains a k-critical subgraph 
with at least n vertices? 


CONJECTURES 


C1: Double-critical graph conjecture (Erdés and Lovasz) If x(G — u-—v) = 
x(G) — 2 holds for any two distinct vertices of G, then D is a complete graph. 


C2: (Hedetniemi) The equality y(G x H) = min(y(G), x(#)) holds for any two graphs 
G and H. (See [Zh98] for a survey. The fractional version of the equality is valid; see 
§5.2.1.) 


FACT 

F39: [Keke54] The function f3(n) exists, i.e., every ‘large’ 4-critical graph contains a 
‘long’ odd cycle. 

EXAMPLES 

E5: The complete join of two disjoint cycles of length 2¢+ 1 is a 6-critical graph on 
n = 4t +2 vertices with 4n? +n edges ([Di52]). More generally, the number of edges in 
k-critical graphs on n vertices can be as large as (5 — cy)n”, where cy, + 0 as k > 00. 
E6: Onn = 8¢ +4 vertices, let V; (1 <i < 4) be disjoint vertex subsets of cardinality 
2t+1 each, V; and V2 induce the cycle Cg:41, V3 and V4 be independent and completely 


joined, and let the V,; — V3 edges and the V2 — V4 edges form a perfect matching. This 
graph is 4-critical, with more than n?/16 edges ([To70}). 


Uniquely Colorable Graphs 


DEFINITION 


D21: A graph G = (V,E) is uniquely k-colorable if it admits just one proper 
k-coloring with k = x(G) colors (apart from the renumbering of colors). 


The uniquely 1-colorable graphs are the edgeless graphs, and therefore we assume below 
that k > 2. 


EXAMPLES 


E7: The uniquely 2-colorable graphs are the connected bipartite graphs. 
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E8: If G— v is a uniquely k-colorable graph, and v a vertex of degree k — 1 whose 
neighbors are in mutually distinct color classes of G—v, then also the graph G is uniquely 
k-colorable. In particular, uniquely 4-colorable planar graphs are obtained from a planar 
embedding of K4 by sequentially inserting new degree-3 vertices inside triangular faces 
(where the neighbors of each new vertex are the vertices of the corresponding face). 


FACTS 
F40: [ChGe69] Every uniquely k-colorable graph is (k — 1)-connected. 


F41: [Bo78] If the graph G is k-colorable on n vertices, with minimum degree larger 
than 3 n, then G is uniquely k-colorable. This lower bound is best possible for 
infinitely many n. 


F42: [Bo78] Let the graph G be k-colorable on n vertices, with a k-coloring where the 


union of any two color classes is connected. If the minimum degree is larger than f—2 n, 


then G is uniquely k-colorable. This lower bound is best possible for infinitely many n. 


5.1.4 Girth and Clique Number 


The results below show that the exclusion of short cycles does not make the chromatic 
number bounded; but on graphs without large complete subgraphs, the general upper 
bounds in terms of vertex degrees can be improved. 


FACT 


F43: [Er59] For a suitably chosen constant c, and for every k > 1 and g > 2 there 
exists a non-k-colorable graph of girth g on at most ck?9 vertices. Moreover, for k > 4, 


the maximum girth of k-chromatic graphs on n vertices grows with O( ez) as nN — oo. 


REMARK 


R2: The proof in [Er59] is probabilistic. Constructions usually are of much larger size 
than guaranteed by Erdés’s theorem. For g small, some examples are listed below; for 
unrestricted g, see, e.g., [L068, NeR679]. A fairly small general construction, involving 
Ramanujan graphs on O(k*9) vertices, can be found in [LuPhSa8g]. 


EXAMPLES 


E9: ([Zy49] If G, is a k-chromatic triangle-free graph, let Gy41 consist of k vertex- 
disjoint copies Gi, of G; (i = 1,...,k), together with a new independent set X of size 
|V(G;)|*. To each k-tuple (v1,...,vzr) € V(G1) x ++» x V(Gx), join a distinct common 
neighbor in X. This Gy+1 is (k + 1)-chromatic and triangle-free. 


E10: [De48] If G;, is a k-chromatic graph of girth at least 6, take an independent set X 
of cardinality s = k|V(G;,)|—k+1, together with m = (vee) vertex-disjoint copies Gi, 
of G;, (i =1,...,m). For each of the m distinct |V(G;)|-tuples Y; C X, draw a perfect 
matching between Y; and V(Gi.). The resulting graph G,41 is a (k+1)-chromatic graph 
of girth 6. 
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E11: [My55] For each vertex vu of a k-chromatic triangle-free graph G;, take a distinct 
new vertex vu’ adjacent to all neighbors of v, and join a new vertex w to all of these v’. 
The graph obtained is (k + 1)-chromatic and triangle-free. 


E12: (Kneser graphs) [Lo78, Ba78] The vertices of the Kneser graph K(n,k) are the 
k-element subsets of {1,...,n} (n/2 > k > 1), and two vertices are adjacent if and only 
if the corresponding two k-sets are disjoint. Then y(K(n,k)) =n—2k+2, and K(n,k) 
is triangle-free if n < 3k. Moreover, a vertex-critical subgraph of K(n, k) is induced by 
the vertices corresponding to the k-tuples that have no pair of consecutive elements in 
the cyclic order {1,2,...,n} ([Sc78]). 


CONJECTURE 


C3: [Re97| For every graph G, 


FACTS 


F44: [Re97] For every m there is a number A,, such that, if A(G) > A,, and w(G) < 
A(G) + 1 — 2m, then x(G) < A(G) +1—m. Moreover, there is a constant € > 0 such 
that if A(G) > 3, then x(G) < (1 — «)(A(G) + 1) + €w(G). 


F45: Conjecture C3 is true for line graphs of multigraphs [KiReVe07] and for several 
other graph classes [RaSc07], [KiRe08], [KoSc10]. 


F46: Combining Fact F3 with the known estimates [AjKoSz80] on the Ramsey numbers 
R(s,t), it follows that if w(G) < t, then for some constant c = c(t), we have 


fi 1-1/t 
< 
x(G) Se (=) 


F47: [BokKo77, Ca78, La78] Let t > 3. If w(G) <t < A(G), then 


t 


ee 
x(G@) Sq 


(A(G) + 2) 


F48: [Er67] The maximum ratio in the set 


{Srey | Nel=n} 


grows with @(n/ log? n) as n > oo. 


F49: [Jo96a] For every r € N there exists a constant c, such that if the graph G is 
K,-free, then 
cy A(G) log log A(G) 

log A(G) 


ch(G) < 


F50: [Jo96] If the graph G is triangle-free, then ch(G) < cA(G)/log A(G), for some 
constant c independent of A(G). 
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F51: [Ki95] If the graph G has girth at least 5, then 


(1 + EA(G)) A(G) 
log A(G) 


ch(G) < 


where ca ~>0as A> ow. 


F52: [NoReWul2] If |V(G)| < 2x(G) 4+ 1, then ch(G) = x(G). (This is equivalent 
to ch = w on complete multipartite graphs with n < 2w +1; formerly called Ohba’s 
conjecture.) 


F53: [Tu97a] Every induced subgraph H of the square C? of a cycle (n > 3) satisfies 
ch(H) = w(#). 


F54: [KoMa77] For every integer A > 3, there exists a triangle-free graph G with 
maximum degree A and x(G) > Tea 

F55: [Br02] Let G = (V, E) be a maximal triangle-free d-regular graph, with d > |V|/3. 
Then x(G) < 4; moreover, if G has a nontrivial automorphism, then (G) < 3. 


F56: [Ko78] For every A there is a g = g(A) such that y(G) < $A + 2 whenever G 
has girth at least g and maximum degree at most A. 


F57: [MiSc04] If the cycles in G have at most p distinct even lengths and at most q 
odd lengths, then y(G) < min{2p + 3,2q + 2}. The bound is tight for all p and gq. 


F58: [RaSc01] If every induced cycle of G has length 4 or 5, then x(G) < 3. (This 
generalizes the fact [Su81] that every triangle-free graph without induced Ps and Cg is 
3-colorable.) Also, all pairs F, Fy of graphs with the property that every graph without 
induced F; and Fy is 3-colorable can be characterized ([Ra04]). 


CONJECTURE 

C4: (Erdés—Faber—Lovasz Conjecture) The union of any n edge-disjoint copies of K,, 
has chromatic number n. 

FACT 


F59: [Ka92] If G is the edge-disjoint union of n complete graphs of n vertices, then 
x(G) = n+ o(n) and also ch(G) = n+ o(n) as n > oo. 


The Conjectures of Hadwiger and Hajés 


DEFINITIONS 


D22: A contraction of an edge e = wv in a graph G is obtained by removing the 
edge e and identifying the vertices u and v, i.e., replacing u and v by a single vertex w 
adjacent to those vertices that were adjacent to u or v. A graph G is contractible to 
a graph H if H can be obtained from G by a sequence of edge contractions. 
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D23: A graph H is a minor of a graph G if a subgraph of G is contractible to H. An 
H-minor is a minor isomorphic to H. 


D24: A subdivision of an edge ce = wv in a graph FH is obtained by replacing e with 
a new vertex w and two new edges uw and wv. A graph G is a subdivision of a graph 
HA if G can be obtained from H by a sequence of edge subdivisions. 


D25: A graph G is said to be an odd K,-expansion if G consists of p vertex-disjoint 
trees, every two of which are joined by an edge, and the vertices of the trees can be 
two-colored so that the edges of the trees are bichromatic but that the edges between 
trees are monochromatic. 


CONJECTURES 
C5: (Hadwiger’s Conjecture) [Ha43] Every k-chromatic graph contains a K,,-minor. 


C6: (Hajés’s Conjecture) For k < 6, every k-chromatic graph contains a subdivision 
of the complete graph Kx. 


C7: (Gerards and Seymour — “Odd Hadwiger Conjecture”) If G contains no odd 
Ky41-expansion, then x(G) < k. 


FACTS 
F60: [Di52] Both Hadwiger’s and Hajdés’s conjectures are true for k = 4. 


F61: [Ca79] If x(G) = 4, then G contains an odd-triangle subdivision of K4, where 
each of the four cycles corresponding to the triangles of A’4 has odd length. Moreover, 
there is a polynomial-time algorithm that either properly colors an input graph G with 
3 colors or outputs an odd-triangle subdivision of K4 in G ([Za98]). (It is not known 
whether G also contains a subdivision of K4 where each of the six paths obtained from 
the edges of K4 have odd length.) 


F62: [Ca79] For every k > 7 there exist graphs that are k-chromatic but do not contain 
any subdivision of Kx. 


F63: [Wa37] The 4-colorability of all planar graphs implies the 4-colorability of all 
graphs not contractible to K5. Hence, for k = 5, Hadwiger’s conjecture (stated later 
than Wagner’s theorem) is equivalent to the Four Color Theorem on planar graphs (cf. 
5.2.2). 


F64: [RoSeTh93] For k = 6, the validity of Hadwiger’s conjecture can be deduced 
from the Four Color Theorem. For k = 7, as a partial result toward the conjecture, it is 
known that every 7-chromatic graph contains a K7-minor or a K44-minor ([KaTo05]). 
For general k, estimates in terms of connectivity and also of independence number can 
be found (e.g., see [KaSo07]). 


F65: [BoCaEr80, Ko82] For the random graph on n vertices, Hadwiger’s conjecture is 
valid with probability 1 — o(1) as n > ow. 

F66: [Ko84] Let h(G) denote the largest number of vertices in a complete graph to 
which G can be contracted. Then the inequality h(G) + h(G) < |6n/5| holds for every 
graph G on n > 5 vertices, and the upper bound is tight. 
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F67: There exists a constant c such that every graph G without a K,-minor has 
col(G) < cklogk, hence G is O(k log k)-colorable and also O(k log k)-choosable as k > 
oo ([Ko84a], [Th84]). As a lower bound, for every k there exists a K,-minor-free graph 
which is not |4(& — 2)/3|-choosable ([BaJoWo11]}). 


F68: [GeGeReSeVe09] There is a constant c such that, for every k > 1, if a graph G 
contains no odd K;-expansion, then y(G) < ck/logk. (See §5.2.1 for related results.) 


5.1.5 Edge-Coloring and x-Binding Functions 


TERMINOLOGY NOTE: In this subsection we shall explicitly use the term multigraph 
in those cases where multiple edges are allowed. 


DEFINITIONS 


D26: A proper edge-coloring of a graph or multigraph G is an assignment of colors 
to the edges of G, such that all edges incident with the same vertex get distinct colors. 


D27: The chromatic index of a graph or multigraph G, denoted y‘(G), is the smallest 
number of colors in a proper edge-coloring of G. 


D28: Definitions D26 and D27 for edge-colorings can be generalized to list edge- 
colorings in the natural way, as for vertex-colorings. The edge choice number (or list 
chromatic index or list edge chromatic number) is the minimum list-size that 
guarantees a list edge-coloring of G; it is denoted by ch’(G) (or by x/(G)). 


D29: A proper total coloring of G = (V,£) is an assignment y of colors to the 
vertices and the edges of G, such that y induces a proper coloring on both V and E, 
and such that y(e) 4 y(v) whenever e € E£ is incident with vu € V. 


D30: The smallest number of colors in a proper total coloring of G is denoted by 
y’"(G); and the analogous quantity for the smallest size of lists is denoted by ch’ (G) 
(or by x7 (G)). 


D31: Let G = (V,F) be a graph or multigraph. 

e The line graph L(G) of G has as its vertices the edges of G, two of them being 
adjacent in L(G) if they share a vertex in G. 

e The total graph T(G) has V U E as its vertex set, and its subgraphs induced by 
V and E are isomorphic to G and L(G), respectively; moreover, v € V is adjacent 
to e € E in T(G) if v is an endpoint of e in G. 

e The square G? has the same vertex set as G; two vertices are adjacent in G? if 
they are at distance 1 or 2 apart in G. 


D32: A y-binding function (also called y-bounding in a part of the literature) on 
a class G of graphs is a function f : NN such that y(G) < f(w(G)) for all GEG. 
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REMARKS 
R3: Clearly, 
x(G) =x(L(G)) and ch'(G) = ch(L(G)) 


and similarly, 


x"(G@) =x(L(G)) and ch" (G) = ch(T(G)) 


In general, however, much stronger results are valid for y’ and ch’ than for y and ch, 
due to the restricted structure of line graphs. (The one-to-one correspondence between 
edge colorings of G and vertex colorings of L(G) is valid for almost all kinds of coloring; 


one exception is known to have opposite bijection, namely, 3-consecutive coloring; see 
85.2 for details.) 


R4: The cliques in the line graph L(G) (and in the total graph T(G)) correspond to 
the stars (with their centers) and to the — possibly multiple-edged — triangles in the 
(multi)graph G. Thus, 


A(G) < w(L(G)) < 5A(G) 
Moreover, ch’(G) > y'(G) > A(G). 


R5: The total graph T(G) of G = (V, E) is isomorphic to the square of the bipartite 
graph B = B(G) whose bipartition classes are V and E, where v € V and e € EF are 
adjacent in B if v is an endpoint of e in G. 


FACTS 
F69: (Vizing’s Theorem) [Vi64] If G is a simple graph, then 
MG) SAG) 1 
If a multigraph G has maximum edge multiplicity u(G), then 
x'(G) < A(G) + u(G) 


For unrestricted edge multiplicity the upper bound y/(G) < 3A(G) follows (first proved 
in [Sh49]). Thus, f(w) = w+ 1 is a y-binding function on the class of line graphs of 
simple graphs, and f(w) = 3w/2 is one on the class of all line graphs. 


F70: [KiSc83] If the graph G contains no induced A, 3 (claw) and no induced K; — e, 
then y(G) < w(G) +1. 


F71: (K6nig’s Theorem) [K616] If G is a bipartite multigraph, then y/(G) = A(G). 


F72: [MoRe98] There exists a constant C such that x(G) < A(G)+C holds for every 
graph G. 


F73: [Ga95] If G is a bipartite multigraph, then ch’(G) = A(G). 


F74: [ElGo96] Suppose that G is a d-regular multigraph with x’(G) = d. If G has an 
odd number of proper edge d-colorings, or if G is planar, then ch’(G) = d. 


F75: For every multigraph G, ch’’(G) < ch'(G) +2. 
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F76: [Ka96] For every graph G of maximum degree A, ch'(G) < (1 + 0(1))A, where 
o(1) > 0 as A> ~w. 


CONJECTURES 


C8: Total Coloring Conjecture (Vizing [Vi64]; Behzad) For every multigraph G 
with maximum edge multiplicity (G), 


x"(G) SA(G) + W(G) +1 
In particular, if G is a simple graph, then /’(G) < A(G) +2. 


C9: Overfull Subgraph Conjecture (A. J. W. Hilton) If y/(G) = A(G) +1 and 
A(G) > |V(G)|/3, then there is an “overfull” subgraph H in G, such that 
1 
eal > ace | 5Ivan| 


C10: (Goldberg; Seymour) Every multigraph G with x/(G) > A(G) + 2 has 


|E(H)| 


x'(@) = HCG, |V(H)|>2 faa 


C11: List Coloring Conjecture (Vizing; Gupta; Albertson and Collins; Bollobds 
and Harris) For every multigraph G, we have ch'(G) = x'(G). 


C12: (Gravier and Maffray) More generally than Conjecture 11, for every claw-free 
graph G we have ch(G) = x(G). 


C13: (Borodin, Kostochka and Woodall; Juvan, Mohar and Skrekovski; Hilton and 
Johnson) For every graph G we have ch" (G) = x'(G). 


C14: (Kostochka and Woodall) More generally than Conjecture 13, ch(G?) = y(G?) 
for the square of every graph G. 


Snarks 


DEFINITIONS 


D33: A graph is cyclically k-connected if at least k edges must be deleted in order to 
leave two components, each containing a cycle. 


D34: A snark (often called a “nontrivial snark”) is a 3-regular, cyclically 4-edge- 
connected graph of girth at least 5, that is not edge-3-colorable. 


D35: A snark G is irreducible if every nontrivial edge-cut (i.e., leaving at least two 
vertices in each component) separates G into 3-edge-colorable components. 
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REMARKS 


R6: According to the two possible cases provided by Vizing’s theorem, a commonly 
used terminology says that a graph G is of Class 1 or Class 2 if its chromatic index is 
equal to A(G) or A(G) +1, respectively. Hence, snarks represent the smallest nontrivial 
subfamily of Class 2. There has been much effort to construct snarks with various 
specific properties. Several methods of construction with further references can be 
found, e.g., in [Ko02] and in the earlier paper [CaMeRuSp98]. 


R7: The idea of an irreducible snark is that it cannot be “reduced” to a smaller snark 
by separating from G a proper subgraph H which is itself not 3-edge-colorable and by 
restoring 3-regularity. 


FACTS 


F77: [NeSk96] A cubic graph G with chromatic index 4 is an irreducible snark if and 
only if G is bicritical, i.e., G—{u, v} is 3-edge-colorable for every pair of distinct vertices 
u,v € V(G). 


F78: [ChSk10] Every irreducible snark G can be factorized into a collection {H,,..., H,} 
of cyclically 5-edge-connected irreducible snarks such that G can be reconstructed from 
them by repeated dot products. Moreover, such a collection is unique up to isomorphism 
and ordering of the factors. 


F79: [Ko09] For each surface of genus at least 5 there exists a snark that admits a 
polyhedral embedding on the surface. (This disproves Griinbaum’s conjecture for every 
orientable surface of genus at least 5; see §7.6.) 


F80: [Go85] If the Cycle Double Cover Conjecture (see §7.6.1) is false, then a smallest 
counterexample is a snark of girth at least 8. 


F81: [Ce84] If the 5-flow conjecture (see §5.2.2) is false, then a smallest counterexample 
is a cyclically 5-edge-connected snark of girth at least 7. 


F82: [Ko96] For every g > 5 there exists a cyclically 5-edge-connected snark of girth 
at least g. 


Uniquely Edge-Colorable Graphs 


DEFINITION 


D36: A graph G = (V, E) is uniquely edge k-colorable if it admits just one proper 
edge-coloring with k = x'(G) colors (apart from the renumbering of colors). 
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EXAMPLES 


E13: The uniquely edge 1-colorable graphs are the matchings. The uniquely edge 
2-colorable graphs are the paths and the even cycles. The star graph Ky 3 and the 
complete graph Ky4 are uniquely 3-edge-colorable. (No complete characterization is 
available for k = 3.) 


E14: [Tu76] The graph consisting of two 9-cycles ajaz...a9 and bib2...bg9 and the 
further edges a;ba; (1 < i < 9, subscript addition modulo 9) is uniquely 3-edge-colorable. 
This is the only known triangle-free non-planar example for k = 3. 


FACTS 


F83: [Th78] The star graph K,,, is the only uniquely edge k-colorable graph, for each 
k>4. 


F84: If G is 3-regular and uniquely 3-colorable, then the following operation called 
“Y—A replacement” yields again a uniquely 3-colorable 3-regular graph. Let wy, u2, us 
be the neighbors of vertex v. Remove v and insert the new vertices v1, v2,v3 and the 
new edges v1 V2, V1U3, V2v3 and u,;v; for i = 1, 2,3. 


Further x-Bound Graph Classes 


CONJECTURES 


C15: (Gydarfds [Gy75]; Sumner [Su81]) For every tree T, there exists a x-binding 
function on the class of graphs not containing T' as an induced subgraph. 


C16: (Gyarfds [Gy87]) There exists a y-binding function for the class of graphs in 
which every induced cycle has length 3 or 4. 


C17: Divisibility conjecture (McDiarmid and Hoang) If a graph contains no in- 
duced odd cycle of length at least 5 (i.e., an odd hole), then it can be 2-colored so that 
no maximum clique is monochromatic. (Equivalently, every induced subgraph of G has 
such a 2-coloring precisely when the graph contains no odd hole longer than 3.) 


FACTS 


F85: [KiPe94] A y-binding function exists on the class of graphs not containing T, if 
the forbidden induced tree T has radius 2. 


F86: [Sc97] For every tree T, there exists a y-binding function on the class of graphs 
not containing any subdivision of T as an induced subgraph. 


F87: The class of intersection graphs of axis-aligned rectangles in the plane is y-bound 
([AsGr60]), while that of the triangle-free intersection graphs of axis-aligned boxes in 
R® isn’t ({[Bu65]). 
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F88: [PakoKrLaMiTrWal2] The class of intersection graphs of line segments in the 
plane is not y-bound; it contains triangle-free graphs with any large chromatic number. 
(This fact also disproves Scott’s conjecture for a large class of non-planar graphs.) On 
the other hand, the class of complements of those graphs is y-bound ([PaT694]). 


F89: [BoTh12] If H is a triangle-free graph with diameter 2, then the class of graphs 
not containing any induced subdivision of H is y-bound. (Hence Scott’s conjecture is 
true for maximal triangle-free graphs.) 


F90: [RaSc01] There exists no linear y-binding function for the class of graphs in which 
every induced cycle has length 3 or 4. 


REMARKS 


R8: The validity of Conjecture C17 would imply by induction that the corresponding 
class of graphs has y-binding function 2”~+. While perfect graphs need to exclude both 
the odd holes and the odd antiholes (i.e., induced subgraphs that are the complements 
of odd holes), to make a class y-bound it would suffice to exclude odd holes. The 
exclusion of odd antiholes, however, is not sufficient, as shown by the graphs of girth 6; 
they can have any large chromatic number. 


R9: For more details on y-bound classes of graphs, see [Gy87] and [RaSc04]. 


5.1.6 Coloring and Orientation 
Paths and Cycles 


FACTS 


F91: [Ga68, Ro67] A graph has y(G) < k if and only if G admits an orientation 
without directed paths on more than k vertices. 


F92: [Mi62] (Minty’s Condition) A graph G has chromatic number ,(G) < k if and 
only if G admits an orientation such that every cycle C C G has at least |C|/k arcs 
oriented in each of the two directions around C. 


F93: [Tu92] It suffices to assume Minty’s Condition above just for the cycles of length 
|C| = 1 (mod k). (This implies the Gallai-Roy Theorem (F91), too.) Moveover, if such 
an orientation is given, then a proper k-coloring of G can be found in polynomial time. 
In particular, if the undirected graph G contains no cycles of length 1 modulo k, then 
G can be properly k-colored in linear time. 


F94: [Bo76] In every strongly connected orientation of a k-chromatic graph (k > 2), 
there is a directed cycle of length at least k. 
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Eulerian Subgraphs 


DEFINITION 


D37: An Eulerian spanning subgraph of an oriented graph G= (V, E) is asubgraph 
FH = (V,F) of G with the same vertex set V, and with di,(v) = dy(v) for all v € V. 
(Some or all vertices of H may be isolated; hence, F = @ shows that every G has at 
least one such subgraph.) 


FACT 


F95: [AlTa92] If the number of Eulerian spanning subgraphs of an oriented graph re 
with an even number of edges differs from the number of those with an odd number of 
edges, then G is L-colorable whenever the list assignment L satisfies |L,| > d+(v) for 
all v € V. (The spanning subgraph condition holds in every bipartite graph; but this 
class can be handled in a simpler way using kernels; see below.) 


Choosability and Orientations with Kernels 


Some background and references for the following results can be found in [Tu97]. 
FACTS 


F96: Suppose that in the oriented graph G= (V, EB), every induced subgraph A 
contains an independent set Y C V such that from each vertex v ¢ Y of FT there is at 
least one arc to Y. If L is a list assignment with |L,| > d*(v) for all v € V, then G 
admits a list coloring. 


F97: The condition above holds in every bipartite directed graph. A more general 
class where the required subsets Y (termed kernels) exist is the class of so-called 
kernel-perfect graphs (see §2.5). In those graphs, it suffices to consider orientations 
without directed 3-cycles. 


F98: If every induced subgraph of a graph G has average degree at most 2k, then G 
has an orientation with maximum out-degree at most /, and such an orientation can be 
found in polynomial time (in several ways, via the Kénig—Hall theorem or by sequential 
improvements of suboptimal solutions). In particular, if G is bipartite and satisfies the 
average-degree condition, then ch(G) < k+1 and a list coloring can be found efficiently 
for any given (k + 1)-assignment. 


More results can be derived by applying this machinery on edge-colorings and planar 
graphs (see §5.1.5 and §5.2.2). 


Acyclic Orientations 
DEFINITION 


D38: (Cf. [MaTo84] and [Ai88, p. 323]) The acyclic orientation game starts with 
an undirected graph. In each round, Player A (‘Algy’) selects a non-oriented edge e of 
G, and Player S (‘Strategist’) orients that edge in one direction, under the condition 
that no directed cycles may occur. The game is over when the graph G admits just one 
acyclic orientation that extends the partial orientation obtained so far. The goal of A is 
to finish the game in as few rounds as possible, while S aims at making the game long. 
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NOTATION: We denote by c(G) the number of rounds when both A and S play optimally 
on graph G. 


FACTS 


F99: [AiTrTu95] If G = (V,£) is a graph on n > 6 vertices with c(G) = |E|, then 
|E| < in, and for n > 7 equality holds if and only if G is the complete bipartite graph 
K\n/2),[n/21- 


F100: [Pil0] There exists a constant c > 0 such that c(G) > em 282 holds for all 
graphs with n vertices and m edges. Moreover, as n — oo, a general asymptotic upper 
bound is c(G) < $n? + 0(n?) for all graphs on n vertices. 


F101: [AiTrTu95] For every g, there exists a graph G = (V,£) with girth at least g 
and c(G) < |E|. 


F102: [AlTu95] For the random graph G,,,, with n vertices and edge probability p, 
c(Gnp) = O(n log n) with probability 1 — 0(1) as n > oo whenever p > 0 is fixed. For 


unrestricted p, a general upper bound is c(Gn») = O(nlog® n). 


F103: [Pil0] It is APX-hard to determine the value c(G); it cannot be approximated 
within 74/73 —e for any ¢ > 0. (Perhaps it is PSPACE-complete to determine c(G).) 


5.1.7 Colorings of Infinite Graphs 
FACTS 


F104: [BrEr51] For any k € N, an infinite graph G has y(G) < & if and only if every 
finite subgraph of G' is k-colorable. 


F105: [Jo94] For any k € N, an infinite graph G has ch(G) < k if and only if every 
finite subgraph of G is k-choosable. 


F106: [Bo77] If x(G) = o, then for every infinite arithmetic progression A C N, G 
contains a cycle whose length belongs to A. 


F107: [Gako91] That .(G) is well-defined for every graph G is equivalent to the set- 
theoretic Well-Ordering Theorem, and assuming that every set has a cardinality, it is 
equivalent to the Axiom of Choice. 


F108: [Ko88] The following assertion is consistent: There exists a graph G = (V, E) 
such that |V| = Nui, x(G) = Ni, and x(H) < No whenever |V(H)| < Xu. 
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Coloring Euclidean Spaces and Distance Graphs 


DEFINITIONS 


D39: The unit distance graph U” has the points of R” as its vertices; the edges are 
the pairs of points whose Euclidean distance is 1. 


D40: Given a (finite or infinite) “distance set” D = {d,,d2,...} C N, the distance 
graph G(D) has vertex set Z; two vertices i, 7 € Z are adjacent if and only if |:—j| € D. 


D41: A packing coloring of graph G = (V, E) is a vertex partition Vi U--- UV, =V 
such that, for all 1 <7 < & and for any two u,v € V; (u # v), d(u,v) > @ holds for 
their distance. The packing chromatic number (also called broadcast chromatic 
number), denoted by y,(G), is the smallest k for which G admits a packing coloring. 
FACTS 

F109: [Ha45] In the plane, y(/?) < 7. 

F110: [MoMo61] By a 4-chromatic subgraph on seven vertices, y(U*) > 4. 

F111: [Co02] In 3-dimensional space, y(U3) < 15. 

F112: A 5-chromatic subgraph on 9 vertices yields y(U3) > 5. 


F113: [LaRo72] Asn —> co, x(U”) is at most (3 + o(1))”. 


F114: [FrWi81] As n > oo, y(U") is at least (1 + 0(1)) (8)". 


F115: [Sc95] The choice number ch(U”) is countably infinite if and only if n = 2 or 
n = 3. (Infinity was first observed in [JeTo95].) 


F116: For every finite distance set D, y(G(D)) < |D| +1. 


F117: For every D with |D| = 3, x(G(D)) has been determined ([Zh02]). For k > 3, 
however, the chromatic number for k-element distance sets has not been characterized 
so far. 


F118: [EgErSk85] If D is the set of all prime numbers, then y(G(D)) = 4. Moreover, 
x(G(D \ {3})) = 3 and x(G(D \ {2})) = 2. 


F119: [Ka01, RuTuVo02, PeSc10] Let ¢1 > e2 > --- > 0 be a sequence of positive reals, 
and let D be the family of distance sets D = {d1,d2,...} such that dj41/d; > ¢; for all 
i> 1. If limp406; > 0, then x(G(D)) is finite for all those D; and if limj,~ ¢; = 0, 
then there exists a D € D such that y(G(D)) =o. 


F120: [AlKol1] For any finite set X C R’, and for any k € N, there exists a list 
k-assignment to the points of R? such that any coloring of the plane from those lists 
contains a monochromatic isometric copy of X. 


F121: The packing chromatic number of the square lattice (infinite grid) G(Z?) is 
at least 12 and at most 17 (both bounds attained by using computer programs, the 
latter published in [SoHo10]); but the triangular lattice and also G(Z”)OK2 — i.e., two 
layers of the square lattice — have infinite packing chromatic number ({FiRal0] and 
[FiK1Li09], respectively). The packing chromatic number of the hexagonal lattice is 
equal to 7, but its Cartesian product with Pg has yp = oo. 
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INTRODUCTION 


In this section we consider variants of graph coloring, and also the algorithmic com- 
plexity of the problems. Some of the concepts here (e.g., face coloring of planar graphs) 
may be viewed as equivalents of proper vertex-coloring on restricted classes of graphs, a 
perspective for which there is an extensive literature. Some other topics here are gener- 
alizations in various directions. We also include a new area in the theory of hypergraph 
coloring, which provides a general framework for many kinds of coloring problems. 

From the many interesting variants of coloring, several ones are discussed in detail 
in [ChZh09], while [Ku04] is a collection of shorter surveys. 

The current section applies several concepts introduced in $5.1. Some familiarity 
with §5.1 is assumed. (Please see the chapter glossary as needed.) 


5.2.1 Multicoloring and Fractional Coloring 
DEFINITIONS 


D1: A fractional vertex-coloring of G is a real function y* : S > R=° on the 
collection S of all independent vertex sets in G = (V, £) such that for all v Ee V 


pS) = 4 
{SES|vEeS} 


This definition extends naturally to other types of coloring (e.g., fractional edge-color- 
ing), leading in this way to the fractional versions of further graph invariants. 
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D2: The fractional chromatic number of G is 


x"(G) = min) 9°(S) 
SES 


where the minimum is taken over all fractional vertex-colorings y* of G. (The fractional 
chromatic index y'*(G) is defined analogously.) Equivalently, the fractional chromatic 
number of a graph G = (V, £) is definable as the minimum ratio p/q such that there 
exists a cover of V by p independent sets $1, S2,...,.5, (not necessarily distinct), with 
each v € V contained in precisely q of them. 


D3: For two functions f and g from V to N, with g(v) < f(v) for all v € V, the graph 
G = (V, E) is (f, g)-choosable if for every list assignment L with |L,| = f(v) there can 
be chosen subsets Cy C L, such that |C,| = g(v) for all v € V and CO, NC, = O for 
all wv € E. The functions f and/or g may be constant; e.g., the terminology “(a, b)- 
choosable” means that f(v) = a and g(v) = b for all v € V. If g(v) = 1 for every vertex 
v, we simply say that G is f-choosable. 


EXAMPLE 
El: For every t € N, the odd cycle Co:41 is (2t + 1, t)-choosable ([AlTuVo97]) and its 
fractional chromatic number is 2 + +. 
FACTS 
F1: For every graph G, we have 
w(G) <x"(G) < x(G) 
In particular, if G is a perfect graph, then y*(G) = x(G). 
F2: For every graph G = (V, E), we have ,*(G) > |V|/a(G). 


F3: For every graph G on n vertices, the value x*(G) = p/q is attained for some 
q < n"/? ((ChGaJo78]), but there exists a constant C (C > 1.34619) and infinitely 
many graphs G for which g > C” is necessary ([Fi95]). 


F4: [ReSe98] For every natural number t, if y*(G) > 2t, then some subgraph of G is 
contractible to Ky41. (Cf. Hadwiger’s conjecture in §5.1.4.) 


F5: [Lo75] For every graph G, 


x(G) < (1 + log a(G)) max 
As a consequence, y/x* is bounded above by 1 + logn (where n is the number of 


vertices). 


F6: (Zh1la] For any two graphs G and H, we have y*(G x H) = min{y*(G), x*(H)}. 
(Cf. Hedetniemi’s conjecture in §5.1.3.) 


F7: [KiRe93] For fractional colorings of simple graphs, the Total Coloring Conjecture 
(cf. §5.1.5) holds true, i.e., x/*(G) < A(G) + 2 for every graph G. 
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F8: [Ka00] The edge choice number asymptotically equals the fractional chromatic 
index, i.e., for every « > O there exists a k = k(c) such that y’*(G) > k implies 
ch'(G) < (1+ €)x’*(G), for every multigraph G. 


F9: [AlTuVo97] For every graph G, the minimum value of a/b such that G is (a, b)- 
choosable is equal to y*(G). The theorem can be generalized for list colorings of hyper- 
graphs, too ([MiTuVo99}). 


F10: (Brooks’s Theorem for unequal lists) [ErRuTa79] Suppose that G is connected, 
and that at least one of its blocks is neither a complete graph nor an odd cycle. If f(v) 
is the degree of v for every vertex v, then G is f-choosable. 


F11: For multicoloring: [TuVo96] Under the same conditions as Fact F10, G is also 
(mf,m)-choosable for all m € N. 


F12: The List Reduction method (see [TuVo97] in §5.1.1) can be applied for (km, m)- 
choosability, too. 


F13: [TuVo96a] Every 2-choosable graph is (2m,m)-choosable, for every m € N. 


OPEN PROBLEMS 

Pi: [ErRuTa79] Is every (a, b)-choosable graph (am, bm)-choosable for all m € N? 
P2: [ErRuTa79] Given any pair of graphs G and H on the same set of vertices, is 
ch(GU H) < ch(G)ch(H)? 


REMARK 


R1: An affirmative answer to Problem 1 would imply an affirmative answer on Prob- 
lem 2 also. Various particular cases of the former are proved in [TuVo96]. 


5.2.2 Graphs on Surfaces 


Historically, for more than a half century, the theory of graph coloring dealt with 
face colorings of maps, which can equivalently be interpreted as vertex-colorings of their 
dual graphs. Via duality, there is a natural correspondence between total colorings of 
graphs and simultaneously coloring the edges and faces of maps on surfaces. Adjacency 
of faces of a map means sharing an edge on their boundary, and incidence of a vertex 
or edge with a face means belonging to its boundary walk. 


DEFINITIONS 
D4: A plane graph is a planar graph together with a given imbedding in the plane. 


D5: A triangulation (in the plane or on a higher surface) is a graph imbedding in a 
surface such that all the face boundaries are cycles of length 3. 


D6: A graph is outerplanar if it has an imbedding in the plane such that all vertices 
lie on the boundary walk of the exterior face. 
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FACTS 
F14: (Five Color Theorem) [He:1890] Every planar graph is 5-colorable. 
F15: (Four Color Theorem) [ApHa77, ApHaKo77] Every planar graph is 4-colorable. 


F16:  [Ta:1880] A plane triangulation G is 4-colorable if and only if its dual G* is 
3-edge-colorable. 


F17: [Bo79] Every planar graph has a proper 5-coloring such that the union of any 
two color classes induces a forest (termed acyclic coloring). 


F18: (Grétzsch’s Theorem) [Gr59] Every K3-free planar graph is 3-colorable. 


F19: [He:1898] A planar triangulation is 3-colorable if and only if all of its vertices 
have even degrees. 


F20: Every planar graph is 5-choosable ([Th94]), but there exist non-4-choosable 
planar graphs ([Vo93]). 


F21: [KrTu94] Every K3-free planar graph is 4-choosable. 


F22: Every planar graph of girth at least five is 3-choosable ([Th95]), but there exist 
non-3-choosable /‘3-free planar graphs ([Vo95]). 


F23: [AlTa92] All bipartite planar graphs are 3-choosable. 


F24: [ChGeHe71] Every planar graph has a vertex partition into two classes where 
each class induces an outerplanar graph. 


F25: [He93] The edge set of any planar graph can be partitioned into two outerplanar 
graphs. 
REMARKS 


R2: A simpler proof to the Four Color Theorem is given in [RoSaSeTh97], but so far 
no proof without the extensive use of a computer is known. 


R3: A very short proof of Grotzsch’s Theorem can be derived from the lower bound 
Sn=2 on the number of edges in a 4-critical graph of order n ([KoYa12]; see Fact F31 in 


§5.1.3). 


Heawood Number and the Empire Problem 


DEFINITION 
D7: The Heawood number of a closed surface S of Euler characteristic € is 
7+ /49 — 24e 
H(e) = <5 


More generally, for every natural number m we write 


Hlem) = ett J(6m +1)? 2) 
ie 2 
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FACTS 


F26: [He:1890] If a surface S has Euler characteristic € < 2, then the connected regions 
of any map drawn on S — or equivalently, any graph imbedded in S — can be properly 
colored with at most H(e) colors. 


F27: [Fr34] Every graph drawn on the Klein bottle is 6-colorable. 


F28: [RiYo68] On any other surface S except the Klein bottle, the maximum chromatic 
number of graphs imbeddable in S' is equal to H(e). 


F29: [Di52] (—1 #4 « £1), [AlHu79] (e = —1 or 1) If S is a surface of Euler characteristic 
€ < 2, other than the Klein bottle, then every H(e)-chromatic graph imbedded in S$ 
contains K’7;(-) as a subgraph. 


F30: [Bo6MoSt99] With the possible exception of « = —1, every graph G imbedded in 
a surface S of Euler characteristic € with ch(G) = H(e) contains the complete graph 
Kae) ; 


F31: [AlMoSa96] As € > oo, every graph imbedded in a surface with Euler characteris- 
tic € admits an acyclic coloring with at most O(e4/”) colors (and at least Q(€4/7/ log” e) 
colors are necessary for some graphs). 


F32: [{He:1890] If each country on a surface of Euler characteristic € consists of at most 
m connected regions (m > 2), then the countries can be colored with at most H(e,m) 
colors so that any two neighbor countries are colored differently. (For planar maps this 
means 6m colors.) 


F33: [JaRi84] For every m > 2, there exist planar maps with countries of m regions 
each, where 6m colors are necessary. 


F34: H(e,m) colors are necessary for every m > 2 on the torus (H. Taylor in [Ga80]), 
on the projective plane ({JaRi83]) and on the Klein bottle ([JaRi85] for m > 3 and 
[Bo89] for m = 2). 
OPEN PROBLEM 


P3: (Empire Problem) For which surfaces S and for which values of m do there exist 
maps on S with at most m connected regions in each country (here called an empire or 
an m-pire), such that H(e,m) colors are necessary for a proper coloring of all countries? 


Nowhere-Zero Flows 


DEFINITION 


D8: Let G = (V,E) be an oriented multigraph and k > 2 an integer. A nowhere-zero 
k-flow is a function ¢: E > {1,2,...,k — 1} such that 


S> d(uv) = Y> d(vw) 
uve vweB 


holds for every vertex uv € V. 
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FACTS 


F35: [Tu54] A plane graph G is k-colorable if and only if its planar dual G* admits 
an orientation with a nowhere-zero k-flow. (The analogous property holds for a graph 
imbedded in any orientable surface.) 


In particular, the Four Color Theorem is equivalent to the assertion that every planar 
graph without cut-edges has a nowhere-zero 4-flow, and Grétzsch’s theorem asserts (in 
dual form) that every 4-edge-connected planar graph has a nowhere-zero 3-flow. 


F36: [Tu50, Mi67] A 3-regular multigraph is bipartite if and only if it has a nowhere- 
zero 3-flow, and it is 3-edge-colorable if and only if it has a nowhere-zero 4-flow. In 
particular, the former assertion generalizes the fact that the skeleton (i.e., the graph) of 
every Eulerian planar triangulation is 3-colorable. 


F37: [Se81] Every graph without cut-edges has a nowhere-zero 6-flow. 


F38: [LoThWuZh12] Every 6-edge-connected graph has a nowhere-zero 3-flow. 


CONJECTURES 


C1: 5-flow conjecture [Tu54] Every graph without cut-edges has a nowhere-zero 
5-flow. 


C2: [Tu54] Every 4-edge-connected graph has a nowhere-zero 3-flow. 


Chromatic Polynomials 


DEFINITION 


D9: The chromatic polynomial P(G,.), A € N, of graph G = (V, F) is the function 
whose value at A (A = 1,2,3,...) is the number of proper colorings y : V > {1,...,A} 
of G with at most \ colors. Here, two colorings are counted as different even if they 
yield the same color classes by renumbering the colors. 


EXAMPLE 


E2: The chromatic polynomials of the edgeless graph and the complete graph on n 
vertices are, respectively, 


PER AeaXx” and PUK wa) = (“nt = A= 2) (=n 41) 


FACTS 


F39: (Deletion-Contraction Formula) For every graph G = (V,£) and every edge 
e € FE, we have P(G,A) = P(G-—e,A) — P(G/e,), where ‘—e’ and ‘/e’ mean the 
deletion and contraction of edge e, respectively. 
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F40: [Bil2] If the graph G has n vertices, then P(G,A) is a polynomial of degree n 

in A, with integer coefficients, and .(G) is the smallest natural number \ such that 

P(G,A) £0. 

F41: (Golden Identity) [Tu70] If G is a planar triangulation on n vertices, then 
P(G,7 +2) = (7 +.2)7°""°P?(G,7r +1) 


where T = denotes the golden ratio. 


+75 
2 


REMARK 


R4: Although one can deduce from the Golden Identity that P(G,7 + 2) > 0 holds 
for every planar triangulation (7 + 2 = 3.618...), this does not seem to lead closer to a 
computer-free proof of the Four Color Theorem. (As a matter of fact, P(G,7 +1) is 
nonzero for every connected graph G.) 


5.2.3 Some Further Types of Coloring Problems 


Variants of Proper Coloring 


We briefly mention some further coloring concepts, most of them with only a few refer- 
ences. 


DEFINITIONS 


D10: A Grundy coloring of a graph is a proper vertex-coloring y : V + N such that 
every vertex vu has a neighbor of color 7 for all 1 <i < y(v). The Grundy number of 
a graph (also called Grundy chromatic number and online chromatic number) 
is the largest number of colors in a Grundy coloring. 


D11: An achromatic coloring of a graph is a proper vertex-coloring such that each 
pair of color classes is adjacent by at least one edge. The largest possible number of 
colors in an achromatic coloring is called the achromatic number. 


D12: A b-coloring of a graph is a proper vertex-coloring such that each color class 
contains a vertex adjacent to some vertices in every other color class. The b-chromatic 
number of a graph is the largest number of colors in a b-coloring. 


D13: A \-coloring (also called a radio coloring or an L(2,1)-labeling) is a vertex- 
coloring y: V > {0,1,...,&} (ie., & +1 colors may be used) such that if wv € E then 
|p(u) — p(v)| > 2, and if vertices u and v have a common neighbor then y(u) 4 y(v). 
More generally, for integers r;) > --- > rq > 1, an L(ri,...,ra)-labeling is a vertex- 
coloring y : V + {0,1,...,k} such that, if two vertices u,v are at distance d(u,v) = 
i < d apart, then |y(u) — y(v)| > ri. The difference between the largest and smallest 
color is called the span of the coloring. 


D14: Given a set T C {0,1,2,...}, a T-coloring is a mapping y : V > N such that 
|\y(u) — y(v)| € T for all edges uv € E. 
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D15: A circular C-coloring is an assignment of a real number y(v) (with 0 < y(v) < 
C) to each vertex v, in such a way that if wv is an edge then (y(v) — p(w) modulo C) > 1. 
The circular chromatic number x.(G) of G (called star chromatic number in 
early papers) is the minimum C for which G has a circular C-coloring. 


D16: An harmonious coloring of a graph is a partition of the vertex set into 
independent sets such that the union of any two induces at most one edge. 


D17: A rainbow connection of a connected graph G is an edge coloring such that 
any two vertices are joined by a path on which no color occurs more than once. A 
state-of-the-art account on this subject is given in [LiSu12]. 


FACTS 


F42: The smallest possible number of colors in a Grundy coloring of a graph G is just 
x(G). On the other hand, the algorithmic complexity of finding the Grundy number 
— the largest number of colors — has not been determined so far. See, e.g., [ChSe79], 
[HeHeBe82]; and see also [Si83] with a different terminology. 


F43: It is NP-hard to determine the achromatic number, even for trees ((CaEd97]), 
but polynomial-time solvable for trees of bounded degree ({CaEd98]). It is also hard to 
approximate the achromatic number on a general input graph within a factor 2 — ¢, for 
any € > 0 ([KoKr01)}). 


F44: [HaReSe12] For any positive integer p there exists a constant A, such that every 
graph with maximum degree A > A,, has an L(p,1)-labeling with span at most A?. (It 
remains an open problem to prove or disprove that every graph with maximum degree 
A admits an L(2,1)-labeling with span at most A?, for all A > 2, conjectured by Griggs 
and Yeh [GrYe92].) 


F45: For every graph G, the circular chromatic number satisfies y(G) — 1 < x-(G) < 
x(G). In this way, y- is a refinement of y, and y is an approximation of y.. Both bounds 
are tight: If G is uniquely colorable, then y-(G) = x(G) ([Zh96]); if G is k-critical and 
has girth g, then y.(G) < x(G) —1+1/s, where s = |(g — 1)/(k — 1)], ie., for fixed 
chromatic number, large girth makes a critical graph have y, close to y — 1 ({[StZh96]). 


F46: The values of circular chromatic number form a dense set: Among the rational 
numbers r, for any r > 2 and for any integer g there is a graph with girth at least g 
and with x. = r ([StZh96]); for any 2 < r < 4 there is a planar graph with y. = r 
([Mo97, Zh99]); and for any 2 <r <n-—1 there is a K,,-minor-free graph with y. =r 
({LiPaZh03}). 


REMARKS 


R5: We refer to [IrMa99] and [KrTuVo02] for various estimates and complexity results 
regarding b-colorings (e.g., hardness on connected bipartite graphs). 


R6: For results and further references on A-colorings, see the surveys [Ye06], [Grkr09], 
and [Cal]]. 
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R7: For results on T-colorings, see, e.g., [Te93], [Wa96], and [ChLiZh99]. It is worth 
noting that in the list coloring version of T-coloring, the case 0 ¢ T leads to interesting 
questions. 


R8: Results on circular colorings are surveyed in [Zh01]. The digraph version of the 
concept has also been introduced; see, e.g., [BoFiJuKaMo04]. 


R9: The goal of harmonious coloring investigations is to minimize the number of vertex 
classes. See [Ed97] for a survey. 


Graph Homomorphisms 


DEFINITION 


D18: A homomorphism from a graph G = (V,£) to a graph H = (X,F), where 
loops may occur in both G and H, is a vertex mapping y: V + X such that 


p(u)y(v) € F for alluve€ E 


A homomorphism into the complete graph K;, can be viewed as a proper k-coloring and 
vice versa. Moreover, a homomorphism into H is also called an H-coloring of G. 


D19: A list homomorphism from G = (V,E) to H = (X, F) is a homomorphism 
y with the further constraint that each v € V has a prescribed list L, C X such that 
p(v) € Ly. 


NOTATION: The notation G + H means that G has at least one H-coloring, and G A H 
denotes that G is not H-colorable. The concept is extended to digraphs in a natural 
way (i.e., where uv and y(u)y(v) are ordered pairs). 


FACT 


F47: [NeTa99] Let G; and G2 be graphs such that y(Gz2) > 3 and G; > Gp» but 
G2 # G,. Then there exists a graph G with G; ~ G—> Gz and G2 AGA G. 


For complexity results on H-coloring, see §5.2.5. Detailed discussion and references can 
be found in the book [HeNe04] and in the surveys [He03], [HeNe08], and [He12]. 


Coloring with Costs 


DEFINITIONS 


D20: A cost set C = {c1,c,...} associates a cost c; > 0 with each color i. It is 
assumed without loss of generality that 0 < c, < co <-:-, and also that |C| > x(G). 


D21: Given a graph G = (V, EF) and a cost set C, the cost of a coloring y: V > N 
is the sum ) cy Cy(v). We denote by Uc(G) the smallest possible cost of a proper 
vertex-coloring y of G. If C =N, this notion simplifies to U(G) := ming Yo ,cy y(v) 
and usually is called the chromatic sum or color cost of G. 
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D22: The cost chromatic number of G with respect to a cost set C is the smallest 
possible number of colors in a minimum-cost coloring. If C = N, this parameter is often 
called the strength or chromatic strength of G. 


FACTS 
F48: [ThErAlIMaSc89] If G is connected and has m edges, then 


[vam] < 26) < [Fm+1) 


F49: [MiMo97] For every finite cost set C there exists a tree T’ whose cost chromatic 
number is equal to |C\. 


F50: [Tu90] For every s > 2, the minimum number of vertices in a tree of strength s 
equals ((2 + /2)*-1 — (2— /2)*-!)/./2. Moreover, for every s > 3, there exist precisely 
two trees of strength s, which are minimal in the sense that every tree of strength at 
least s is contractible to at least one of them. 


F51: [MiMo97] Every tree of maximum degree A has cost chromatic number at most 
[A/2] +1. 


F52: The bound in Fact F51 is tight for every A with the cost set C = {1,1.1,1.11,...} 
({MiMoSc97]) and also with C = N ([JiWe99]). 


REMARK 


R10: Beside the chromatic sum, various notions concerning coloring with costs have 
been motivated by scheduling problems. In some of them, given numbers of colors have 
to be assigned to the vertices. For results of this type, see, e.g., [Ma02]. 


Vertex Ranking 


DEFINITION 


D23: A vertex ranking of graph G = (V,£) is a (necessarily proper) coloring 
yp:V—->N with the property that for any two vertices u,v of the same color, every 
u-v path contains some vertex z with y(z) > y(u). The smallest possible number of 
colors, called ranking number, will be denoted by x;(G). In the directed analogue 
for digraphs, the requirement is put on directed paths only. 


EXAMPLE 


E3: [BoDeJaKIKrMiTu98] The line graph of K;, (n > 2) has ranking number $(n? + 
g(n)), where the function g(n) is defined recursively with g(1) = —1, g(2k) = g(k), and 
g(2k +1) = g(k +1) +& for every natural number k. (No closed formula is available.) 


FACTS 


F53: For every graph G, col(G) < x,(G), and if the ranking number is at most k, then 
the graph is (km,m)-choosable for every m € N ([TuVo96]). 
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F54: [BoDeJaKIKrMuTu98] If x,(G) = x(G), then also w(G) = x(G). Moreover, 
Xr(H) = w(H) holds for every induced subgraph H of G if and only if G contains no 
P, and Cy as an induced subgraph. 


Non-Repetitive (Thue) Colorings 


DEFINITION 


D24: A non-repetitive coloring of a graph G = (V,£) is a vertex coloring y 
such that, for all r > 1, no path v,v2...v2, with an even number of vertices satisfies 
p(u;) = v(v,4;) for all l<i<r. 


D25: The Thue chromatic number of G, denoted by 7(G), is the smallest & such 
that G has a non-repetitive k-coloring. 
FACTS 


F55: [Th06] If P, is a path on n > 4 vertices, then 7(P,,) = 3. (For cycles, 7(C,) = 4 
for n = 5,7,9,10, 14,17, and 7(C,,) = 3, otherwise [Cu02].) 


F56: The value of 7(G) is finite for graphs of maximum degree at most A [AlGrHaRi02] 
(the largest value is between eas and cA?) and for graphs of treewidth at most w 
[KiPe08] (then 7(G) < 4”). 


F57: [BaWo08] Every graph has a subdivision which is non-repetitively 3-colorable. 


CONJECTURE 


C3: The value of 7(G) is bounded on the class of planar graphs. 


REMARK 


R11: For planar graphs of order n the currently best upper bound seems to be O(log n) 
({[DuFrJoWol12]). Further results and conjectures are surveyed in [Gr07]. 


Partial Colorings and Extensions 


Partial colorings and their extensions are of interest in several aspects: from the view- 
point of theory and also as a proof technique. We mention here some concepts with 
only a few references. 


DEFINITIONS 


D26: A partial coloring of graph G = (V, F) on a vertex subset W C V is a coloring 
pw: WN. 


D27: In the Precoloring Extension problem, abbreviated PrEat, we are given a 
graph G, a color bound k, and a proper partial coloring yw. The question is whether 
yw can be extended to a proper k-coloring of the entire G. 
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D28: For a nonnegative integer t, the problem t-PrEzt is the restricted version of 
Precoloring Extension where the given partial coloring uses each color at most ¢ times. 
(Hence, 0-PrExt is an equivalent formulation of asking whether the graph in question 
is k-colorable.) 


D29: In an on-line coloring the vertices v1, v2,...,Un of graph G are received one 
by one in some unknown order. When v; appears, we also get the information which its 
neighbors in {v,,...,0;-1} are. A color has to be assigned to v; without any information 
on its adjacencies to the vj, i<j <n. 


FACTS 


F58: If G = (V, E) is k-colorable, and t < k is a positive integer, then G has a proper 
partial coloring on at least ¢|V| of its vertices. 


F59: [AlGrHa00] If G is a k-colorable graph with n vertices, then for every list t- 


assignment it has a partial proper list coloring on at least (1 — (44)') n vertices. 


F60: If G is a k-choosable graph with n vertices, then it has a partial list coloring on 
more than $ n vertices, for every list t-assignment (1 < t < k) [Ch99]. Moreover, if 
G has maximum degree k or the union of lists contains at most k colors, then at least 
tn/k vertices can be colored from their lists [Ja01]. (It is conjectured in [AlGrHa00] 
that this lower bound is valid for all G and all t-assignments.) 


F61: (Cf. §5.1.5) If T is a tree of radius 2, then there exists an on-line y-binding 
function on the class of graphs not containing T as an induced subgraph ([KiPeTr94]), 
but the class of induced-P,-free graphs is not on-line y-bound ([GyLe91)). 


REMARKS 


R12: In several graph classes, efficiently testable necessary and sufficient conditions 
can be given for the extendability of partial colorings. Details can be found in [HuTu96]. 


R13: Results on on-line coloring are surveyed in [Ki98a]. 


R14: An on-line version of list coloring is paintability. It can be described as a two- 
person game played on a graph G = (V, F) where each vertex v € V has an initial bound 
f(v) € N, which corresponds to f-choosability in the off-line setting. In each round, 
Alice marks a subset of vertices having positive current f-value, and Bob removes an 
independent subset of the marked ones; the marked but non-removed vertices decrease 
their f-value by 1 and erase the marking. Alice wins if some f-value decreases to zero 
in some step; Bob wins if all vertices get removed. Graph G is k-paintable if Bob has 
a winning strategy on G when f(v) = k holds for all v € V. Every k-choosable graph is 
k-paintable, but the converse implication is not valid. Some sufficient conditions on list 
colorability can be extended for paintability, too; e.g., the kernel method is a sequential 
selection of monochromatic subsets and can easily be interpreted in the on-line scenario. 
Further major results on list colorings can also be generalized to paintability; see, e.g., 
the seminal papers [Sc09] and [Sc10]. 
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R15: An on-line version of Precoloring Extension can be presented as a game where 
an uncolored graph G and a color bound & are given, and two persons alternately color 
the vertices so that a proper partial coloring is obtained after each step. The game 
ends when no legal move is available. In the Achievement game the person making 
the last move wins. In the Avoidance game the person making the last move loses. 
Although these games were introduced two decades ago ([HaTu93]), still very little is 
known about them. 


R16: The game chromatic number y,(G) of graph G = (V,£) is the smallest 
number & of colors for which the first player can force to color the entire vertex set 
V, in the game where two players alternately color the vertices properly. (It follows, 
in particular, that if |V| is odd, then the first player has a winning strategy in the 
Achievement game, and if |V| is even, then he/she can win the Avoidance game.) It is 
known, for example, that x,(7’) < 4 for every tree T ([FaKeKiTr93]), and the analogous 
upper bound ch,(T’) < 4 remains valid in the list coloring version of the game, too (i.e., 
where the color of every v € V has to be chosen from a given list L,). While the 
graphs with y,(G) = 2 are the star forests, it is already a little more complicated to 
characterize the graphs with ch,(G) = 2 ([BoSiTu07]). Some strategies for the coloring 
game to obtain bounds on x, are surveyed in [BaGrKiZh07]. 


Coloring Cubic Graphs with Triple Systems 
DEFINITIONS 


D30: A Steiner triple system is a pair S = (X,B) where X is an underlying set 
called the set of points and B is a collection of 3-element subsets of X, called blocks, such 
that any two x,x’ € X occur together in precisely one block B € B. Less restrictively, 
a partial triple system (X,8) with a collection B of 3-element subsets of X requires 
that any two x2,2’ € X are contained in at most one B € B. Trivial system means 
|X| = 3 and B= {XxX}. 


D31: For a partial triple system S = (X,B), an S-coloring of a cubic (i.e., 3-regular) 
graph G = (V, F) is an assignment ¢: E + X such that the colors of any three edges 
meeting at a vertex form a block of S. 


D32: A k-line Fano coloring is one that uses at most k different blocks (lines) as 
color patterns around the vertices. (The Fano plane is the smallest nontrivial Steiner 
triple system, which has seven points and seven blocks.) 


D33: The Cremona-Richmond configuration is a symmetrical configuration R of 
fifteen points and fifteen lines such that each line contains three of the fifteen points 
and each point lies on three of the fifteen lines. Combinatorially, it can be represented 
by the partial triple system consisting of fifteen points corresponding to the 2-element 
subsets of the set {1,2,3,4,5,6} and the fifteen “perfect matchings,” i.e., blocks formed 
by triples of points that cover all the six elements of {1, 2,3, 4,5, 6}. (The configuration 
was discovered in the context of real projective geometry in the 19th century and can 
be realized in the 4-dimensional projective space over R.) 
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FACTS 


F62: [HoSk04] Every bridgeless cubic graph has an S-coloring for every nontrivial 
Steiner triple system S. 


F63: [MaSk05] Every bridgeless cubic graph G has a 6-line Fano coloring. Moreover, 
the bridgeless cubic graphs have a 4-line Fano coloring if and only if they contain 
three perfect matchings with no edge in common. That is, the existence of 4-line Fano 
colorings is equivalent to the Fan-Raspaud conjecture. 


F64: [KrMaPaRaSeSk09] A bridgeless cubic graph has a coloring with the blocks of the 
Cremona-Richmond configuration if and only if it has a family of six perfect matchings 
such that each edge belongs to precisely two of those matchings. Hence, the bridgeless 
cubic graphs are R-colorable if and only if Fulkerson’s conjecture ({[Fu71]) is valid. 


Neighbor-Distinguishing Colorings 
REMARK 


R17: There are many variations of colorings with the aim to distinguish between 
vertices: vertex and edge colorings, with or without lists, distinguishing all vertices or 
only the adjacent ones, assuming or not assuming that the coloring is proper, considering 
for distinction the set or the multiset of colors in the neighborhood, using abstract colors 
or natural numbers or elements of a group, etc. For example, it is proved in [GyPa09] 
and [HoSo10] that if G is a connected graph with .(G) > 3, then exactly [log y(G)]+1 
colors are needed as the minimum in a (non-proper) edge coloring such that, for any 
two adjacent vertices u,v in G, there is a color incident with precisely one of wu and v 
(i.e., the color sets on their stars are distinct). For bipartite graphs, two or three colors 
suffice in a coloring with this property ([GyHoPaWo08}). 


Maximizing the Number of Colors 


REMARKS 


R18: Coloring constraints may be of opposite type as well. For example, if for each 
vertex uv the number of distinct colors occurring in the neighborhood of v is bounded 
from above, the parameter of interest is the largest possible number of colors which can 
be achieved on the entire vertex set. Results of this kind are surveyed in [BuTul1]. See 
also mixed hypergraphs and their generalizations in §5.2.4 below. 


R19: A pair of vertex and edge colorings, of interest for a special reason, is as follows. 
A 3-consecutive vertex coloring of graph G = (V, £) is a mapping gy: V > N such 
that for any 3-vertex path P = (v1, v2, v3) in G, the middle vertex vp has the same color 
as one (or both) of the ends v1, v3 of P. Similarly, a 3-consecutive edge coloring of 
G is a mapping ¢: E > N such that for any three consecutive edges €1, €2,e3 (forming 
a cycle C3 or a path P, in this order), the middle edge eg has the same color as one 
(or both) of e; and e3. (The conditions force every triangle to be monochromatic in 
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both y and ¢.) As indicated already in §5.1, these colorings are very special in the 
following sense. If G has minimum degree at least two, then there is a bijection between 
the 3-consecutive vertex colorings of G and the 3-consecutive edge colorings of its line 
graph L(G) ([BuSaTuDoPul2]). This is the opposite of the standard correspondence 
between edge colorings of G and vertex colorings of L(G), which is valid for many kinds 
of colorings. Since both y and @¢ are allowed to make the entire G monochromatic, 
the relevant parameter to consider for 3-consecutive colorings is the maximum possible 
number of colors. 


R20: The maximum number of colors — as well as colorability — is of interest in 
the context of interval edge coloring, too. For a graph G = (V,£), an interval edge 
coloring is a proper edge coloring ¢: E — N such that, for each v € V, the colors on 
the edges incident with v form an interval of consecutive integers. Recent bounds and 
further references can be found in [KaPel2]. 


Partitions with Weaker Requirements 


REMARKS 


R21: There are many papers dealing with vertex- or edge-partitions into parts that are 
not necessarily independent, but satisfy some weaker properties. Usually it is assumed 
that the property to be satisfied in each part is hereditary or induced-hereditary, i.e., if 
it holds for a graph H then it also holds for all (induced) subgraphs of H. A detailed 
discussion on the general theory can be found in the survey [BoBrFrMiSe97]. 


R22: Sometimes conditions are imposed on the vertex degrees in each partition class. 
For results and references on this kind of problem, see, e.g., the surveys [Wo01] and 
[BaTuVa10]. 


5.2.4 Colorings of Hypergraphs 


Beside some results on the coloring of finite set systems (hypergraphs), here we also 
mention the basic definitions and a few facts from the recently fast-developing theory 
of mixed hypergraph coloring (and beyond). See the informative monograph [Vo02] for 
a detailed account, and the survey [BaBuTuVol10] for open problems. 


DEFINITIONS 


D34: A hypergraph H = (X,F) has vertex set X; its edge set F consists of subsets 
of X. We assume that F 4 @ and that |F| > 2, for all F € F. 


D35: <A proper vertex k-coloring of a hypergraph H is a mapping y : X > 
{1,2,...,k} such that no edge of H is monochromatic. Equivalently, it is a vertex 
partition into & classes such that no color class contains any edge. 


D36: A proper edge-coloring of a hypergraph H is an edge partition such that 
the edges in the same class are mutually vertex-disjoint. 
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D37: The chromatic number x(H) and chromatic index y'(H) are the smallest 
numbers of colors in a proper vertex and proper edge-coloring, respectively. List col- 
oring, choice number, choice indez, (a,b)-choosability, etc., can be defined for 
hypergraphs analogously. 


D38: A hypergraph is r-uniform if every edge has precisely r vertices. 


D39: The complete r-uniform hypergraph of order n (n > 1), denoted K7,, has 
|X| =n, and its edge set consists of all the r-element subsets of X. 


D40: A mixed hypergraph H = (X,C,D) has vertex set X, and two types of edges: 
the C-edges in C and the D-edges in D, respectively. It is called a bi-hypergraph if 
C =D, a C-hypergraph if D = 0, and a D-hypergraph if C = @. We shall assume 
that at least one of C and D is nonempty, and also that every (C- and D-) edge has at 
least two vertices. 


D41: A strict k-coloring of a mixed hypergraph is a vertex-coloring with exactly k 
colors, such that every C-edge has two vertices with a common color and every D-edge 
has two vertices with different colors. (In this way, the D-hypergraphs are just the 
hypergraphs in the usual sense with respect to proper vertex coloring.) 


D42: <A mixed hypergraph is said to be colorable if it admits at least one strict 
coloring, and uncolorable if it doesn’t. 


D43: Ifa mixed hypergraph H is colorable, then the smallest and largest number of 
colors in a strict coloring is called the lower and upper chromatic number, denoted 
x(H) and X(H), respectively. 


D44: A mixed hypergraph H is said to be uniquely colorable if y(H) = X(H) and 
H. has only one strict coloring (apart from renaming the colors). 


D45: More general models for hypergraph coloring are the color-bounded and stably 
bounded hypergraphs. The latter means a six-tuple H = (X,F,s,t, a,b), where 
(X,F) is a hypergraph, while each of the four color-bound functions s, t, a, b assigns 
positive integers to the edges. A proper vertex coloring of H requires that for each 
edge F' € F the number of distinct colors assigned to the vertices of F' is at least s(F’) 
and at most ¢(F’); moreover, the cardinality of the largest monochromatic subset inside 
F is between a(F’) and b(F’). The color-bounded hypergraphs are just the four-tuples 
H = (X,Ff, s,t). The concepts of colorability, upper chromatic number, etc., extend to 
color-bound and stably bound hypergraphs, too, in a natural way. 


EXAMPLES 


E4: If H is a C-hypergraph, then y(H) = 1, since the entire vertex set may be colored 
with the same color, and if H is a D-hypergraph, then ¥(H) = |X|, because its vertices 
may get mutually distinct colors. 


E5: The complete hypergraph (viewed as a D-hypergraph) has x(K7,) = [47], and 
when viewed as a C-hypergraph, it has x(K7) = r-—1. IfC = K®, D = K4, and 
n > (p—1)(q—1), then H is uncolorable. 
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E6: [ErLo75] Let X = X,U---UX,, with |X;| = 7 for all 1 <i <r, and let an 
r-element set be an edge if and only if for some 7 it contains X; and has precisely one 
vertex in every X; with i < j <r. This r-uniform hypergraph is not 2-colorable, for all 
r> 2. 


E7: [TuVo00] The following mixed hypergraphs H = (X,C,D) are uncolorable: starting 
from a k-chromatic graph G = (V, EF), set X = V, D = E, and let a k-subset Y C V be 
a C-edge if the subgraph of G induced by Y has a hamiltonian path. 


E8: [KoKi01] The following planar hypergraph H = (X,C,D) has strict color- 
ings with two and four colors, but not with three colors: X = {a,b,c,d,e,f}, C = 
{abe, bce, bcf ,cdf}, D = {ab, bc, cd, da, af , de}. 


FACTS 


F65: [Er64] For every r > 2,there exists a non-2-colorable r-uniform hypergraph with 
fewer than r? 2"*+ edges. 


F66: [RaSr00] For sufficiently large values of r, every r-uniform hypergraph with at 
most 0.7,/r/Inr 2” edges is 2-colorable, and efficient algorithms can also be designed 
to find a proper 2-coloring. (The previous bound r3~* 2” is given in [Be78].) 


F67: [AlKol1] For every r,k € N there is a d,(k) such that if an r-uniform hypergraph 
has average degree at least d,(k) and any two of its edges share at most one vertex, 
then it is not k-choosable. 


F68: (See [Lo68, NeR679] in §5.1.4.) For every triple of integers r,k, g > 3, there exists 
an r-uniform hypergraph with chromatic number at least k and girth at least g. 


F69: [KrVu01] For every positive integer r and for every real-valued function p = p(n) 
with 
n-@-W/Arte <p < 0.9 (€ > 0 fixed) 
the random r-uniform hypergraph with n vertices and edge probability p = p(n) has 
ch(Hn ») = (1+ 0(1))x(Hn,p) with probability 1 — o(1) 
Also, if p> Cn'~" for a sufficiently large constant C, then 
ch(Hn»)/x(Hnp) = (1+ 5p) r/"-Y almost surely 
where 6, + 0 as n”~!p > ov. 


F70: [KoSt00] If a hypergraph on n vertices is (k + 1)-chromatic, critical, and does 
not contain any 2-element edges, then the number of its edges is at least (k — 3/Wk) n. 


F71: [BuTu09] For every fixed k > 2, and also for k = 0(n'/3), the minimum number of 
2 


edges in a k-uniform C-hypergraph of upper chromatic number k — 1 is (1+0(1))?(,,",) 
as n — oo. In other words, this is the minimum number of edges in a k-uniform 
hypergraph whose edges cover all vertex partitions with k non-empty classes. For k = 3 
the exact minimum is [n(n — 2)/3] ({[DiLiRaZh06], [ArTe07]) and for k = 2 it isn—1 
(tree graphs). For k = n — 2 the minimum is equal to the smallest number of edges in 
a graph whose complement has girth at least 5; i.e., it is the complementary value of 
the Turan function ex(n, {C3,C,}), whose determination is a famous open problem in 


extremal graph theory. 
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F72: [TuVo00] The largest possible value of ¥(C) — y(D) in uncolorable mixed hyper- 
raphs H = (X,C,D) with |X| =n vertices is equal to n — 4. 


F73: [TuVo00] The List Coloring problem can be reduced to the colorability problem 
of mixed hypergraphs, in the following sense. For every graph G = (V, E) and for every 
list assignment L on G, there can be constructed (in linear time) a mixed hypergraph 
H on |V| + (See L,| vertices such that H is colorable if and only if G is L-colorable. 


F74: [TuVoZh02] Every colorable mixed hypergraph is the induced subhypergraph 
of a uniquely colorable mixed hypergraph. More generally, given a colorable mixed 
hypergraph H and two of its coloring partitions, say P with t classes and P’ with t’ 
classes such that P’ is a refinement of P (i.e., every class in P’ is a subset of some class 
in P), there exists a mixed hypergraph H’ with x(H’) =t and X(H’) = t’ that contains 
H. as an induced subhypergraph; moreover, 1 has just one coloring with ¢ and also with 
t’ colors, and those two colorings induce P and P’ on H. 


F75: [JiMuTuVoWe(02] For every finite set S C N\{1} there exists a mixed hypergraph 
H such that H admits a strict k-coloring if and only if k € S. Moreover, for every r > 3 
and for every S = S’US” with S’ CN\{1,2,...r—1}, and S” =@ or S” = {£,...,r—1} 
for some 2 < @< r—1, there exists an r-uniform bi-hypergraph H such that H admits 
a strict k-coloring if and only if k € S ([BuTu08}). 


F76: [Kr04] For every finite sequence (11, 1r2,...,7¢) of nonnegative integers with r; = 0 
and r¢ > 0, there exists a mixed hypergraph H such that x = @, and for each 1<k< @ 
there are exactly r; different strict k-colorings of H apart from renumbering of colors. 
(For the cases where r; € {0,1} holds for all k, the smallest possible number of vertices 
is investigated in a series of papers starting with [ZhDiWa12].) 


F77: ([KrKrPrVo06]; A. Niculitsa) If each edge of a mixed hypergraph H induces a 
subtree in some fixed tree, then the set of values & for which H. has a strict k-coloring 
consists of consecutive integers. 


F78: As a function of k € N, the number of proper colorings y : X — {1,...,k} of 
a colorable stably bounded hypergraph H = (X,F, s, t, a, b) with at most k colors is a 
polynomial of degree ¥(H). (The subsets of the color-bounding functions s, t, a, b gen- 
erate subclasses of stably bounded and mixed hypergraphs, to which the corresponding 
subclasses of chromatic polynomials are associated in the natural way. Those subclasses 
of polynomials are partially ordered under inclusion; the Hasse diagram of this poset is 
determined in [BuTu07].) 


REMARKS 


R23: Starting with [MiTu97], several papers study the strict colorings of Steiner 
systems (block designs). It is not known, for example, whether there exists an infinite 
family of Steiner quadruple systems S such that the bi-hypergraphs with C= D=S 
can have any large upper chromatic number. For a survey, see [MiTuVo03}. 


R24: The perfectness of mixed hypergraphs is introduced via the comparison of in- 
dependence number and upper chromatic number, since ¥(#) cannot exceed the inde- 
pendence number of (X,C) for any H = (X,C,D). The characterization problem of 
C-perfect hypergraphs, raised in [Vo95al, is still open, even in the 3-uniform case. For 
subtrees of a tree, C-perfectness is characterized ({BuTul0]), with several algorithmic 
consequences, as well. 
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R25: In a very general model of hypergraph coloring, called pattern hypergraph, the 
allowed color partitions can be specified locally for each edge ([DvKdéKrPal0]). 


Clique Hypergraphs 
DEFINITION 


D46: The clique hypergraph of graph G = (V,E) has the same vertex set V; a 
subset H C V of cardinality at least two is considered to be a hyperedge if it induces a 
maximal complete subgraph in G. Its chromatic number will be denoted by yc(G). 
OPEN PROBLEM 

P4: [DuSaSaWo91] Does there exist a constant & such that xc(G) < k for every perfect 
graph G? 

FACTS 


F79: [BaGrGyPrSe04] The following upper bounds are valid on yc(G) for every graph 
G: the domination number plus one, the independence number unless G is a complete 
graph or G = Cs, and 2,/n (on n vertices). 


F80: xc(G) < 2 holds for comparability graphs ([DuSaSaWo91]), claw-free graphs 
without induced odd cycles longer than 3, and also the complements of such graphs 
({[BaGrGyPrSe04]). 


F81: [DukiTr91] For the complements of comparability graphs, yco(G) < 3. 


F82: [MoSk99] If the graph G is planar, then yc(G) < 3, and the clique hypergraph 
is 4-choosable also if G is imbeddable in the projective plane. 


5.2.5 Algorithmic Complexity 


FACTS 


F83: Bipartite (i-e., 2-colorable) graphs can be recognized and properly 2-colored in 
linear time. 


F84: The 2-choosable graphs can be recognized in linear time (by the structural 
characterization theorem in [ErRuTa79]). 


F85: [Ma68, FiSa69] The coloring number (see §5.1.2) can be determined in polynomial 
time. 


F86: [Zh11] It is NP-complete to decide whether y(G) < col(G) holds for a generic 
input graph G. 
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F87: [Ka72] For every k > 3, it is NP-complete to decide whether y(G) < k. Also, it 
is NP-complete to decide whether a planar graph of maximum degree 4 is 3-colorable 
([GaJoSt76]). 


F88: [KoYal2] Let k > 4 and G a graph. If each subgraph H C G has fewer than 
Aya |V(H)| — ane edges, then G can be properly (k — 1)-colored in polynomial 
time, in at most O(k?° n®&° log n) steps. (The condition on the sparseness of subgraphs 
Hf is set to ensure that G cannot contain any k-critical subgraph and hence is (k — 1)- 


colorable; cf. Fact F31 of §5.1.3.) 


F89: [K616] Every bipartite graph G can be properly edge-colored in polynomial time 
with A(G) colors. 


F90: [Ho81] (k = 3), [LeGa83] (k > 4) It is NP-complete to decide whether a k-regular 
graph is edge k-colorable. 


F91: [{McSa94] For every k > 3 it is NP-complete to decide whether a k-regular 
bipartite graph admits a total coloring with & + 1 colors. 


F92: [BrLo98] For every fixed k, it can be decided in polynomial time whether G 
admits a vertex partition into k-element sets, each of which is independent in G or in 
its complement G. 


F93: [GrLoSc84] The chromatic number of perfect graphs can be determined in poly- 
nomial time. 


F94; [KrKrTuWo01] For a fixed graph H, let Forb(H) be the class of graphs not 
containing any induced subgraphs isomorphic to H. Determining .(G) for the graphs 
G € Forb(#) is polynomial if H is an induced subgraph of P, or of P3 U Ky (the path 
of length two plus an isolated vertex), and is NP-hard for any other H. 


F95: Although the chromatic number of graphs containing no induced Ps is NP-hard to 
compute (as implied by Fact F94), for every fixed k there is a polynomial-time algorithm 
that decides whether a P5-free graph is k-colorable, and finds a proper k-coloring if it 
is ([HoKaLoSaSh10]). 


F96: The 3-colorability of graphs without any induced Pg subgraph can be decided, 
and a 3-coloring can be found if it exists, in polynomial time ([RaSc04]). On the other 
hand, deciding the 4-colorability of Pj2-free graphs and the 5-colorability of Ps-free 
graphs are NP-complete ({[LeRaSc07] and [SgWo01], respectively). 


F97: [KrTu02] For every k > 3, it is NP-complete to decide whether the clique 
hypergraph of a perfect graph G with w(G) = k is 2-colorable. 


F98: [KrTu02] If G is planar, then the decision of 2-colorability (and the determination 
of chromatic number) of its clique hypergraph is polynomial-time solvable. 


F99: Planar graphs can be properly 5-colored, and also properly 4-colored, in polyno- 
mial time. On the other hand ([VeWe92]), it is ##P-complete to determine the number 
of proper 4-colorings of a planar graph. 


F100: [Th94] Given a planar graph with a 5-assignment LD on its vertices, a proper 
L-coloring can be found in linear time. 
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F101: [HeNe90] For every non-bipartite graph H it is NP-complete to decide whether 
an input graph G is H-colorable. 


F102: [FeHeHu03] The graphs H (with or without loops) have been characterized 
into which the list homomorphism (list H-coloring) problem G —> H for a generic input 
graph G can be decided in polynomial time. (There is a dichotomy between polynomial- 
time solvability and NP-completeness.) In particular, if all vertices of H have loops, 
then list H-coloring is solvable in polynomial time if and only if H is an interval graph 
({FeHe98]), and if H has no loops, then list H-coloring is solvable in polynomial time if 
and only if the complement H of H is a circular-arc graph ([FeHeHu99]). 


F103: [DyGr00] The number of homomorphisms G + H (where H is fixed and G is 
the input graph) can be determined in polynomial time if each component of H is a 
complete graph with loops at all of its vertices, or a complete bipartite graph without 
loops, or an isolated vertex; and it is ##P-complete otherwise. 


F104: [HeNeZh96] Suppose that the following property holds for the digraph H: A 
digraph G is not H-colorable if and only if there exists an oriented tree T such that 
T—> Gand T A H. Then H-colorability is decidable in polynomial time. 


F105: [GuWeWo92] There exist oriented trees T' such that it is NP-complete to decide 
whether G — T. 


F106: For every k > 3, it is IIf-complete to decide whether ch(G) < k ([GuTa09]). It 
remains IIS-complete for k = 4 on planar graphs and for k = 3 on triangle-free planar 
graphs ([Gu96]). 


F107: [JaSc97] It is NP-complete to decide whether a complete bipartite graph with 
given lists on its vertices is colorable. 


F108: [KrTu94] The List Coloring problem remains NP-complete on the instances 
satisfying all the following three conditions (also if restricted to planar graphs): every 
list has at most 3 colors, every color occurs in at most 3 lists, and every vertex has 
degree at most 3. On the other hand, both the decision and search versions of the 
problem can be solved in linear time if every list has at most 2 colors, or every color 
occurs in at most 2 lists, or every vertex has degree at most 2. 


F109: [Lo73] For every k > 2, it is NP-complete to decide whether a hypergraph is 
k-colorable. It remains NP-complete for k = 2 on 3-uniform hypergraphs. For all k > 2 
and r > 2 it is NP-complete also on r-uniform hypergraphs in which any two edges share 
at most one vertex, except for k = r = 2 ([PhR684]). Also, deciding 14-colorability of 
a Steiner triple system is NP-complete. 


F110: [TuVoZh02] It is NP-complete to decide whether a mixed hypergraph is col- 
orable, and given a mixed hypergraph H with a strict coloring, it is CoONP-complete to 
decide whether H is uniquely colorable. 


F111: [Mall] For every k > 2, on the clique hypergraphs of graphs it is ©5-complete 
to test k-colorability, and IIS-complete to decide whether a given list assignment on the 
vertices admits a proper list clique coloring. (To decide whether the clique hypergraph 
of every induced subgraph is k-colorable is also II-complete.) 
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F112: {[BuTu09a] If a color-bounded interval hypergraph H = (X,¥F, s, t) is colorable, 
then y(H) = maxrer s(F), and there is a polynomial-time algorithm that transforms 
any proper vertex coloring of H to one with .(H) colors. (Time complexity results 
for coloring problems on several functional and structural subclasses of stably bounded 
hypergraphs — and also on subclasses of mixed hypergraphs — are systematically sum- 
marized in [BuTul13]). 


F113: On an unrestricted input graph it is NP-hard to determine the chromatic sum, 
but it is polynomial on trees ([KuSc89]) and also on the line graphs of trees ([GiKu00]). 


F114: [Ja97| If the cost set contains at least four colors, then on bipartite graphs it is 
NP-hard to determine the minimum cost of a proper coloring. 


F115: [HuTu93] Precoloring extension on the complements of bipartite graphs, and 
also on split graphs, is solvable with exactly the same efficiency (in polynomial time) as 
the Bipartite Matching problem, but it is NP-complete on bipartite graphs. It remains 
NP-complete on bipartite graphs even if just 3 colors may be used ([Kr93]). 


F116: [BiHuTu92] On interval graphs, 1-PrExt (see D28) is solvable in polynomial 
time, but 2-PrExt is NP-complete. (On unit interval graphs, the unrestricted PrExt 
problem is NP-complete [Ma06].) 


F117:  1-PrExt is polynomial-time solvable on chordal graphs ([Ma07]) but NP- 
complete on permutation graphs ([Ja97]). 


F118: On the class of trees, to decide whether there exists a labeling with span at 
most 2 (2) for L(r1,7r2)-labeling with fixed r1,7r2 is polynomial-time solvable if 1, is 
a multiple of r2, and NP-complete otherwise [FiGoKr08]; (ii) for L(r1,1)-labeling is 
polynomial-time solvable even when 7; is part of the input [ChKeKuLiYe00]; and (iz) 
for L(2,1,1)-labeling is NP-complete [GoLiPal0], despite that the minimum span for 
L(2,1,1) on any tree T is either w(T?) — 1 or w(T?) [FiGokKr04]. 


F119: [GoHeHeHaRa08] It is NP-complete to decide whether a graph admits a packing 
coloring with 4 colors (see definition in 85.1.7), while the graphs with x, < 3 can be 
recognized in polynomial time. 


F120: [BoDeJaKIKrMiTu98] It is NP-hard to determine the ranking number of bi- 
partite graphs, and also of complements of bipartite graphs. 


F121: [LaYu98] The ranking number of line graphs is NP-hard to compute. 


F122: The ranking number can be determined in polynomial time for line graphs 
of trees ({(dToGrSc95]), for graphs contained in chordal graphs of bounded clique size 
({[BoDeJak]KrMiiTu98}]), interval graphs ([AsHe94]), and graphs in which there is only 
a polynomially bounded number of minimal separators ([BrKIKrMii02}). 


F123: [KrTu99] It is NP-complete to decide whether an acyclic, planar directed graph 
has ranking number at most 3 (while for any k, the undirected connected graphs of 
ranking number at most k can be recognized in constant time [BoDeJaK]KrMiuTu98)}). 
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REMARKS 


R26: The 5-coloring and 5-list-coloring algorithms on planar graphs are efficient. 
Although the proofs of the Four Color Theorem also yield polynomial algorithms that 
4-color a planar graph, those algorithms are not practical. 


R27: Most applications of the Lovdsz Local Lemma |[ErLo75] can be made algorithmic 
by the method presented in [MoTal10]. 


R28: Further algorithmic aspects of coloring problems are discussed in the Special 
Issue on Computational Methods for Graph Coloring and Its Generalizations, Discrete 
Applied Math. 156:2 (2008), and in the survey [GaHe06]. 


Approximation 


DEFINITIONS 


D47: Let r(n): NR? be a function. An algorithm is an r(n)-approximation for 
chromatic number if, for every n and every input graph G with n vertices it outputs 
an integer & such that y(G) < k < r(n)x(G). Analogous terminology applies to any 
minimization problem, e.g., to determine ch(G). 


D48: A doubly-periodic graph is an infinite graph whose vertices are labeled vj;¢ 
(i,j € Z, €€ {1,...,n}); the subgraphs induced by {vij1, vija,.--,Vign} — called cells 
— are isomorphic for all pairs 7, 7, any other edge joins neighboring cells (i.e., cells (7, 7) 
and (i’, j’) where |i—7’| < 1 and |7—j7’| < 1), and both mappings i» i+1 and jw j+1 
are automorphisms of G. 


FACTS 


F124: [Zu07] Unless P = NP, no polynomial-time O(n!~*)-approximation exists for 
x(G), with any ¢ > 0. (Earlier non-approximability results were proved for O(n!/7~*) 
assuming P 4 NP [BeGoSu98] and for O(n!~*) assuming ZPP #4 NP [FeKi98]. The 
analogue of the latter is proved for uniform hypergraphs in [KrSu98].) 


F125: [KhLiSa00] Unless P = NP, no polynomial-time algorithm can possibly color 
the k-colorable graphs with k + 2|k/3| — 1 colors. 


F126: The chromatic number can be approximated in polynomial time within the 
ratio O(n(log log n)?/(log n)*) ({Ha93]), and also within 


max {O(n/ log’! n), O(A log log n/ log n)} 


for any fixed m ([Pa01]}). 
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F127: [KaTh12] There is a combinatorial algorithm that colors any 3-colorable graph in 
polynomial time using O(n4*/!) colors, and applying semi-definite programming meth- 
ods the number of colors can be made O(n®-?°%8). Moreover, there is a randomized 
polynomial-time algorithm that colors any graph with at most 


min 4 OCA: log!/? A log n), O(n? -3/(#+1) log!/? n)} 
colors, where k = x(G) > 3 and A = A(G) ([KaMoSu98}). 


F128: Due to the inequalities clog col(G) < ch(G) < col(G), the choice number 
is constant-approximable on classes of graphs with bounded choice number, which is 
closely related to assuming bounded average degree. 


F129: [DuF 97] The difference n — x(G) is approximable within 360/289. 
F130: [BoGiHak195] The ranking number can be approximated within O(log? n). 


F131: [Ki98] For every k there is an on-line algorithm that properly colors every 
k-colorable graph with at most O(n!~!/*') colors. For k = 3 and k = 4 the bound can 
be improved to O(n2/3 log'/? n) and O(n°/6 log!/© n), respectively. 


F132: ([Bu84] For every integer k > 3, there exists a doubly-periodic planar graph 
G of maximum degree 4 and a properly colored finite subgraph F' C G such that it is 
undecidable whether the coloring of F can be extended to a proper k-coloring of G. 
An analogous result holds for the undecidability of whether a partial homomorphism 
F — H can be extended to a homomorphism G — H, whenever H is a finite non- 
bipartite graph ([DuEmGi98]). For bipartite H, the necessary and sufficient conditions 
of (un)decidability are not known. 
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INTRODUCTION 

Finding maximum cliques and maximum independent sets are among the most appli- 
cable problems in graph theory. We give an overview of both algorithmic and theoretical 
results on these problems. 


5.3.1 Basic Definitions and Applications 


In this section, all graphs are simple, i.e., they do not have self-loops or multi-edges. 


Some Combinatorial Optimization Problems 


DEFINITIONS 


D1: For a graph G, a set S' of vertices is an independent set if no two vertices in S 
are adjacent. 


D2: The number of vertices in a maximum-size independent set of G is called the 
independence number of G and is denoted ind(G). 


D3: A clique in a graph G is a maximal set of mutually adjacent vertices of G. The 
clique number, denoted w(G), is the number of a vertices in a largest clique of G. 


D4: A vertex cover ina graph G is a set S of vertices such that at least one endpoint 
of every edge of G is in S. 


D5: A matching in a graph G is a set of mutually non-adjacent edges of G. 
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REMARKS 


R1: The mazimum-clique problem (determining w(G) for a given graph G) and the 
mazimum-independent-set problem (determining ind(G)) are the main problems 
considered in this section. Observe that w(G) = ind(G) for any graph G, where G 
denotes the edge-complement graph. 


R2: Also considered in this section is the minimum-vertex-cover problem of finding 
a vertex cover of minimum cardinality. 


R3: Sometimes, we will consider graphs with non-negative weights on their vertices. 


EXAMPLE 


El: It is easy to verify for the Peterson graph G shown in Figure 5.3.1 that w(G) = 2, 
ind(G) = 4, and a minimum vertex cover has size 5. 


Figure 5.3.1: The Petersen graph. 


Vertex- Weighted Graphs 


DEFINITIONS 


D6: A graph G is vertex-weighted if every vertex x is assigned a non-negative weight 
w(a); the graph is denoted (G,w). 


D7: The weight of a vertex set S in a vertex-weighted graph is the sum of the 
weights of the vertices in S. 


NOTATION: The weight of a maximum-weight independent set in G is denoted ind(G, w). 
The weight of a maximum-weight clique in G is denoted w(G, w). 


REMARKS 


R4: The maximum-weight-independent-set problem (determining ind(G,w)) 
generalizes the unweighted one: assign weight 1 to every vertex of a graph. A simi- 
lar remark holds for the maxitmum-weight-clique problem. 


R5: The first three of the following facts are immediate consequences of the definitions. 
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FACTS 

F1: For every vertex-weighted graph (G, w), 
ind(G, w) = w(G,w), 

which establishes a duality between the two problems. 


F2: A set S of vertices in a graph G = (V, £) is a vertex cover if and only if V — S is 
an independent set. 


F3: The size of a maximum matching in a graph G is less than or equal to the size of 
a minimum cover of G. 


F4: [Ko31, Eg31] If G is a bipartite graph, then the maximum size of a matching in G 
equals the minimum cardinality of a vertex cover of G. 


Applications Involving Hamming Distance 


There are numerous and varied applications of the combinatorial optimization problems 
introduced above. Perhaps among the most studied are those from coding theory. 


DEFINITIONS 
D8: The Hamming distance between a pair 

U = (t1,---,Un),U = (U1,---,Un) 
of binary vectors is the number of indices i for which u; 4 2,;. 


D9: The vertex set of the Hamming graph H(n, d) consists of all binary vectors with 
n coordinates. A pair u,v of vertices in H(n,d) are adjacent if the Hamming distance 
between them is at least d. 


REMARKS 


R6: The Hamming graph is of interest for error-correcting codes. In particular, a set 
of binary vectors in which any two have Hamming distance at least d forms a binary 
code that can correct |(d—1)/2]| errors [MaS179]. 


R7: A natural question arises: How many vectors with n coordinates can be in a code 
in which any two vectors have Hamming distance d? It follows from the definitions that 
this number equals the clique number, w(H(n, d)). 


R8: For further discussions and results of Hamming distance, see, for example, [BoBu- 
PaPe99] and [OsO01]. Other applications include fault diagnosis [BePe90, HaPaVa93], 
machine learning [HoSk89, HaJa90], and detecting embedded network structures in lin- 
ear programs [GuGuMiMa00, GuGuMiZv04]. 
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5.3.2 Integer Programming Formulations 


The simplest formulation of the maximum-weight-clique problem is based on the 
edges of the input graph. 


Edge-based formulation: Let G = (V,£E) be a vertex-weighted graph with V = 
{v1 ...,Un} and weights w; = w(v;). The maximum weight of a clique can be found by 
solving the following integer program: 


nm 
maxw = So wii 
i=1 
subject to ata; <1 Viu,uj}¢E 
ve; =Oo0rl, @=1y.yn 


FACTS 


F5: In the edge-based formulation, each feasible solution x corresponds to the clique 
S= {vu EV: a4; = 1}. 


F6: [NeTr74, NeTr75] Let x be an optimal (0, 5, 1)-valued solution to the linear relax- 
ation of the edge formulation, and let J = {j : x; = 1}. Then there exists an optimal 


solution z* to the edge-based formulation such that x; = 1 for every j € J. 


REMARKS 


R9: Unfortunately, the result in the last fact appears to be of relatively minor computa- 
tional value since optimal solutions of the linear relaxation of the edge-based formulation 
normally have only a small number of integer components, and the gap between opti- 
mal solutions of the edge-based formulation and its linear relaxation is usually too large. 
[BoBuPaPe99] 


R10: Using Fact Fl we can transform the edge-based formulation of the maximum- 
weight-clique problem to the corresponding formulation of the maximum-weight- 
independent-set problem, which is equivalent to the following nonlinear optimization 
problem first studied in [Sh90]. 


Shor formulation: 


n 
minw = So win; 
w=1 
subject to eee 0 Miu} EE 
a? — a, =0 tS 1ongn 


COMPUTATIONAL NOTE: — Shor [Sh90] reported very good computational results using 
his formulation. 
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Two More Formulations of the Maximum-Clique Problem 


While the formulations above are relatively straightforward, the following ones obtained 
by Motzkin and Straus [MoSt65] are less obvious. 


DEFINITIONS 


D10: The standard simplex A in R” is defined as follows: 


A={xeE RR”: «a, >0, i=1,...,n, y= 1} 
i=1 


NOTATION: For a graph G with adjacency matrix Ag and « € R”, let g(x) = x? Agz. 


D111: Let G = (V,E£) be a graph with vertices v1,...,Un and let S C V be arbitrary. 


The characteristic vector 7° € R” is defined as follows: «5 = (x?,a3,..., 2°), where 


b] n 
oo = I if vu; € S and x? = 0, otherwise. 


Motzkin—Straus Theorem [MoSt65]: Let G be a graph and let 

x* =argmazr{g(x): « € A} 
(i.e., a* € A and g(a*) > g(x) Va € A). Then w(G) = 1/(1—g(a*)). Moreover, a subset 
S of vertices of G is a maximum clique if and only if x9 = argmar{g(x): x € A}. 


REMARK 


R11: One drawback of the Motzkin-Straus formulation is the fact that some solutions 
of argmax{g(a) : x € A} are not characteristic vectors [PaPh90, PeJa95]. Thus, the 
following variation of the Motzkin-Straus formulation due to Bomze [Bo97] is of interest. 


NOTATION: For a graph G with adjacency matrix Ag, let f(x) = x7 Agx + (a? x)/2. 


Bomze Theorem [Bo97]: Let S be a subset of vertices of a graph G. Then 
(a) S is a maximum clique in G if and only if 2° = argmar{ f(x): x € A}. 
(b) S is a clique in G if and only if x° is a local maximizer of { f(x): ax € A}. 
(c) All local maximizers x of { f(x): a € A} are characteristic vectors. 


5.3.3 Hardness Results 


The maximum-clique problem is one of the first shown to be NP-hard [Ka72]. Since 
then many researchers have tried to gain a more precise understanding of the difficulty 
of the problem (see, e.g., [AuCrGaKaMaPr99, BoBuPaPe99}). 


NOTATION: If z is an algorithm for the maximum-independent-set problem, then 7(G) 
denotes the independent set produced by a when the input graph is G. 


FACTS 


F7: {Ha99] Unless P = NP, there can be no polynomial-time algorithm that approxi- 
mates the maximum clique to within a factor better than O(n!~‘), for any € > 0. 
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The following positive result has a nice (and short) proof based on some ideas of 
Paul Erdés (see [Ha98}). 


F8: [BoHa92] There is a polynomial-time algorithm 7 such that for any n-vertex graph 
G, ind(G)/|n(G)| = O(n/ log? n). 


This result was improved by Fiege [Fi04]. 


F9: [Fi04] There is a polynomial-time algorithm 7 such that for any n-vertex graph 
G, ind(G)/|x(G)| = O(n(log log n)?/ log? n). 


REMARK 


R12: Since the approximation in Fact F9 is still weak, it is natural to consider the 
approximation for graphs for which certain parameters are restricted. One such param- 
eter is the maximum degree, A(G), of a graph G. For an overview of approximation 
algorithms whose performance is measured in terms of A(G), see [LaTi01]. 


FACTS 


F10: [AlFeWiZu95] Unless P = NP, there exists a constant « > 0 such that there is no 
polynomial-time algorithm m for which ind(G)/|a(G)| = O(A(G)*) for every graph G. 


The following result is due to Vishvanathan (see [Ha98]). 


F11: There is a polynomial-time algorithm 7 such that for every graph G, 


ind(G@)/|m(G)| = O(A(G) log log A(G)/log A(G)) 


REMARKS 


R13: There is a simple polynomial-time algorithm for the minimum-vertex-cover prob- 
lem that provides a 2-approximation (i.e., no worse than twice the optimum): find a 
maximum matching M in a given graph G and output the vertices of M as a vertex 
cover of G. For slightly better approximation results, see [AuCrGakKaMaPr99]. 


R14: The last remark and Fact F7 are, in a way, at odds with each other. The 
maximum-clique and the minimum-vertex-cover problems are dual, in a sense (via the 
maximum-independent-set problem as noted earlier). Nevertheless, while the former 
cannot be approximated to any good degree, the latter can be. This ‘strange’ situation 
is somewhat resolved by the next fact. Observe that a feasible solution of the maximum- 
clique problem is a set of vertices that induces a complete subgraph. 


FACT 


F12: [GuVaYe03] Let w be any polynomial-time algorithm for the maximum-clique 
problem and let p(n) be any polynomial function of n. Unless P = NP, there exists an 
n-vertex graph G such that w(G) has fewer vertices than at least 7 ae of the complete 


subgraphs of G. The analogous fact holds for the minimum-vertex-cover problem. 
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DEFINITIONS 


D12: [ImPa99] The exponential time hypothesis states that there is an € > 0 such 
that 3-SAT cannot be solved in time O(2°), where n is the number of variables. 


D13: The k-clique problem is the problem of deciding whether a graph has a clique 
with at least k vertices. 


D14: Parameterized complexity studies problems equipped with a small integer 
parameter k; such problems are called parameterized problems. A parameterized 
problem is said to be fixed-parameter tractable if there is an algorithm for solving 
it on inputs of size n in time f(k)nOC“, where f is an arbitrary function in k only. For 
more information on parameterized algorithms and complexity, see [DoFe99, F1Gr06]. 


FACTS 
F13: It is easy to solve the k-clique problem in time O(k?n*). 


F14: [ChHuKaXi06] showed that the k-clique problem cannot be solved in time n°(*) 
unless the exponential time hypothesis fails. 


REMARK 


R15: Downey and Fellows [DoFe95] defined a hierarchy of parameterized problems, the 
W hierarchy, that they conjectured are not fixed-parameter tractable; they proved that 
the k-clique problem is hard for the first level of this hierarchy, W[1]. Thus, according 
to their conjecture, the k-clique problem is not fixed-parameter tractable. Moreover, 
this result provides the basis for proofs of W/1]-hardness of many other problems, and 
thus serves as an analogue of the Cook—Levin theorem for parameterized complexity. 


5.3.4 Bounds on Independence and Clique Numbers 


Every maximum-clique or independent-set heuristic provides an ‘algorithmic’ lower 
bound to the corresponding problem. In this subsection, we consider ‘analytical’ ones 
that require only certain parameters of the input graph. 


Lower Bounds 


Caro and Wei obtained the lower bound given in Fact F15, and Alon and Spencer gave 
an elegant probabilistic proof of this bound [AlSp92]. Sakai, Togasaki, and Yamazaki 
generalized Fact F15 to vertex-weighted graphs (Fact F16). 


NOTATION: For a vertex v in a graph G, N(v) is the set of vertices adjacent to v and 
N{v] = N(v) U {v}. 


FACTS 


F15: [Ca79,We81] Let G = (V, E) be a graph. Then ind(G) > So cy 1/(deg(v) +1). 
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F16: [SaToYa03] Every vertex-weighted graph G = (V, £) contains an independence 
set S of weight at least 


max{ S> w(e)/(deg(«) +1), S> w(e)?/[ > w(y)]}. 


cEeV cEeV yEN[z] 
Moreover, a set S with the above property can be found in polynomial time. 
Selkow [Se94] improved the Caro-Wei bound using additional information. 


F17: [Se94] Let G = (V, E) be a graph. Then 


1 
ind(G (1 OF aA 
“ ae yr! ee +17 2a Tealuy Fi )+1 b): 


The next few results involve the eigenvalues of the adjacency matrix Ag of G. 
DEFINITION 


D15: The Perron root p(G) is the largest eigenvalue of Ac. 


FACTS 


F18: The adjacency matrix of a connected graph G is irreducible, symmetric, and has 
all non-negative entries; hence, all its eigenvalues are real (see, e.g., [HoJo85]). 


F19: [Wi86] For a connected graph G on n vertices, w(G) > 
was improved by Budinich [Bu03].) 


nave)’ (This bound 


Upper bounds 


FACTS 


F20: [Wi67] For a connected graph G, w(G) < Ap(G) + 1. Equality holds if and only 
if G is complete. 


NOTATION: For a connected graph G, let A_1(G) denote the number of eigenvalues of 
Ag that do not exceed —1. 


F21: [AmHa72] For a connected graph G, w(G) < A_1(G) +1. Equality holds if and 
only if G is complete multipartite. 


F22: [Bu03] For a connected graph G on n vertices, w(G) <n — SrankAg. 


F23: Each of the three lower bounds given above can be computed in time O(n°), and 
one can find examples that show the bounds are sharp [Bu03]. 


COMPUTATIONAL NOTE: — Budinich [Bu03] tested the lower bounds on a set of 700 
random graphs of order 100 and 200. For these graphs, the smaller of the first two 
upper bounds was almost always better than the third upper bound. 
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5.3.5 Exact Algorithms 
Clique Enumeration 


Harary and Ross [HaRo57] initiated an algorithmic and theoretical study of the enu- 
meration of all cliques in a graph. This topic has a variety of applications (see, e.g., 
[Bo64, HaRo57, PaUn59]). The first significant theoretical result (see Fact F24) is due 
to Moon and Moser [MoMo65]. 


FACTS 


F24: [MoMo65] The maximum number of cliques in an n-vertex graph equals 
3”/3 if n = 0 (mod 3) 
4.3(-4)/3 if n = 1 (mod 3) 
2-3("-2)/3 if n = 2 (mod 3) 


(Some extensions of this result are discussed in [SaVa02].) 


F25: [ToTaTa06] There is an algorithm for listing all cliques of an n-vertex graph in 
time O(3"/°). This is a modification of the backtracking algorithm of Bron and Kerbosch 
[BrKe73]. 


DEFINITIONS 


D16: We say that an algorithm A for generation of all cliques has time delay T if T 
bounds from above the time spent by A between two consecutive cliques and before the 
first clique is generated. 


D17: Given a graph G, the arboricity is the minimum number of edge-disjoint acyclic 
subgraphs whose union is G. 


FACTS 


F26: [TsIdAvSh77] There is an algorithm for listing all cliques of a graph G = (V, E) 
with time delay O(|V]||E]). 


F27: [ChNi85] There is an algorithm for listing all cliques of a graph G = (V, F) with 
time delay O(a|E|), where a is the arboricity of G. 


F28: [MaUn04] There is an algorithm for listing all cliques in an n-vertex graph with 
time delay O(M(n)), where M(n) is the time required to multiply two n x n matrices. 


F29: [MaUn04] There is an algorithm for listing all cliques in a graph G = (V, E) with 
time delay O(A*), where A is the maximum degree of G; the algorithm also requires 
O(|V |||) time before the first clique is generated. 
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Maximum-Clique and Maximum-Weight-Clique Algorithms 


Clearly, the algorithms mentioned previously can be used (directly or after simple mod- 
ifications) to find a maximum clique in a graph. However, the maximum-clique problem 
has attracted more attention than the clique-enumeration problem, and the use of cer- 
tain procedures has made algorithms for the maximum-clique problem quite fast. 


FACTS 


In the following three facts, n denotes the number of vertices in the input graph. 


F30: ([TaTr77] There is an algorithm for solving the maximum-clique problem with 
time complexity O(1.2599”). 


F31: [Ro86] There is an algorithm for solving the maximum-clique problem with time 
complexity O(1.2108”"). 


F32: [Ro01] There is an algorithm for solving the maximum-clique problem with time 
complexity O(1.1889”). This appears to be the fastest (with respect to the worst-case 
complexity) currently known algorithm for the problem. 


COMPUTATIONAL NOTE: Branch-and-cut algorithms were used with great success 
for several combinatorial optimization problems; see, e.g., Section 4.6. However, for 
the maximum-clique problem, branch-and-cut algorithms currently remain, in general, 
inferior to the state-of-the-art branch-and-bound algorithms [RoSm01]. 


COMPUTATIONAL NOTE: There are several quite efficient branch-and-bound algo- 
rithms that use fast coloring heuristics to produce upper and lower bounds. See, e.g., 
algorithms in [Os02, ToKa07]. 


R16: Branch-and-bound algorithms for the maximum-weight-clique problem are dis- 
cussed in [BoBuPaPe99, Os01]. 


R17: Some algorithms have been compared on random graphs and on special families of 
graphs inspired by applications. For the maximum-clique problem, frequently occurring 
special families of graphs are collected in [JoTr96] and explained in [HaPaVa93]. [Os01] 
introduces a special family of instances for the maximum-weight-clique problem. 


COMPUTATIONAL NOTE: There are a few maximum-clique and maximum-weight- 
clique computer codes freely available for research purposes; see, e.g., [Di,NiOs03]. 


5.3.6 Heuristics 


When the instance of the maximum-weight-clique problem under consideration is 
of large size or the data is not precise (which is the case in many applications) or a 
solution has to be obtained very quickly, one should resort to heuristics rather than 
exact algorithms. Moreover, heuristics form important parts of many exact algorithms. 
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Construction Heuristics and Local Search 


D18: A construction heuristic produces a feasible solution without any attempt to 
improve it. 


REMARKS 


R18: Construction heuristics are normally very fast and provide quick solutions, and 
lower or upper bounds. However, their solutions cannot be expected to be of high 
quality. 


R19: The simplest construction heuristic for the maximum-clique problem is to add 
one vertex at a time to an emerging clique. It is logical to choose in each iteration 
an eligible vertex of maximum degree [KoRu87]. Alternatively, one may delete vertices 
from the given graph one by one until a clique is obtained [KoRu87]. 


R20: Most approximation algorithms for various optimization problems are, in fact, 
construction heuristics. Examples of construction heuristics used in maximum-clique 
approximation algorithms may be found in [BoHa92] and [Fi04]. 


DEFINITIONS 


D19: A local search (LS) heuristic starts from a feasible solution and in each iteration 
until termination chooses the next solution from a neighborhood of solutions that are in 
some prescribed sense close to the current solution. 


D20: An improvement LS (or simply local improvement) is a local search that 
always chooses a solution that is better than the current one and terminates when it 
cannot find one in the neighborhood. 


COMPUTATIONAL NOTE: — Local improvement does not appear to perform particularly 
well for the maximum-clique problem [GuGuMiZv04] as it may terminate at a relatively 
small clique. Local search algorithms that do not require monotonic improvement are 
much more flexible in that they can escape from local optima that are non-maximum 
cliques; perhaps the most flexible among them is briefly discussed below. 


Tabu Search 


This metaheuristic appears to provide a good trade-off between computational time and 
solution quality. 


DEFINITION 


D21: Tabu search is a local search in which solutions that are worse than the current 
one can be chosen provided that they are not in any of the so-called tabu lists. 


REMARKS 


R21: Tabu search was introduced independently by Glover [G189, G190] and Hansen 
and Jaumard [HaJa90a]. ‘Pure’ tabu search techniques for the maximum-clique problem 
were implemented in a number of papers; see, e.g., [FrHeWe89, SoGe96]. 
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COMPUTATIONAL NOTE: Tabu-search algorithms use parameters that have to be 
fine-tuned in order to achieve good results. This slows down the development and use 
of tabu-search computer codes. Battiti and Protasi [BaPr01] deal with this issue by 
adjusting the parameters using an internal learning loop. 


R22: For brief descriptions and discussions of other metaheuristics applied to the 
maximum-clique problem, see [BoBuPaPe99]. 
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INTRODUCTION 

The vast body of work on factors and factorizations has much in common with 
other areas of graph theory. Indeed, factorization significantly overlaps the topic of 
edge-coloring (cf. 85.1), since any color class of a proper edge-coloring in a graph is just 
a matching. Moreover, the hamiltonian cycle problem (cf. §4.5) can be viewed as the 
search for a connected 2-factor. Due to space constraints, we will treat factors of finite 
undirected graphs only. Nevertheless, several papers dealing with infinite graph factors 
and directed graph factors are included in our list of references. 


5.4.1 Preliminaries 


DEFINITIONS 


D1: Given a graph (multigraph, general graph) G, we say that H is a factor of G if 
H is a spanning subgraph of G. 


D2: A factor that is n-regular is called an n-factor. 


D3: A factor defined only in terms of the degrees of its vertices is called a degree 
factor. 


D4: A factor described in terms of graph-theoretic properties other than its vertex 
degrees is called a component factor. 


D5: Ifa graph G can be represented as the edge-disjoint union of factors FP), Fo,... Fx, 
we shall refer to {F,, Fo,... Fy} as a factorization of graph G. 
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FACTS 


Today most workers in the field attribute the birth of graph factorization to two theo- 
rems of the Danish mathematician Julius Petersen. The analogous result of Babler for 
regular graphs of odd degree did not appear until almost fifty years later. 


F1: [Pe1891] A 3-regular multigraph with at most one cutedge contains a 1-factor (and 
hence also a 2-factor). 


F2: [Pel891] Every 2k-regular multigraph contains a 2-factor (and, hence, it has a 
factorization into 2-factors). 


F3: [Ba38] Every 2-edge-connected (2k + 1)-regular multigraph contains a 2-factor. 


REMARKS 


R1: The names degree factors and component factors for the two main categories of 
factors treated in the literature seem to be due to Akiyama and Kano [AkKan85al. 


R2: These two main problem categories overlap. For example, finding a 1-factor and 
finding a factor each component of which is an edge amounts to the same thing. 


R3: A thorough survey tracing the descendants of Petersen’s factorization results for 
regular graphs may be found in [Vo95]. 


5.4.2 1-Factors 


The most studied of degree factors are those in which each component is a single 
edge. We observe that §11.3 applies matchings to assignments. In our complementary 
approach here, we are principally interested in those properties of 1-factors that most 
naturally extend to analogous properties of more general factors. 


Conditions for a Graph to Have a 1-Factor 


DEFINITIONS 


D6: A 1-factor (or perfect matching) of graph G is a set of vertex-disjoint edges 
in G which together span V(G). 


D7: The bipartite graph K1,3 is often called a claw. A graph containing no Ky 3 as 
an induced subgraph is said to be claw-free. 


D8: Graph G is said to have the odd-cycle property if every pair of odd cycles in G 
either have a vertex in common or are joined by an edge. 


D9: The toughness of graph G, denoted by tough (G), is defined to be +oo when G 
is complete and otherwise to be 


min{|S|/c(G — S)|S C V(G)} 


where the minimum is taken over all subsets S C V(G) and c(G—S') denotes the number 
of components of G — S. 
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D10: The binding number of a graph G, denoted bind (G), is defined to be 
min {|N(X)|/|X| | 04 X CV(G) and N(X) 4 V(G)} 


D11: We call a sequence of non-negative integers d;,...,d, graphical if there exists 
a graph G of order n the vertices of which have, in some order, degrees d1,..., dn. 


D12: If |V(G)| > 2m +2 and G has a perfect matching, then G is m-extendable if 
every matching of size m extends to (i.e., is a subset of) a perfect matching. 


D13: Let T be a set of vertices in graph G. 
e A T-path is a path beginning and ending in T. 
e A T-path covering is a union of vertex-disjoint T-paths covering T’. 


FACTS 


Arguably, the most influential theorem in the study of 1-factors has been the seminal 
result called Tutte’s 1-factor Theorem. 


F4: [Tu47] Tutte’s 1-Factor Theorem: A graph G has a 1-factor if and only if for 
each S' C V(G), co(G — S) < |S|, where c,(G — S) denotes the number of components 
of G — § which have an odd number of vertices. 


F5: [Pel891] Petersen’s Theorem: Every 2-edge-connected 3-regular multigraph 
has a 1-factor. 


F6: [Ple79] If G is an r-regular, (r — 1)-edge-connected multigraph of even order, then 
G has a 1-factor not containing any of r — 1 prescribed edges. (And hence G has a 
1-factor containing any prescribed edge.) 


F7: [Kai08] If r > 2 and G is an r-regular, r-edge-connected graph with T a subset of 
V(G), then every edge of G is contained in a T-path covering. 


F8: [Su74, Su76], [La75] If G is a connected claw-free graph of even order, then G has 
a 1-factor. 


F9: [Su76] If G is an n-connected graph of even order, and if G has no induced subgraph 
isomorphic to the bipartite graph Ky,»+41, then G has a 1-factor. 


F10: [FuHoMc65] If G is r-regular of even order and has the odd-cycle property, then 
G has a 1-factor. 


F11: [Ni78, Ni79] If G is a k-connected graph (k > 4) of even order and if 7(G) < 
k(k—2)/4, then G has a 1-factor. (As in Chapter 7, y(G) denotes the (orientable) genus 
of G.) 


F12: If G is of even order and tough(G) > 1, then G has a 1-factor. This follows 
immediately from Tutte’s 1-factor Theorem. 


F13: — [Ku73], [Lo74] There exists a graph G having a 1-factor and degree sequence 
d,,d2,...,dy if and only if both the sequences d),...,d, and d,; —1,...,d, — 1 are 
graphical. 
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F14: (essentially due to Anderson [An73]) Let G be a graph of even order. If, for all 
X CV(G), 
3 2 
IN(X)] > ming 1V(@)), tI] ~ 3b 


then G has a 1-factor. This theorem can be regarded as a binding number result. 


F15:  [LiGrHo75] If G is a connected graph of even order the automorphism group 
of which acts transitively on V(G), then G has a 1-factor containing any given edge. 
Highly symmetric graphs of even order are guaranteed to have 1-factors by this result. 


F16: [AIP111] If m > 0 and G is a 5-connected even planar triangulation with |V(G)| > 
2m +2, and M is a matching in G with |M| < m such that the distance between any 
two of its edges is at least 5, then G has a perfect matching that contains M. 


REMARKS 


R4: A number of sufficient conditions quite similar to that of Anderson above are 
collected and compared in [Wo90]. A similar condition sufficient for a bipartite graph 
to have a k-factor (respectively, [a, b|-factor (see below)) may be found in [EnOtKan88] 
(respectively, [Kan90a], [Che93]). 


R5: There are now many papers investigating the existence of 1-factors containing or 
excluding specified edge sets. However, space does not permit us to treat these results 
and for the case of 1-factors, we direct the interested reader to two survey articles [P194, 
P196] and the recent book [YuLi09]. 


The Number of 1-Factors: Bounds 


DEFINITIONS 


D14: A graph G is said to be bicritical if G — x — y has a 1-factor for every choice of 
two different vertices x and y. (For further reading on bicritical graphs, see [LoP186].) 


D15: <A hexagonal system is a 2-connected plane graph in which every face is a 
hexagon. 


D16: A 3-connected cubic plane graph with 12 pentagonal faces and all remaining 
faces hexagons is called a fullerene. 


NOTATION: ®(G) denotes the number of 1-factors in graph G. 


FACTS 

F17: Let G be connected and have a unique 1-factor. Then: 
(a) [Ko59] G has a cutedge belonging to the 1-factor; 
(b) [LoP186] G contains a vertex of degree < |log (p+ 1)|; and 
(c) [Hetyei (unpublished)] |E(G)| < (|V(G)|/2)?. 
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F18: If G is k-connected and has a 1-factor, then either 
(a) G has at least k! 1-factors, or else 


(b) G is bicritical. It seems somewhat counterintuitive that bicritical graphs should 
be the exception here. It has proven much more difficult to bound 6(G) in the 
bicritical case. 


F19: If G is bicritical, then ®(G) > |V(G)|/2+ 1. Study of the perfect matching 
polytope of G, PM(G), (see [LoP186]) can be utilized to give the bound in this result. 


F20: Ifthe graph G is k-connected and contains a 1-factor, and if |V(G)| is sufficiently 
large, then G has at least k! 1-factors. 


F21: The number of perfect matchings in a cubic bridgeless graph is exponential. More 
specifically: 

[EsKaKi KrNol11] If G is cubic and bridgeless, then ®(G) > 2!V(@)1/3656, 

(Hence, in particular, this bound holds for fullerenes.) 


F22: [AlFr08] If G is a simple graph of even order and degree sequence dj, d2,...,dn, 
then ®(G) < [[P_, (di!) 72%. 


REMARKS 


R6: Gabow, Kaplan and Tarjan [GaKaTa99, GaKaTa01] developed an O(|E|log* |V|) 
algorithm to test whether a graph has a unique 1-factor and find it, if it exists. 


R7: One can bound ®(G) below by the matrix function called a Pfaffian. (For details, 
see [LoP186; §8.3].) In the case when G is planar, the Pfaffian can be used to exactly 
compute ®(G) in polynomial time. 


R8: The connectivity of the graph G can also be employed to yield a lower bound on 
®(G) in some cases. 


R9: Computing ®(G) when G is a hexagonal system or a fullerene is of interest to 
chemists. For a popular introduction to the former, see [Ri01]. Hexagonal systems are 
used to model benzenoid hydrocarbons. As for fullerenes, the literature is vast, but a 
brief background on the ®(G) problem can be found in [Dos02], [KaKrMiSe09]. Some 
other mathematical questions about fullerenes may be found in [Ma00]. 


R10: A long-standing open conjecture about fullerenes states that they all contain a 
hamiltonian cycle. Currently, the best lower bound for a longest cycle in a fullerene of 
order n is 6n/7 + 2/7. In [ErKaMi09] this bound is obtained by studying the 2-factors 
of the fullerene. 
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1-Factors in Bipartite Graphs 


In the special case of bipartite graphs, the story of 1-factors has two principal historical 
roots, one in a result due to P. Hall [Ha35], and the other in a result due to Kénig 
[Ko31, Ko33]. 


DEFINITIONS 


D17: A vertex cover of a graph G is a subset C C V(G) such that every edge of G 
has at least one endvertex in C. 


D18: The vertex-covering number of a graph G is the size of any smallest vertex 
cover in G. Notation: 7(G). 


D19: The matching number of a graph G is the size of any largest matching in G. 
Notation: v(G). 


D20: The permanent of an n x n matrix A, denoted per A, is given by 


per A= S arn(1)42n(2) “++ Ong(n) 


where the sum extends over all permutations 7 of the set {1,...,n}. 


FACTS 


F23: [Ha35] Hall’s Theorem: Let G be a bipartite graph with vertex bipartition 
V(G) = AUB. Then G has a matching of A into B if and only if |N(X)| > |X], for all 
X CA. 


F24: [Fr1912] Marriage Theorem: Let G be a bipartite graph with vertex bipartition 
V(G) = AUB. Then G has a 1-factor matching A onto B if and only if 


(a) [A] = |B] and 
(b) |N(X)| > |X], for all X C A. 
This earlier result of Frobenius is an immediate consequence of Hall’s Theorem. 


F25: It is clear that in any graph G, the matching number and the vertex-covering 
number are related by the inequality v(G) < 7(G). 


F26: [Ko31, Ko33] K6nig’s Theorem: If G is bipartite, then v(G) = 7(G). 


F27: [Ha48] Let G be a simple bipartite graph with bipartition V(G) = AU B, and 
assume that each vertex in A has degree at least k. If G has at least one 1-factor, then 
it has at least k! 1-factors. 


F28: Let G bea simple k-regular bipartite graph on 2n vertices. Then 


k n 
nl (=) < 8G) < (kN! 
n 
The first inequality is equivalent to the famous van der Waerden Conjecture [Wa26] on 
permanents, which was proved independently by [Fa81] and [Eg80, Eg81]. The second 
inequality was proved by [Br73]. 
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F29: [Sc98b] If G is a k-regular bipartite graph of order 2n, then 


aa)> (Hy 


REMARKS 


R11: In fact, it can be shown that Hall’s Theorem and Konig’s Theorem are equivalent. 


R12: Since Konig’s Theorem asserts the equality of the maximum of one quantity 
and the minimum of another, it is often referred to as a minimaz theorem, especially in 
the study of linear programming. For an introduction to such ideas within the confines 
of graph theory, and for the associated polytopal ideas, see [LoP186; Ch. 7 and 12]. 


The Number of 1-Factors: Exact Counting 


DEFINITIONS 


D21: A graph G is a threshold graph if there is a real-valued assignment of weights 
to the vertices such that e = wv if and only if w(u) + w(v) > 0. 


D22: A graph G is a split graph if V(G) = AU B where A spans a complete graph 
and B spans an independent set. 


D23: A graph G is Pfaffian if the edges can be oriented so that every alternating 
cycle C (i.e., an even cycle C such that G—V(C) contains a 1-factor) has an odd number 
of edges oriented clockwise. 


D24: If G is a graph drawn in the plane and M is a 1-factor in G, then the crossing 
number of M, denoted by cr(M), is the number of pair-wise crossings of edges of M. 


REMARK 


R13: As mentioned previously, ®(G) can be computed in polynomial time if G is 
planar. 


FACTS 


F30: [OkUeUn10] ®(G) can be computed in polynomial time if the graph is a threshold 
graph. 


F31: [Kas61, Kas63, Kas67, Fi61, TeFi61] If G is Pfaffian, then ®(G) can be computed 
in polynomial time. 


F32: [McRoSeTh97, RoSeTh99, Mc04, Thomas06] There is a polynomial algorithm to 
decide if a bipartite graph is Pfaffian or not. 


F33: [No08] A graph G is Pfaffian if and only if there exists a drawing of G in the 
plane such that cr(M) is even for every 1-factor M of G. 
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F34: [ArLaSe91] 6(G) can be computed in polynomial time if G has bounded tree- 
width (cf. Chapt. 2). 


F35: [Va79a] ®(G) is #P-complete for bipartite graphs. 
F36: [DaLu92] ®(G) remains #P-complete even if the bipartite graphs are 3-regular. 


F37: [OkUeUn10] Determining ®(G) is #P-complete for chordal graphs, split graphs 
and even chordal bipartite graphs. 


F38: [OkUeUn10] The complexity of computing 6(G) is presently unsettled for the 
classes of bipartite permutation graphs, proper interval graphs, interval graphs and 
bipartite interval graphs. 


REMARKS 


R14: The problems of counting (and bounding) the number of 1-factors have important 
applications to the so-called dimer problem of crystal physics. Here the underlying 
(bipartite) graphs are d-dimensional rectangular lattices; the edges in the lattices are 
called dimers. (A perfect matching in such a lattice is called a dimer tiling.) We 
refer the reader to three references which give succinct overviews of the dimer problem: 
[KeRaSi96, BeSu99, Sc03]. 


R15: Recently, possible connections between the dimer problem and its relation to 
string theory and the theory of black holes have come under investigation (cf. [HeVa07].) 


5.4.3 Degree Factors 


REMARK 
R16: The names degree factors and component factors for the two main categories of 
factors treated in the literature seem to be due to Akiyama and Kano [AkKan85]. Note 


that they sometimes overlap, however. For example, a 2-factor is just a factor in which 
each component is a cycle. 


k-factors 


DEFINITIONS 
D25: A k-factor of a graph G is a k-regular subgraph that spans G. 


D26: A graph G is hypohamiltonian (respectively, hypotraceable) if G does not 
have a hamiltonian cycle (respectively, path), but G — v does, for all v € V(G). 


D27: For any graph G, let o2(G) = min{de(z) + da(y)|a,y € V(G),x # y, xy ¢ 
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D28: The face-width of a graph G embedded in a surface is the minimum number of 
times any non-contractible curve in the surface intersects G. (Also called representa- 
tivity; see Section 7.6.) 


NOTATION: The face-width of an embedded graph G is denoted fw(G). 


D29: The Euler genus of a surface © is twice the ordinary genus (or handle number) 
if the surface is orientable and equal to the ordinary genus (or cross-cap number) if the 
surface is non-orientable. 


FACTS 


F39: [EnJaKatSa85] If tough (G) > k, then G has a k-factor. (This was conjectured 
by Chvatal [Ch73].) 


F40: [Nis89] Let G be a graph and k an even non-negative integer. If 
K(G) > max{k(k + 2)/2, (k + 2)a(G)/4} 


then G has a k-factor. (a(G) and «(G) are the independence number and the connec- 
tivity, respectively.) 


NOTATION: We denote the minimum degree in a graph G by 6(G), and the minimum 
degree sum, deg(u) + deg(v), over all pairs of non-adjacent vertices u and v, by 02(G). 


F41: [IiNi91] Let k be a positive integer and let G be a graph of order n > 4k —5 with 
d(G) > k and kn even. If o2(G) > n, then G has a k-factor. (The condition on o2(G) 
is called an Ore condition after Ore who first introduced a condition of this type and 
showed it sufficient for the existence of a hamiltonian cycle.) 


F42: [KaYo02] Let G be a graph with |V(G)| > 4k +1 and let e be an edge of G. 
If o2(G) > |V(G)| + 1, then G contains a 2-factor F' such that F’ contains edge e and 
consists of exactly k cycles. 


F43: [iNi97] Let & > 2 be an integer and let G be a connected graph of order 
n with 6(G) > k. Suppose that kn is even and n > 9k — 1 — 4\/2(k—1)? +2. If 
|Ne(u) U Ne(v)| > (1/2)(n + k — 2) for each pair of non-adjacent vertices u and v, 
then G has a k-factor. (The sufficiency condition here is called a neighborhood union 
condition.) 


F44: [Nis92] Let G be a connected graph of order n and let k be an integer > 3 such 
that kn is even, n > 4k — 3 and 6(G) > k. Then if max{d(u),d(v)} > n/2, for all pairs 
of non-adjacent vertices u and v, G has a k-factor. 


F45:  [FaFaF1LiLi99] If a graph G is claw-free of order n with 6(G) > 4, then G has 
a 2-factor with at most [6n/(d(G) + 2)] — 1 components. Moreover, there is an O(n?) 
algorithm to construct such a 2-factor. 


F46: [Ku73] If k is a positive integer and the sequences d1,...,d,, and dj —k,...,dn—k 
are both graphical, then d,,...,d, can be realized by a graph G which contains a k- 
factor. 
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F47: [Kat83] If G is a graph and k is an even non-negative integer, then if G— v has 
a k-factor for all v € V(G), G also has a k-factor. 


F48: [Sa91] Suppose G is a graph with a 1-factor F and order at least four and let k 
be a positive integer. Then if G — {u,v} has a k-factor for each edge uv € F, G itself 
has a k-factor. 


F49: [Kat83] If G is either hypohamiltonian or hypotraceable, then G has a 2-factor. 


F50: [Nis90] Let G be a graph and m a non-negative integer. 
(i) If m > 2 and even, and 6(G) > m+1, then L(G) has a 2m-factor. 
(ii) If G is connected, |E(G)| even, and 6(G) > m+2, then L(G) has a (2m+1)-factor. 


F51: [Kat90] Let G be a bipartite graph with bipartition V(G) = X UY and k bea 
positive integer. Suppose that the following three conditions hold: 


Gy. |x| = is 
(ii) 6(G) = [|X|/2] 2 &; 
(iii) |X| > 4k — 4V& +1 when |X| is odd, and |X| > 4k — 2 when |X| is even. 
Then G has a k-factor. 


F52: [Nis91] If k > 2 is an integer and G is a connected graph with k|E(G)| even and 
if 6(L(G)) > (9k + 12)/8, then L(G) has a k-factor. 


F53: [Kan93] Let k be a positive integer and let G be a connected graph of order n 
with 6(G) > k, where kn is even and n > 4k — 3. If o2(G) > n, then G has both a 
hamiltonian cycle C and a k-factor F’, and hence, G has a connected [k, k + 2]-factor. 


F54: [AlEgFuOtSal11] 


(a) Let k and n be integers with k > 2 and n > 3, and let G be a k-edge-connected 
Ky n-free graph. If (k,n) 4 (2,3) and 6(G) > n-24 (n—-1)/(k -— 1), then G 
contains a 2-factor. 


(b) Let n > 3 be an integer. Every 2-connected Ky,,-free graph G with 6(G) > n 
contains a 2-factor. 


(c) Let n > 4 be an integer. Every (n — 1)-connected Ay ,,-free graph has a 2-factor. 


(d) A corollary to (a) says that if n > 3, every n-edge-connected K ,,-free graph has 
a 2-factor. 


F55: [KaOz11] (a) If G is a 4-connected graph embedded in a surface of Euler genus g, 
d(G) > 5, and fw(G) > 4g — 12, then G has a 2-factor. (b) If G is a 5-connected graph 
of even order embedded on a surface of Euler genus g and fw(G) > max{44g — 117, 5}, 
then G has a 3-factor. 
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REMARKS 


R17: [KatWo87, EgEn89] proved theorems similar to Fact F27, concerning the binding 
number. 


R18: [K]Wa73] gives an alternative proof of Fact F32 and also a polynomial algorithm 
for constructing the graph G containing the k-factor. 


R19: Suppose that G is r-regular and has edge-connectivity X. All values of k for which 
a multigraph G is guaranteed to have a k-factor are known [BoSaWo85]. Similarly, all 
such k are known when G is simple [NiesRa98]. 


R20: Hendry [He84] initiated the study of graphs with unique k-factors and his con- 
jecture on the maximum number of edges that such a graph may have was proved by 
Johann [Joh00a, Joh00b}. 


R21: There are many results on the existence of 2-factors having special properties 
such as (a) having a given number of cycles [BrChFaGoLe97]; (b) having cycles of 
specified lengths [Go01]; and (c) having k cycles containing k specified vertices [EgEn- 
FaLiSc03]. In [FaFaRy08] there is a complete list of single forbidden subgraphs and pairs 
of forbidden subgraphs which guarantee that a 2-connected graph contains a 2-factor. 


f-factors 


Whereas a k-factor is a subgraph with the same degree at every vertex, an f-factor may 
have a prescription of different degrees. 


DEFINITIONS 


D30: Let G be a multigraph possibly with loops and f a non-negative, integer-valued 
function on V(G). Then a spanning subgraph H of G is called an f-factor of G if 
deg y7(v) = f(v), for all v € V(G). 


D31: A set S C V(G) such that co(G— S$) > |S| is called a 1-barrier or antifactor 
set. (Recall that by Tutte’s 1-factor Theorem, a graph G with no 1-factor has a 1- 
barrier.) 


NOTATION: Let eg(A, B) denote the number of edges in graph G joining vertex sets A 
and B. 


FACTS 


F56: [Tu54] Tutte’s f-Factor Theorem: The graph G has an f-factor if and only 
if the following two conditions hold for all disjoint sets D,S C V(G): 


(i) f(D) — f(S) +de_p(S) —¢a(D, S, f) > 0, where qa(D, S, f) denotes the number 
of components C' of G— (DUS) such that eg(V(C), S) + f(V(C)) = 1( mod 2); 


(ii) f(D) — f(S) + de—v(S) — da(D, S, f) = F(V(G))( mod 2). 


F57: [Tu81] A graph G has an f-factor if and only if it does not have an f-barrier. 
(An f-barrier is a generalization of a 1-barrier. We omit the details here.) 
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F58:  [Ko01] Let G be a connected graph without multiple edges or loops and let p 
be an integer such that 0 < p < |V(G)|. Let f be an integer-valued function on V(G) 
such that 2 < f(v) < deg,(v) for all v € V(G). If every connected induced subgraph 
of order p of G' has an f-factor, then G' has an f-factor, or else )7,, f(v) is odd. 


F59:  [KatT's00] Let G be a graph, and let a < b be two positive integers such that 
d(G) => IV (@)| and |V(G)| > 8(b+a-— 3). If f is a function from V(G) to 
{a,a+1,...,b} such that >>, f(v) is even, then G has an f-factor. 

F60: [JaWh89] If G is a 2-edge-connected graph with a unique f-factor F’, then some 
vertex has the same degree in F as in G. 


REMARK 


R22: There is a procedure for reducing the f-factor problem on a graph G to the 
1-factor problem on a larger graph G’. See Chapter 10 of [LoP186]. 


[a,b]-factors 

DEFINITIONS 

D32: Let a and b be integers such that 1 <a<b. An |[a,b|-factor of a graph G isa 
subgraph H such that a < deg;;(v) < 6, for all vu € V(G). (Thus, it is an f-factor such 
that a < f(v) < 8, for all v € V(G).) 


D33: Let f bea function from V(G) to the odd positive integers. A spanning subgraph 
F of graph G in which degp(v) € {1,3,..., f(v)} is called a (1, f)-odd-factor of G. 


D34: A [k,& + 1]-factor is sometimes called an almost regular (or semiregular) 
factor. 


D35: A graph G is an [a, b]-graph if a < deg(v) < b, for every vertex v € V(G). 


NOTATION: Let i(G) denote the number of isolated vertices of G, and if S C V(G), let 
n(S,7) denote the number of vertices of degree j in G— S. 


FACTS 


F61: [HeHeKiLi90] If 1 < a < b, then G has an [a, b|-factor if and only if for every set 
SCV(G), Vocj<ala— J)n(S,5) <b: |S]. 


F62: [LiLi98] If G is a 2-connected claw-free graph, then G has a connected [2, 3]-factor. 


F63: [LiZhCh02] If G is a 2-connected claw-free graph containing a k-factor where 
k > 2, then G contains a connected [k, k + 1]-factor. 


F64: [KanSa83] Suppose that k,r,s and ¢ are integers such that 0 <k<rand1<t. 
If ks < rt, then an [r,r + s]-graph has a [k, k + t]-factor. 
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F65: [Nis94] If G is a graph, if a and b are integers such that 1 < a < b, and if 
6(G) > (a(G)/2) + 1, where a(G) is the independence number, then the line graph 
L(G) has an [a, b]-factor. 


F66: [La78, AmKan82] Let n > 2 be an integer, and let 7(G) denote the number of 
isolated vertices of graph G. The graph G has a [1, n]-factor if and only if i(G—S) < n|S|, 
for all S C V(G). This is an analogue of Tutte’s 1-factor theorem. 


F67: [YuKan88] Let G be a graph and f a function from V(G) to {1,3,...}. Then 
G has a (1, f)-odd-factor if and only if ¢.(G—S) < Viyeg f(v), for all § C V(G). This 
is another generalization of Tutte’s 1-factor theorem. (See also [KaKa03].) For an n® 
algorithm for finding a (1, f)-odd-factor see [KaKa07].) 


F68: [Lo70, Tu78] If G is r-regular, then G has a [k,k + 1]-factor for all k, O<k <r. 


F69: [Th81] If Gis an [r,r+1]-graph, then G has a [k, k+1]-factor for allk,O<k <r. 


REMARKS 


R23: A sufficient condition in the spirit of binding number for the existence of an 
[a, b]-factor is given in [Kan90a]. 


R24: A characterization of graphs with |a, b]-factors can be derived immediately from 
Lovasz’s (g, f)-factor theorem. (See Fact F70 below.) 


R25: [HiWo05] surveyed the area of [r,r + 1]-factors in [d,d + s]-graphs. Somewhat 
later, Hilton [Hi08, Hi09] introduced an additional parameter into the mix by asking: 
For which values of d,r and x does a [d,d+ 1]-graph admit an [r,r + 1]-factorization 
into x [r,r + 1]-factors? 


(9,f )-factors 


DEFINITIONS 


D36: Let G be a finite general graph, and let f,g be mappings of V(G) into the 
non-negative integers. A (g, f)-factor of G is a spanning subgraph F' such that g(v) < 
deg r(v) < f(v) for all v € V(G). 


D37: Let G be a finite general graph, and let f,g be mappings of V(G) into the 
non-negative integers. Graph G is said to have all (g, f)-factors if and only if G has 
an h-factor for every h such that g(v) < h(v) < f(v) for all v © V(G). Notice that if 
f =g = 1, then a (g, f)-factor (i-e., a (1, 1)-factor) is just a 1-factor. 


FACTS 


F70: [Lo70] (g, f)-factor theorem: The graph G has a (g, f)-factor if and only if 
both of the following conditions hold: 


(i) ec(V(C), 5) + f(V(C)) = 1( mod 2); 


(ii) f(D) — g(S) + dege_p(S) — Gg(D,S,9, f) = 0 for all pairs of disjoint sets 
D,S C V(G), where gc(D,S,g, f) denotes the number of components C' of 
G — (DUS) having g(v) = f(v) for all v € V(C). 
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F71: [La78] let G be a graph and f and g two integer-valued functions defined on 
V(G) such that 0 < g(x) < 1< f(a). Then G contains a (g, f)-factor if and only if for 
every subset X C V(G), the value f(X) is at least equal to the number of connected 
components C of G[V — X] such that either C = {x} and g(x) = 1, or |C| is odd and 
|C| > 3 and g(x) = f(x) =1 for alla EC. 


F72: [EgKan96] Let G be a graph and f and g functions from V(G) to the non-negative 
integers such that g(v) < degg(v),0 < f(v) and g(v) < f(v), for all v € V(G). If 


g(t) — _f) 
degg(x) ~ deg (y) 


for every pair of adjacent vertices x and y in G, then G has a (g, f)-factor. 


F73: [Nies98] Let G be a multigraph and let g and f be as in Fact F72 above. Then 
G has all (g, f)-factors if and only of 


z —1, if ; 
g(D) ~ f(8) + de-v(S) —a&(D, Sa, >4—" UAA9 

0, if f= g 
for all disjoint sets D,S C V(G), where gé(D,S,g, f) denotes the number of components 
C of G— (DUS) such that there exists a vertex v € V(C) with g(v) < f(v) or 
ec(V(C), 5) + f(V(C)) = 1( mod 2). 


F74: [XuLiTo98] Let n > 3 be an integer and let G be a K,,,-free graph. Let f and g 
be positive integer-valued functions on V(G), such that g(v) < f(v), for all v € V(G). 
If G has a (g, f)-factor, then G has a connected (g, f +n — 1)-factor. 


F75: [An92] Let G be a graph with the odd-cycle property (D8) and with a k-regular 
spanning subgraph, and let r be an integer, 1 < r < k. Then G has an r-regular 
spanning subgraph if either r is even or |V(G)| is even and r is odd. 


REMARKS 


R26: See also [AnNa98] for more sufficiency conditions for the existence of a (g, f)- 
factor, [An90, HeHeKiLi90] for simplified existence theorems for such factors, and 
[Kan84, Kan90b, Li89] for the existence of such a factor having additional properties 
such as including or excluding prescribed sets of edges. 


R27: It is apparently unknown whether there is a polynomial algorithm to test if a 
graph G has all (g, f)-factors. 


R28: For further information on the connections between network flows and graph 
factors see [FrJu99a, FrJu99b, FrJu99c, FrJu01, KoSt93]. 
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Factors in Random Graphs 


There are several popular models of so-called random graphs. We will be content to 
refer to only one of these. 


DEFINITIONS 


D38: Let 1,...,n be a labeling of the vertices and let {e,;},1 <i <j <n, be an array 
of independent random variables, where each e;; assumes the value 1 with probability 
p and 0 with probability 1— p. This array determines a random graph on {1,...,n} 
where each (ij) is an edge if and only if e;; = 1. It is denoted by Gpp. 


D39: An event E concerning a graph G € Gp» is said to hold asymptotically almost 
surely (or a.a.s.), if lim,—.. Prob E = 1. 
FACTS 


F76:  [ErRe66] Let n be even and p = (1/n)(logn + w(n)), with lim,_,.. w(n) = oo. 
Then G € Gp,» has a 1-factor a.a.s. 


F77: [ShUp81] Let p = (1/n)(logn+ (r—1) loglogn+w(n)), with r > 1 and suppose 
limp+oo w(n) = co. Suppose further that f is a mapping from V(G) into {1,...,7r} 
with $~;_, f(a;) even. Then G € G,,» has an f-factor a.a.s. 

REMARK 


R29: For several excellent treatments of random graphs, including their factors, 
see [Ka82, Ka95, JaLuRu00, Bo85, Bo01, MoRe02, ShUp81, ShUp82, JohKaVu08] and 
Section 8.2 of this Handbook. 


5.4.4 Component Factors 


DEFINITIONS 


D40: A path factor of graph G is a spanning subgraph of G each component of which 
is a path. 


D41: A P3-packing in a graph G is a subgraph F' of G such that each component of 
F is isomorphic to P3. If, in addition, F spans G, then F' is a P3-factor. 


D42: If H is a set of graphs, then an H-factor of a graph G is a spanning subgraph 
of G each component of which is a member of H. 


D43: An F-factor is a spanning subgraph in which each component is a single edge 
or an odd cycle. 


D44: A graph G is well-covered if every maximal independent set of vertices has the 
same size. 


D45: A graph G is factor-critical if G— «x contains a 1-factor for every x € V(G). 
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D46: Let R be a factor-critical graph with V(R) = {x1,...,an}, n > 3. Add new 
vertices {yi,..-,Yn} to R, together with the edges xjy;, 1 <i <n. The resulting graph 
H is called a sun. 


FACTS 


F78: [AkAvEr80] If G is a simple graph, then G has a path factor if and only if 
i(G—S) < 2|S|, for all S C V(G). 


F79: [KaLuYul0] Ifi(G—S) < |S|/2 for all S C V(G), then G contains a {K12, Ki3, K5}- 
factor. 


F80: [Kane03] Let c,(G) denote the number of sun components of G. A graph G has 
a Ps3-factor if and only if c,(G— S$) < 2|S|, for all S C V(G). 


F81: [KakeNi01] Every 2-connected claw-free graph of order n has a P3-packing which 
covers at least |n/3]| vertices. 


F82: [KoZy08]] Every 2-connected cubic graph of order n > 8 has a P3-packing which 
covers at least [9n/11] vertices. 


F83: [KakeNi01] Every 2-connected claw-free graph of order n has a P3-packing which 
covers at least |n/3]| vertices. 


F84: [AnEgKakKawMa02] Let d be a non-negative integer and let G be a claw-free 
graph with 6(G) > d. Then G has a path factor in which all paths have at least d+ 1 
vertices. 


F85: (St82] If G is a graph then G has an F-factor if and only if |N(S)| > |S|, for 
every independent S C V(G). (This result can be viewed as a generalization of Hall’s 


Theorem to the non-bipartite case.) 


F86:  [Mu79, HeKi8la] There is a polynomial algorithm for finding an F-factor or 
showing that none exists. 


F87: [RaVe06] If G is well-covered without isolated vertices, then G has an F-factor. 


CONJECTURE 


C1: [AkKan85]: Every 3-connected cubic graph of order 3n contains a P3-factor. 


REMARK 


R30: Graphs that contain, for each edge e, a Ps,-factor, k > 2, containing e are said 
to be Ps,-factor covered. Graphs which are P>2-factor covered and those which are 
Ps 3-factor covered are characterized in [ZhHeZh09]. 
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5.4.5 Graph Factorization 


Roughly speaking, one could classify graph factorization problems as one of two 
kinds: those in which the edge set is partitioned, and those in which the vertex set is 
partitioned. 


Edge Partitions 


DEFINITIONS 


D47: A 1-factorization of a graph G of even order is a partition of E(G) into 
edge-disjoint 1-factors. 


D48: A k-linear forest is a forest in which all components are paths of length at 
most k. 


D49: The k-linear arboricity of a graph G is the minimum number of k-linear forests 
which partition E(G). 


D50: A Hamiltonian partition of an r-regular graph G is a partition of E(G) in 
r/2 Hamilton cycles when r is even and (r — 1)/2 Hamilton cycles plus a 1-factor when 
r is odd. 


D51: An H-decomposition of G (or simply an H-decomposition when G is un- 
derstood) is a partition of E(G) into edge-disjoint subgraphs all isomorphic to H. 


NOTATION: We denote the edge connectivity of graph G by A(G). 


CONJECTURES 


C2: The 1-Factorization Conjecture: Let G be a simple graph of even order n. If G 
is regular with A(G) > n/2, then y’(G) = A(G); that is, G has a 1-factorization. (See 
[Wa97, Ch. 19].) 


C3: [AkExHa80]: The linear arboricity of every d-regular graph is [(d+ 1)/2]. 


C4: The Tree-Decomposition Conjecture [BaTh06]: For each tree T there exists a 
positive integer kr such that the following holds: if A(G) > kr, and |E(T)| divides 
|E(G)|, then there is a T-decomposition of G. 


C5: The P3-packing Conjecture [AkKan85]: Every 3-connected cubic graph on 3n 
vertices has a P3-factor. 


FACTS 


F88: [ChHi89],[NiesVo90] If one replaces (/7 — 1)n/2 by (V7 — 1)/2, then the 1- 
Factorization Conjecture becomes true. This result is regarded as the best to date 
toward the conjecture. 


F89: [PITi91] Let G be a regular multigraph of even order n and multiplicity u(G) <r. 
Then if A(G) > r(5n/6+1), y/(G) = A(G). This result may be viewed as an extension 
of Fact F88 to the multigraph case. 
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F90: Given « > 0, there isa number N = N(e) such that if G is a simple graph of even 
1 
order greater than N, and A > tS + )|V(G)|, then G is 1-factorizable. This provides 


evidence in favor of the truth of the 1-Factorization Conjecture for “large” graphs. (See 
[PeRe97] and Haggkvist (unpublished).) 


F91: [ZhZh92] Every k-regular graph of order 2n contains at least |k/2] edge-disjoint 
1-factors, if k > n. This result represents another approach to the 1-Factorization 
Conjecture. 


F92: [Kan85] Let a and b be integers such that 0 < a < b. Then 


(i) a graph G has a [2a, 20]-factorization if and only if G is a [2am, 2bm]-graph, for 
some integer m; and 


(ii) every [8m + 2k, 10n + 2k]-graph has a [1,2] factorization. 


F93: [YaPaWoTo00] Let G be a multigraph and let g and f be two functions mapping 
V(G) into the non-negative integers. Let m be a positive integer and ¢ an integer with 
0< <3 and = m( mod 4). If G is an (mg + 2|m/4| + €,mf — 2|m/4| — 2) graph, 
then G is (g, f)-factorizable. (See [Ya95] for other such results.) 


F94: [Fg86] Let k > 2 be an integer. 
(i) Every r-regular graph G with r > 4k? has a [2k, 2k + 1]-factorization. 


(ii) Every (k? — 4k + 2)-regular graph G has a [2k — 1, 2k]-factorization. 


NOTATION: We denote the number of components of a graph G by w(G). 
F95: [Tu61, Na61] Let & be a positive integer. 


(i) A connected graph G can be decomposed into & edge-disjoint connected factors 
if and only if k(w(G — L) — 1) < |L£], for all L C E(G). 


(ii) A graph G contains k& edge-disjoint spanning trees if and only if 
k(w(G — L) — 1) < |Z, for all L C E(G). 


F96: [Po71, Ku74, Gu83] If \(G) > 2k, then G contains at least k edge-disjoint 
spanning trees. (See also [Cat92, Pa01].) 


F97: [Krl11] If G has a factorization into two spanning trees, then it has such a 
factorization in which all leaves of both factors have degree at most 8 in G. 


F98: [Als80] If G is a connected vertex-transitive graph of order 2p, where p = 3 
(mod 4) is a prime, then G has a Hamiltonian partition. 


F99: The Tree-Decomposition Conjecture is true when T' is the 4-path Ps. For a 


proof see [Th08a], where it is also proved that if A(G) > 1910 then there is a P5- 
decomposition of G if and only if |E(G)| is divisible by 4. 


F100: In [Th08b] the same author shows that if \(G) > 171, has a Py-decomposition 
if and only if |E(G)| is divisible by 3. 
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CONJECTURE 


C6: El-Zahar’s Conjecture [E184]: If G is a graph with n =n, +--- +n, vertices and 
d(G) > fny/2] +--+ + [nz/2], then G has a 2-factor in which the cycles have lengths 
N1,---,Mp, respectively. 


El-Zahar himself proved the conjecture true in the case k = 2 and [CoHa63] when each 
n; = 3. Further partial results can be found in [Johs00]. Abbasi [Ab98] proved the 
conjecture true in the case when n = |V(G)| is sufficiently large. 


REMARKS 


R31: Perkovic and Reed [PeRe97] make the interesting observation that if the 1- 
Factorization Conjecture is true, then it follows that for any regular graph G, either G 
or its complement has a 1-factorization. 


R32: It is not difficult to show that the linear arboricity of a d-regular graph is at least 
the bound given above. It is the inequality in the opposite direction that has proved 
intractible so far. See [A188, LiWo98] for further details. 


R33: See [PITi01] for extensions of 1-factorization to the multigraph case. 


R34: It is pointed out in [Gu83] that if there are M edge-disjoint spanning trees in a 
graph, one can find them all in polynomial time using a matroid partition algorithm. 


R35: [ShSe92, ElVa99] The complete bipartite graph K,,,, can be factored into iso- 
morphic spanning trees if and only if (m+n—1)|mn. 


R36: In order for a complete graph K,, to factor into isomorphic spanning trees, n 
must be even. The area of spanning tree factorization of the complete graph K2,, has 
attracted considerable attention. For an overview of this area, see [Kovl1, KoKu09]. 


R37: In [BaLiDaWi99] the relationship between the problem of finding k edge-disjoint 
spanning trees in hypercubes and the development of efficient communication algorithms 
in parallel computing architectures is discussed. 


R38: Connected [a, b|-factors are studied in [Nal0] and connected (f,g)-factors in 
[EINaVo02]. The subject of connected factors of various kinds is surveyed in [KoVe05]. 


R39: The study of factorizations of Ke, and Ky, has evolved into an area of com- 
binatorics all its own that includes (in the latter case) the widely studied discipline of 
latin squares. We refer the interested reader to [Wa97, DeKe74, DeKe91]. 


R40: A partition of the edge set of a graph G into matchings (not necessarily perfect) 
is exactly an edge coloring of G. For much on the topic of edge coloring, we refer the 
reader to [JeTo95]. 
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Vertex Partitions 


When does a graph G admit a decomposition of its vertex set V(G) = V; U--- V, such 
that the induced subgraphs G; = G[V;] have certain specified properties? We present 
several results of this genre. 


DEFINITION 


D52: For any graph G, let o2(G)= min{dg(x)+de(y)|z,y € V(G),« #4 y, xy ¢ E(G)}. 


FACTS 


F101: [Gy78, Lo77|] Let G be a k-connected graph and suppose v1,...,v,% are k 
distinct vertices of G. Suppose further that |V(G)| =n = n, +---+ nx is a partition 
of |V(G)| = n into k positive parts. Then there exists a subgraph G’ of G with the 
following three properties: 


(i) G’ consists exactly of k components. 
(ii) Each of the components contains exactly one of the vertices v;. 


(iii) The component containing v; contains exactly n; vertices. 


F102: [EnMa97] Let G be a graph of order n and suppose n = a, +---+ a, is a 
partition of n where each a; > 2. Suppose 6(G) > 3k — 2. Then given any & distinct 
vertices v1,...,Unr € V(G), V(G) can be partitioned as V(G) = A, U---U Ax such that 
|A;| = a;, vu; © A; and 6(G[Aj;]) > 0, for all 1 <i<k. 


F103: (a) [Th83] For each pair of positive integers (s,t), there exist positive in- 
tegers f(s,t) and g(s,t) such that each graph G with «(G) > f(s,t) (respectively, 
6(G) > g(s,t)) admits a partition of its vertex set V(G) = SUT such that the induced 
subgraphs G[S] and G[T] have connectivity (respectively, minimum degree) at least s 
and t, respectively. (b) [Ha83] Moreover, if s > 3 and t > 2, then f(s,t) < 4s + 4¢— 13. 


REMARKS 


R41: Fact F102 has been used to derive best known error bounds in certain branches 
of coding theory [CsK81]. 


R42: A nice survey of vertex partitions into cycles and paths versus 6(G) and o2(G) 
may be found in [En01]. 


R43: Conditions sufficient to guarantee the existence of an H-factor, for all connected 
HT with |V(#)| < 4, are summarized and discussed in [EgFuOt08]. 
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Factor Algorithms and Complexity 


DEFINITIONS 


D53: Let G be an arbitrary graph. A G-factor of a graph H is a set {Gi,...,Ga} 
of subgraphs of H such that each subgraph G; is isomorphic to G and the sets V(G;) 
collectively partition V(G). 


D54: The G-factor recognition problem FACT(G): INSTANCE: A graph H. 
QUESTION: Does H admit a G-factor? 


D55: The clique partition number of a graph G is the smallest number cp(G) such 
that there exists a set of cp(G) cliques in G such that the cliques form a partition of 
E(G). 


D56: A graph G is chordal if every cycle in G of length greater than 3 has a chord. 


D57: The H-decomposition Problem: Given a fixed graph H, can the edge set of 
an input graph G be partitioned into copies of H? 


D58: If graph G admits a partition of its edge set into t isomorphic subgraphs, then 
we say that G is divisible by t. (An obvious necessary condition for G to be divisible 
by ¢ is that the number of graphs in the partition must divide |E(G)|.) 


D59: [El88a, El88b] A graph G is t-rational if G is divisible by t or if t J |E(G)|. 


D60: The t-Rational Recognition Problem (or the Isomorphic Factorization 
Problem): Given a graph G and a positive integer t, is G a t-rational graph? Note 
that ¢ and G form the input to the problem. A candidate for subgraph H is not part of 
the input. 


D61: An 6-separated matching is one in which, given any two edges e and e’ of the 
matching, the length of a shortest path from a vertex of e to a vertex of e’ is at least d. 
(A 2-separated matching is more often called an induced matching.) 


D62: A problem is said to be in the class NC if it can be solved in parallel time 
polynomial in the logarithm of the size of the input by a set of parallel processors the 
number of which is polynomial in the input size. 


FACTS 
NOTATION: In the next group of facts, n = |V(G)|. 


F104: The first polynomial algorithm for matching in an arbitrary graph was formu- 
lated by Edmonds [Ed65] and has come to be popularly known as the blossom algorithm. 
Its running time is O(n*). 


F105: The fastest algorithm to date for maximum matching in a general (i.e., not 
necessarily bipartite) graph with m edges has complexity O(m,/n) and is due to Mi- 
cali and Vazirani [MiVa80]. (See also [PeLo88].) (Curiously, a proof of correctness 
of this algorithm was not published until fourteen years later! (See [Va94]).) Since 
the Micali-Vazirani algorithm was introduced, two other matching algorithms [GaTa91, 
B190] having the same complexity as Micali—Vazirani have been produced. 
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F106: Faster matching algorithms exist, however, in certain special cases. If the 
graph is 3-regular and has no cutedge, then by the classical result of Petersen [Pe1891], 
the graph must have a 1-factor. In this case, an O(nlog*n) algorithm is given in 
[BiBoDeLu01] for finding a 1-factor. An O(n) algorithm is also given, if, in addition, 
the graph is planar. 


F107: [StVa82] The maximum 6-separated matching problem is NP-complete for 6 > 2, 
even when restricted to bipartite graphs of degree 4. 


F108: The Gabow, Kaplan, and Tarjan algorithm [GaKaTa99, GaKaTa01] cited above 
can be modified to test whether a graph has a unique f-factor and find it, if it exists, 
and to check whether a given f-factor is unique, all in polynomial time. 


F109: Anstee [An85] gave algorithmic proofs of both the (g, f)-factor theorem and the 
f-factor theorem and his algorithms either return one of the factors in question or show 
that none exists, all in O(n*) time. Note that this complexity bound is independent of 
the number of edges in the graph and also independent of g and f. 


F110: A polynomial algorithm for finding a 2-factor, if one exists, was first found 
by Edmonds and Johnson [EdJo70]. If one additionally demands that the 2-factor 
be triangle-free, the problem remains polynomially solvable. (See [CoPu80].) If one 
demands that the cycle lengths to be disallowed form a non-empty subset of {5,6,...}, 
the problem has been shown to be NP-hard [HeKikKrKr88]. The complexity in the two 
remaining cases, namely, where only 4-cycles are forbidden or where only triangles and 
4-cycles are forbidden, remains unresolved. 


F111: The problem of deciding whether or not a graph has a hamiltonian cycle is one of 
first decision problems proved ([Ka72, Ka75]) to be NP-complete. The problem remains 
NP-complete, even if the graphs are restricted to be 3-regular and planar [GaJoTa76] 
or 4- or 5-regular and planar [Pi94]. 


F112: The answer to the factor recognition problem F ACT(K},) is (trivially) always 
“ves” and so FACT (K,) € P. Problem FACT(K2) is just the question of the existence 
of a perfect matching in H and hence also lies in P. More generally, if G consists of a 
disjoint union of copies of Ky and K2, then FACT(G) belongs to P. 


F113: [KiHe83] If any component of G has more than two vertices, then FACT(G) is 
NP-complete. 


F114: [Kaw02] Let G be a graph of order 4k with 6(G) > 5k/2. Then G contains a 
ik, -factor, where kK, denotes the complete graph Ky with one edge removed. 


F115: [ShWaJu88] The problem of determining cp(G) is NP-hard, for the class of 
K4-free graphs and for the class of chordal graphs. However, the problem is polynomial 
for the class of graphs which are both Ky4-free and chordal. 


F116:  [Ho8laj The problem of determining the chromatic index of a graph is NP- 
complete. If G is bipartite, however, see Fact F117. 


F117: KG6nig’s Edge-Coloring Theorem [Kol6a, Kol6b]: If G is bipartite, then 
x'(G) = A(G). The proof yields an O(mn) algorithm (m = |E(G)|) to produce an 
optimal edge-coloring. 


512 Chapter 5. Colorings and Related Topics 


F118: Presently, it seems that either an algorithm of Kapoor and Rizzi [KaRi00] or an 
algorithm of Schrijver [Sc98a] is best for edge-coloring a bipatite graph G', depending 
upon the relative sizes of |V(G)| and A(G). If the bipartite graphs involved are regular, 
then even faster algorithms exist. (See [Ri02].) 


F119: [Ho81b] Suppose n > 3. Then the problem of partitioning E(G) into copies 
of K,, is NP-complete. Holyer used the above result to prove five other edge partition 
problems to be NP-complete in the same paper. 


F120: [BrLo95] If H has no connected component with three or more edges, then the 
H-decomposition Problem is polynomial. 


F121: [DoTa97| The H-decomposition Problem is NP-complete whenever H contains 
a connected component with three edges or more. (See also [AlCaYu98].) 


F122: [BeHo97] There is a polynomial algorithm that finds a factorization of any given 
4-regular graph into two triangle-free 2-factors or else shows that such a factorization 
does not exist. 


F123: [Wo84] If r > 2t, then almost all labeled r-regular graphs cannot be factorized 
into t > 2 isomorphic subgraphs. 


F124: But curiously, there is no known example of a regular non-factorizable graph as 
in Fact F123 which satisfies the obvious necessary divisibility condition: z| |E(G)|. 


F125: [El Wo88] Let G be a multigraph and suppose ¢ is an integer such that t > y’(G). 
Then G is ¢-rational. 


F126: If Gis r-regular and t > r +1, then G is t-rational. This follows from Vizing’s 
theorem. 


F127: [E188a] Let G be a 2k-regular graph of even order that contains no 3-cycles or 
5-cycles. Then E(G) can be partitioned into 2k isomorphic subgraphs. Moreover, this 
factorization can be constructed in polynomial time. 


F128: [ScBi78, HaRoWo78] Given the complete graph K,, then there exists a graph 
HT such that K,, is the edge-disjoint union of t copies of H if and only if n(n —1) = 
0( mod 2t). 


REMARKS 


R44: It is unknown whether or not the problem of finding a maximum matching 
(or 1-factor) is in the parallel class NC. For a general reference on this subject, see 
[KaRy98]. 


R45: [HaWa77] provides some observations about connections between the Isomorphic 
Factorization Problem and combinatorial designs. Even the subject of 1-factorizations 
of graphs (that is, where the isomorphs are 1-factors) is an enormous topic unto itself 
and quickly leads one into the discipline of combinatorial design theory. See the excellent 
surveys [StGo81, MeRo85] and the encyclopedic volume [Wa97]. 
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R46: The so-called packing problems are closely allied to factor and factorization 
problems. Here instead of searching for a factor of a particular kind in a given graph 
G, one seeks a subgraph of G of maximum order which admits the factor. See [LoP090] 
for a nice survey of the state of the art. 


CONJECTURES 


C7: [BeHo97]: The problems of recognizing 
(a) which 2n-regular graphs factor into two triangle-free n-factors, and 
(b) which 2n-regular graphs factor into n triangle-free 2-factors 


are both NP-complete for all n > 3. 


C8: [Hi85]: Let G be a d-regular simple graph of order 2n and let d = py + --- +p, 
be a partition of d. If d > n, then G has a factorization into edge-disjoint subgraphs 
H, U---UH,, where H; is regular of degree p;. (The author proves the conjecture true 
in various special cases.) 


Subgraph Problems 


DEFINITIONS 


D63: The k-regular Subgraph Recognition Problem: given a graph G, does it 
contain a k-regular subgraph? (Here we do not require that the k-regular subgraph 
span G.) If k = 1 or 2, clearly the problem takes only polynomial time. 


FACTS 


F129: [Ga83] Let G be a k-regular graph of order n, and let v(G) denote the minimum 
number of extra vertices needed to ensure that there exist a (k + 1)-regular supergraph 
of G. 


(i) If the graph G has a 1-factor, then v(G) = 0. 
(ii) If the graph G has no 1-factor and if n and k are of opposite parity, then v(G) = 1. 


(iii) If the graph G has no 1-factor and n and k are of the same parity, then n < 2k 
and v(G) =k +2. 


F130: [Ta84] Every 4-regular simple graph G contains a 3-regular subgraph. (However, 
the proof here does not provide an algorithm for finding the 3-regular subgraph.) 


F131: [Ple84] The k-regular Subgraph Recognition Problem is NP-complete for all 
k > 3. 


F132: [EgOt99] If G is a graph with |V(G)| > 4k+6 and 6(G) > k+2, then G contains 
k, pairwise vertex-disjoint claws (i.e., copies of Ky,3). (The claws are not considered to 
be induced.) 
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REMARKS 


R47: There are hundreds of papers in the literature dealing with a wide variety of 
“subgraph problems” as well as “graph decompositions.” The reader is referred to the 
survey papers [ChGr81, Di90b, Ro90, Be96] and to the books [Bo90, Di90a, CoRo99]J. 


R48: There are indeed infinite analogs of some of the matching and factor theorems for 
finite graphs. See [Ra49, Ah84a, Ah84b, Ah88, Ah91, AhMaSh92, AhNa84, AhNaSh8g3, 
Br71, HoPoSt87, St77, St85a, St85b, St89, Nied91, NiedPo94]. 


R49: Since the first edition of this Handbook was published, several new books and 
survey articles on graph factors and factorization have appeared. The reader is referred 
to [AkKan11, KoVe05, P107, YuLi09]. 
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INTRODUCTION 


The construction of timetables for educational institutions and other organizations 
is a rich area of research with strong links to graph theory, especially to node- and 
edge-coloring, bipartite matching, and network flow problems. A significant amount of 
recent research has developed powerful hybrids of graph coloring/meta-heuristic meth- 
ods. The purpose of this section is to demonstrate how graph theory plays a pivotal role 
in timetabling research today and to provide insight into the close relationship between 
graph coloring and a range of timetabling problems. We concentrate on four timetabling 
problems: class-teacher timetabling, university course timetabling, university examina- 
tion timetabling and sports timetabling, and we illustrate some of the key points that 
have underpinned graph-theoretical approaches to timetabling over the years. We aim 
to highlight the role of graph theory in modern timetabling research and to provide 
some pointers to the relevant literature for the interested reader. 


Automated Timetabling: Historical Perspective 


The problem of developing computer programs and systems to solve timetabling prob- 
lems has been addressed by the scientific community for over 40 years. Bardadym in his 
1995 survey [Ba96] examines the distribution of educational timetabling publications 
from 1960 to 1995. This shows a significant growth in educational timetabling research 
throughout the 1960s and into the 1970s. There is a lowering of interest in the late 
1970s, which picks up again in the 1980s and reaches a peak of over 60 published papers 
in 1995 alone, the year of the 1st International Conference on the Practice and Theory of 
Automated Timetabling (PATAT) [BuRo96]. In 1996 the European Association of Op- 
erational Research Societies Working Group on Automated Timetabling was launched, 
and today it has over 300 members from more than 60 countries. 
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REMARKS 


R1: The broad definition of the timetabling problem covers a wide variety of important 
scheduling problems, which include school timetabling, university course timetabling, 
examination timetabling, sports timetabling, transport timetabling and a wide variety 
of employee timetabling and rostering problems. 


R2: Welsh and Powell [WeP067] observed the relationship between the graph-coloring 
problem and timetabling in 1967. This relationship has been a significant feature of 
timetabling research ever since. A broad generation of timetabling algorithms was 
based upon graph-coloring methods. 


R3: It is not our purpose to survey all of these approaches. Carter’s 1986 survey paper 
[Ca86] on examination timetabling provides an excellent review of the early examina- 
tion timetabling methods, and Carter and Laporte updated this survey paper in 1995 
[CaLa96]. There are a number of other timetabling survey papers that cover the field 
(e.g., [de85-b], [Ba96], [Wr96], [BuJakiWe97], [CaLa98], [Sc99], [QuBuMcMeLc09], and 
[Pil0]). 


TERMINOLOGY: Throughout this section, node is used instead of vertex. 


5.5.1 Specification of Timetabling Problems 


Timetabling problems are complex and vary widely in structure. Our definition is 
general enough to cover most cases. 


The General Problem 


DEFINITION 


D1: A timetabling problem is a problem with four parameters: T, a finite set of 
times; R, a finite set of resources; M, a finite set of meetings; and C, a finite set of 
constraints. The problem is to assign times and resources to the meetings so as to satisfy 
the constraints as far as possible. The parts of this definition are elaborated below. 


Times 


Although it is possible to allow arbitrary time intervals for meetings, in practice time 
is usually discretized by dividing it into a fixed finite set of intervals of equal length. 


DEFINITIONS 


D2: A time t is an element of the set of times T of an instance of the timetabling 
problem. 


D3: A time slot is a variable constrained to contain one time. 
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FACTS 
F1: Time slots are occasionally preassigned (fixed to a particular value in advance). 


F2: In practice, constraints involving time often use information about the actual time 
intervals being represented. For example, a constraint could specify that two time slots 
must contain times whose underlying time intervals are directly adjacent, or that a set 
of time slots must contain times that are spread fairly uniformly through the week, and 
So on. 


F3: Some timetables recur: they are repeated every week, or every two weeks, etc. 
School and university course timetables recur. Other timetables are used only once 
(e.g., examination timetables). 


Resources 


Meetings contain teachers, rooms, items of special equipment, students (or groups of 
students), and so on, which we call resources. 


DEFINITIONS 


D4: <A resource r is an element of the set of resources R of an instance of the 
timetabling problem. 


D5: A resource slot is a variable constrained to contain one resource. 


FACT 


F4: Resource slots are often preassigned (fixed to a particular value in advance). 
Student group slots are usually preassigned. 


EXAMPLE 


E1: The basic constraint of timetabling, that no resource appear in two meetings that 
share a time, applies equally to teachers, students, and rooms, and hence, these items 
are often treated together as part of a meeting. Other constraints may be specialized for 
different resources. For example, if a meeting contains several times, it may be required 
that a particular teacher be present in that meeting for all of those times, whereas in 
filing the room slot it may be acceptable to use a split assignment, that is, to assign 
different rooms at different times. 


Meetings 


DEFINITION 


D6: A meeting m is a named collection of time slots and resource slots. Assigning 
values to these slots means that all of the assigned resources attend this meeting at all 
of the assigned times. 
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EXAMPLES 


E2: In examination timetabling, one meeting will usually represent one examination 
and contain: one time slot, a large number of preassigned students (those students 
enrolled in the corresponding course), and one or more room slots. 


E3: In school timetabling, one meeting will usually represent one subject studied 
through one week, and will contain some small number of time slots, one preassigned 
student group slot, one teacher slot (often preassigned) and one room slot. 


E4: In staff rostering, one meeting will represent the total staff requirements for one 
time interval, and will contain one preassigned time and a number of staff slots, not 
preassigned. 


Constraints 


Timetabling practitioners have documented dozens of different constraints in the many 
organizations they have investigated, so it is not possible to give a comprehensive list 
of constraints in such a general setting. When evaluating constraints against solutions 
it is convenient to assign a value of 0 to perfectly acceptable outcomes, and to assign 
progressively higher values to less acceptable outcomes. 


DEFINITIONS 


D7: Let S be the set of all solutions to a given timetabling problem. A hard constraint 
is a constraint that must be satisfied. Associated with each hard constraint is a binary- 
valued function h: S — {0,1}, defined for each solution w € S' by 


AG 1, if w does not satisfy the constraint 
W)= 
0, otherwise 


D8: A feasible solution is any solution w € S that satisfies all the hard constraints, 
i.e., h(w) = 0 for all h. 


D9: Let S be the set of all solutions to a given timetabling problem. A soft constraint 
is a constraint that it is desirable, but not necessary, to satisfy. Associated with each 
soft constraint is a function s: S > Z+. The interpretation is that a solution w € S$ 
for which s(w) is small is preferred. 


D10: Let S be the set of all solutions to a given timetabling problem. The badness 
function of that problem is a function b: S > Zt that encapsulates in a single number 
b(w) an overall rating for a solution w € S. 


D11: The completeness constraint requires that every time slot receive a value. 


D12: The no-clashes constraint (or no-conflicts constraint) requires that each 
resource not participate in any two meetings that share a time. 
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D13: The availability constraint specifies that a particular resource is only available 
for a certain subset of the times JT’. For example, a part-time teacher might be available 
only on Thursdays and Fridays. 


EXAMPLE 


E5: Let the hard constraints for a given problem be hj, h2,...,hn and the soft con- 
straints be 81, $2,...,5m. A common approach is to choose a badness function that is 
a weighted sum of these values: 


b(S) =D _/wihi(S) + D /wj85(5) 


where the weights v; and w; are nonnegative integers chosen to reflect the importance 
of the corresponding constraints, with the v; much larger than the w;. 


REMARKS 


RA: In university course timetabling, the no-clashes constraint would typically be a 
hard constraint for lecturers but a soft constraint for students as far as optional courses 
are concerned (since it is usually impossible to satisfy every student). 


R5: When a resource slot is not preassigned, it almost always carries a resource 
type constraint, which specifies that the value is constrained to some subset of R. For 
example, a slot may require one English teacher or one science laboratory. Within 
the basic categories (rooms, teachers, etc.) these subsets are typically not disjoint; for 
example, some English teachers may also teach history. Preassignment can be viewed 
as a type constraint that constrains a slot to a subset of size 1. 


R6: The availability constraint for a particular resource may also be expressed by 
creating an artificial meeting that contains just that resource and those times when the 
resource is to be unavailable for actual meetings. 


R7: Examples of other constraints often considered are: each teacher is to have at 
least one hour free each day; each student is to have a lunch hour; large gaps between 
classes during any one day should be minimized; walking time between classes is to be 
minimized; etc. 


5.5.2 Class-Teacher Timetabling 


Class-teacher timetabling is a special case of the general problem in which each 
meeting contains one preassigned student-group slot, one preassigned teacher slot, and 
any number of time slots. We first consider this basic version of the problem, and 
then generalize it to school problems (pre-college), in which students are timetabled in 
groups rather than individually. School problems are characteristically dominated by 
hard constraints, since constraint violations that might be acceptable when they affect 
one individual are unacceptable when they affect an entire student group. 
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The Basic Class-Teacher Timetabling Problem 


DEFINITIONS 


D14: The basic class-teacher timetabling problem [Go62] is a timetabling problem 
in which each meeting contains one preassigned student-group slot, one preassigned 
teacher slot, and one completely unconstrained time slot. The no-clashes constraint is 
a hard constraint and applies to every resource. 


D15: A proper edge-coloring in a graph G is a mapping of the edge-set E(G) to a 
set of colors such that adjacent edges are assigned different colors. 


D16: The edge-chromatic number of a graph G, denoted y‘(G), is the minimum 
number of different colors required for a proper edge-coloring of G. 


FACTS 


F5: There is no requirement that each student group and teacher meet exactly once, 
or indeed at most once. We could allow each meeting to contain any number & of 
unconstrained time slots, since that would be equivalent to having k meetings between 
the given student group and teacher. 


F6: The class-teacher timetabling problem can be modeled as an edge-coloring prob- 
lem in a bipartite graph [Be83, de85-a]. Each student group is represented by a left 
node, each teacher is represented by a right node, and each meeting m is represented by 
an edge between the nodes corresponding to the student group and teacher preassigned 
to m. If a student group and teacher meet k times, there will be k parallel edges between 
the two corresponding nodes. Assigning a time to a meeting corresponds to assigning 
a color to the corresponding edge; the no-clashes constraint is equivalent to requiring a 
proper edge-coloring. 


F7: An obvious lower bound on the edge-chromatic number of a graph, and hence, on 
the number of different times needed to timetable an instance of the basic class-teacher 
problem, is the maximum vertex degree. K6nig’s theorem (Fact F8) asserts that for the 
basic class-teacher problem, this is an upper bound as well. 


NOTATION: The maximum vertex degree in a graph G is denoted A(G). Sometimes, 
when the context is clear, we use A. 


F8: [Kol6] Let G be a bipartite graph. Then x/(G) = A. (See [GrYe06], §9.3, for a 
proof.) 


REMARKS 


R8: A timetable using A different times can be constructed in low-order polynomial 
time [Be83]. The algorithm is based on finding maximum matchings in a bipartite 
graph. Matchings are discussed in $11.3 of the Handbook. 


R9: The connection between class-teacher timetabling and edge-coloring in a bipartite 
graph was first made by Csima [Cs65], according to [ScSt80]. 
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Extensions to the Basic Class-Teacher Problem 


We give some examples of extensions to the basic class-teacher timetabling problem. 
These and others are described in [Pil0] and [Pol]]. 


EXAMPLES 


E6: Some teachers may be available for only certain subsets of the full set of times. 
This was the first timetabling problem, identified as such, shown to be NP-complete 
[EvItSh76]. Allowing some times to be preassigned is essentially the same case, since 
meetings with preassigned times reduce the availability of the teachers within them. 


E7: Multiple time slots within meetings may be constrained to be contiguous. There 
is an easy reduction from the bin packing problem [GaJo79], where the bins are days, 
showing that this problem is NP-complete. 


E8: Some meetings may be “group meetings” involving several student groups com- 
ing together for a large lecture. This problem is NP-complete, but there is a good 
approximation algorithm [Asde02]. 


E9: Room slots may be added to the meetings. In most school-timetabling problems, 
each student group attends some class at every time, and therefore there must be at least 
as many rooms as there are student groups. In that case, if rooms are not differentiated 
into different types, each student group can be permanently allocated to some room. 
If rooms are typed or preassigned, we have an NP-complete problem equivalent to the 
basic problem with teacher unavailabilities described in Example E6. 


Graph Models for Subproblems of the Class-Teacher Problem 


It is frequently the case that intractable timetabling problems have tractable subprob- 
lems that may be useful to solve within a larger framework. If the subproblem has no 
solution, then the entire problem is infeasible (and analysis of the model can uncover 
the deficiency). If the subproblem reveals that there is only one feasible assignment for 
some slot, then that assignment might as well be made immediately [Go62]. 


MODELING EXAMPLES 


E10: Suppose we need to determine whether a set of meetings can be scheduled to run 
simultaneously. First we must check that preassignments or other constraints on their 
time slots do not preclude this. Then we must check that the combined resource slots 
of all these meetings can be covered by the complete set of resources R. This is trivial 
if all the resource slots are preassigned (simply check that no resource is used twice), 
but in general these slots will be constrained to overlapping subsets of R. 


Bipartite Graph Model: Each resource slot becomes a left node, each available 
resource in R becomes a right node, and an edge joins slot s to resource r whenever r is 
an acceptable resource for slot s. The meetings may run simultaneously if a matching 
touching every left node exists [CoKi93]. 


E11: Example E10 generalizes to multiple times in a way that allows us to check 
whether the resources and times available can cover all the meetings. 
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Bipartite Graph Model: There is one left node for each possible triple (m, ts, 1s), 
where m is a meeting, ts is a time slot from m, and rs is a resource slot from m. These 
triples represent indivisible units of demand for one resource at one time. There is one 
right node for each possible pair (r,t), where r is a resource and t is a time when r 
is available. These pairs represent indivisible units of supply (of resources). An edge 
joining a triple to a pair means that the given constraints are not violated by the implied 
time and resource assignment. For example, if ts is preassigned we would join triples 
containing it only to pairs containing its preassigned time; if rs requires an English 
teacher we would join triples containing it only to pairs containing resources r that are 
teachers whose capabilities include English. Clearly, if there is no matching that touches 
every triple, then the problem is infeasible. 


E12: We may have a partial solution in which some time slots have been assigned 
times and others have not. We ask whether we can extend this set of time assignments 
by assigning workable times to all currently unassigned time slots in the set M, of all 
meetings containing a particular fixed resource r (e.g., a student group). These time 
slots must be assigned distinct times, otherwise there will be a clash involving r. 


Bipartite Graph Model: The left nodes are the time slots of /,., and the right nodes 
are all the times of T. Create an edge between each time slot that is already assigned 
and the time it has been assigned. For each time slot that has not been assigned, create 
an edge between it and each of its allowable times. A time is allowable for a time slot 
if, when the time slot’s meeting is added to those meetings that already contain this 
time, the resulting collection of meetings can run simultaneously. The meetings may be 
assigned allowable times if there exists a matching in the resulting graph that touches 
every time slot node [de85-a, CoKi93]. 


E13: If the times of all meetings are preassigned it may be possible to create models 
for assigning teachers. For example, suppose that all meetings occupy one time and 
may be taught by all teachers, but that each teacher is available for a limited set of 
times and for a limited total number of classes. This problem, which arises in allocating 
staff to university tutorials, can be modeled as a network flow problem. 


Network Flow Model: From the source there is one edge directed to a teacher- 
node for each teacher, with capacity equal to the maximum number of classes for that 
teacher. From each teacher-node there is one edge with capacity 1 for each time that 
that teacher is available. Each such edge is directed to a time-node that represents the 
set of all meetings assigned that time. This time-node receives edges from all teachers 
available at that time. From each time-node, there is an edge directed to the sink, with 
capacity equal to the number of simultaneous classes allowed at that time. A minimum- 
cost network flow model would allow the inclusion of soft constraints such as teacher 
preferences for certain times. 


REMARK 


R10: For a discussion of minimum-cost network flow, see, for example, [Pa82] or §11.2 
of the Handbook. 
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5.5.3. University Course Timetabling 


University course timetabling differs from the basic class-teacher timetabling prob- 
lem essentially by the fact that each student may in principle choose the courses of his 
program, and that there are no other classes of students that are given beforehand and 
that follow exactly the same program. 


Basic Model 


The following notation will be used for the rest of this subsection. 


NOTATION: Let C = {C\,...,C,} denote a collection of courses to be offered during the 
week W, where W is viewed as a set of time periods. We assume that each course Ci 
consists of c; one-period lectures, that is, C; = {C},C?,...,Cf*}. For each student s¢, 
let S; be the collection of courses chosen by student s;. 


DEFINITIONS 


D17: A course timetable is an assignment to each course C; a set C; C W of c; time 
periods, one for each of its c; lectures. 


D18: Given a course timetable, a conflict occurs if for some student s;, there exist 
two courses C;,C; € S; such that C; Cj #0. In other words, there are two courses 
chosen by student s; that have at least one lecture at the same time. 


D19: The university course timetabling problem is to produce a conflict-free (or 
feasible) course timetable. 


REMARKS 


R11: For the moment we assume that there are no capacity obstacles (i.e., the class- 
rooms are large enough and a course may accommodate any number of students). 


R12: It may occur that with a given set of data, no feasible timetable can be found. In 
such a case we may need to relax our requirements and consider allowing certain con- 
flicting lectures to occur. The resulting timetabling problem becomes one of minimizing 
the severity of the conflicts. A measure of the severity of a conflict is the number of 
students who have elected to take both of these lectures. This is formalized in Definition 
D20 below. 


A Graph Formulation 


Our graph model consists of nodes representing lectures, and edges joining pairs of 
these nodes, where the edges are weighted according the severity of the conflicts they 
represent. 


DEFINITIONS 


D20: The (penalty) weight, w;;, of a conflict between two lectures C7 and C} is the 
number of students who have to take both of these lectures, i-e., 


wis = |{t]Ci,C; € S}| 
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D21: A conflict graph G is an edge-weighted graph defined as follows: for each 
course C;,, there are c; nodes, 
CC. to 


representing its lectures. For each pair i,j, i # j, if wi; > 0, then an edge with weight 
wiz is created between nodes C7 and C} for each pair r,s, r # s. In addition, an edge 
with weight oo is created between nodes C7, C? for each possible pair r,s, representing 
a prohibitive penalty corresponding to two lectures of the same course. 


NOTATION: The edge joining nodes x and y is denoted [x,y]. This causes no ambiguity 
here because conflict graphs have no multi-edges. Some other sections of the Handbook 
use (x,y) or xy to denote simple adjacency between x and y. 


D22: In a graph G, a subset of mutually non-adjacent nodes is called a stable (or 
independent) set of nodes. 


D23: A proper node-coloring of a graph G is an assignment of colors to the nodes 
of G such that adjacent nodes receive different colors. A proper node k-coloring is a 
proper node-coloring that uses k different colors. 


FACTS 


F9: The timetabling problem reduces to finding a partition P of the node-set V(G) 
into k = |W| subsets, S1,...,5,, that minimizes the total penalty 


k 
xP) = >_ (wiz | CT, C8 € Su) 


u=1 


F10: It is easy to see that there is a one-to-one correspondence between feasible 
(conflict-free) timetables and partitions P with z(P) = 0: given such a partition, CT € 
S,, means that lecture r of course C; is scheduled at period u € W. 


F11: A partition P for which z(P) = 0 gives rise to a proper node-coloring, obtained 
by assigning the same color to each node in one cell of the partition so that different 
cells get different colors. Conversely, given a proper node-coloring, the node-subsets 
receiving the same color (called color classes) form a partition with z(P) = 0. Thus, 
there exists a feasible timetable in k = |W| periods if and only if G has a proper node 
k-coloring. 


F12: Node-coloring models are more general than edge-coloring models: one can always 
transform an edge-coloring instance into a node-coloring instance in an auxiliary graph, 
but the converse is not true. 


F13: For some classes of graphs, the determination of the smallest & for which there 
exists a node k-coloring (the chromatic number) is easy; it is in particular the case 
for perfect graphs (see [Be83]). But in general the problem is NP-hard. Node-coloring 
(vertex-coloring) is discussed in detail in §5.1 and 85.2. 
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EXAMPLE 


E14: Figure 5.5.1 gives an example of a university timetabling problem. In the basic 
model introduced above, we have not mentioned the teachers giving the various courses. 
We have assumed that all courses are to be taught by different teachers. Should this not 
be the case, we would simply introduce edges with a prohibitively large weight between 
lectures (of different courses) that have to be given by the same teacher. This would not 
change the nature of the problem, which remains a node-coloring problem in a graph 
or a weighted extension as shown above. 


C, = 3 lectures Si = C1,C2 wi2 = 
C2 = 2 lectures So = C2, C3 wi3=1 
C3 = 2 lectures S3 = C1, C3, C4 wy=l 
C4 = 1 lecture S4 = C3,C4 w23 = 1 
Ss = Co wos = 
W34 = 2 


prohibitive weight ——————~00 


weight 


Figure 5.5.1: An example of university timetabling. 


Ci,C} : period1 C?,C? : period3 
C?,Ci : period 2 C3,Ct : period 4 


2(P)=1 (edge[C},C3]) 


Scheduling Multi-Section Courses 


Suppose that a collection of m courses, {C,,C2,...,Cm}, has to be scheduled. Assume, 
for notational simplicity, that each course consists of a single weekly lecture and that 
there are exactly h; sections of course C;, where hy > hg >--- > hm. The following 
four-step strategy produces a timetable for all sections of all m courses, in advance, 
that can accommodate any collection of student groups, {g1, 92,---,;9n}, as long as no 
more than h; of those groups need course C;. The strategy is followed by an example 
illustrating each step on a sample problem. 
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Step 1: Construct the bipartite graph G* = (L*, R*, E*), where as a left set, we have 
L* = {C1,C2,...,Cm}, the right set R* = {1,2,...,hi}, and for each 1 = 1,2,...,m, 
(C;, i] € E* for j =1,2,...hi. 


Step 2: Produce a set of feasible colors for the sections of each course. 
e If A is the maximum degree of nodes in G*, then clearly, A = max{m, hy}. 


e From K6nig’s theorem (Fact F8), G* has a proper edge A-coloring, which can be 
constructed easily. 


e For this edge-coloring of bipartite graph G*, let p(C;), « = 1,2,...,m, denote 
the set of colors used for the edges incident on node (course) C;. Observe that 
|p(C;)| = hi, 7 =1,2,...,m. 


Step 3: Given an actual collection G = {91, 92,..-,9n} of student groups, construct a 
bipartite graph G** = (L**, R**, E**), where the left set L** = {C1,C2,...,Cm}, the 
right set R** = {g1, 92,---,9n}, and for each pair 7,j,i = 1,2,...,mandj =1,2,...,n, 
edge [Ci 9;] € E** if and only if student group g; needs course Cj. 

Step 4: Assign the collection G of student groups to the sections of courses C1, C2,...,Cm 
without changing the time-period of any section. 


NOTATION: Let g(C;) denote the set of student groups needing course Cj. 


FACT 


F14: [Ha83] Given the bipartite graph G** defined in Step 3, if degg+«(C;) = h; for 
each i, then there is a proper edge A-coloring of G** such that the edges incident on node 
C;, are assigned the feasible colors of C; obtained from Step 2. (See also [AsDeHa98].) 


REMARK 


R13: In terms of the timetabling problem, Fact F14 says that if the number of student 
groups that need course C; equals the number of sections that have been scheduled for 
Ci, ie., |g(Ci)| = |p(Ci)| (= hi), i = 1,2,...,m, then there exists an assignment of the 
student groups to sections such that each student group gets the courses it needs and 
the original set of time-periods for the sections of each course is unchanged. 


EXAMPLE 


E15: (Step 1) The graph G* with m = 5 and (h, he,...,hs) = (6,4, 4, 2,1) is shown 
in Figure 5.5.2. 


(Step 2) For the graph in Figure 5.5.2, A = 6, and a proper edge 6-coloring using colors 
{a,b,c,d,e, f} is represented by the following matrix whose (i, 7)th entry is the color 
assigned to edge [Ci, j]. 


123 4 5 6 
Ci fabede f 
Co! b e da 
C3) c dae 
Cy da 
Cs e€ 
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Figure 5.5.2: Bipartite graph G* for m = 5 and (hy, ho,...,hs) = (6,4, 4, 2,1). 


Thus, the corresponding timetable for the courses is given by the following sets p(C;) 
of feasible colors (time-periods) for the sections of course C;, i = 1,2,...,m: 


(C1) 
p(C2) = {a,b,d, e} 
p(C3) = {a,c,d, e} 
p(C4) = {a, d} 
p(Cs) = {fe} 


(Step 3) The bipartite graph G** shown in Figure 5.5.3 below represents the specific 
requirements of seven student groups, g1, 92,---,97-. For instance, group g; needs courses 
C1, Co, and C4, and group ge needs courses C and C4. 


Figure 5.5.3: Bipartite graph G**. 
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(Step 4) A proper edge 6-coloring for bipartite graph G** in Figure 5.5.3 is represented 
by the matrix below. Observe that the colors used for the edges incident on a given 
course-node are precisely that node’s feasible colors determined in Step 2. 


9. 92 93 94 95 G6 97 


Ci ff ee  d a bee 
Co] d b e a 
C3 d coa e 

Cy | a d 

Cs € 


5.5.4 University Examination Timetabling 


Basic Model 


Examination timetabling differs from university course timetabling in a number of ways. 
However, the very core of the problem can be considered to be the same. We have a 
collection of exams F,..., £, that have to be assigned time slots (periods) and rooms. 
The number of periods that are available can play a crucial role. In many universities the 
number of periods extends over a time length of two to four weeks. The constraints that 
characterize the examination timetabling problem are quite different from constraints 
that are important in course timetabling. 


DEFINITION 


D24: Given an examination timetable, a conflict occurs if two exams taken by the 
same student are scheduled in the same time period. 


FACTS 


F15: In examination timetabling it is often desirable (or necessary) to have several 
exams allocated to the same room. It would, of course, not be very sensible to assign a 
number of lectures to the same room! 


F16: In examination timetabling, it is usually considered desirable to spread exams 
out over the number of periods so that students do not have exams in succession. 
On the other hand, for course timetabling it is often considered undesirable to spread 
the lectures out. Students tend to prefer to have lectures in contiguous blocks. The 
prototype problem given in Example E16 below has seven exams (F1,..., £7) that it 
has to allocate to five time periods P,,..., Ps. It only attempts to satisfy the constraint 
that no student can attend more than one examination at the same time. We say that 
there is a conflict in the timetable if that constraint is not satisfied. 
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EXAMPLE 
E16: For our prototype problem, there are seven exams (F,..., £7) to assign to five 
time periods P,,...,P5 such that there are no conflicts. In our graph model, nodes 


represent examinations, and edges join two nodes whose corresponding exams have at 
least one student in common. Weights on the edges between two nodes (exams) can 
represent the number of students who have to take both of those exams. The graph 
model is shown in Figure 5.5.4 below. Exam 1 only conflicts with Exam 4 (seven stu- 
dents need to take both exams). However, Exam 2 has one, seven, and three students in 
common with Exam 4, Exam 5, and Exam 7, respectively. This simplified examination- 
timetabling problem is directly analogous to the node-coloring problem where the colors 
are represented by the periods and is very similar to the graph-theoretical models dis- 
cussed earlier. The solution to this simplified problem, shown in Figure 5.5.5, uses all 
five colors (periods). 


(on) 


Figure 5.5.4: A graph model for our prototype problem. 


Period 1 Period 2 Period 3 Period 4 Period 5 
Exam-1 Exam-3 Exam-2 Exam-6 Exam-7 
Exam-5 Exam-4 


Figure 5.5.5: A solution to the problem using all the colors (periods). 


A More Compact Schedule 


Is the solution given in the table above a good solution? If we consider the problem 
purely as producing a coloring that uses the fewest possible colors, then the answer is 
clearly no. An alternative coloring can be seen in Figure 5.5.6. 


Period 1 Period 2 Period 3 Period 4 Period 5 


Exam-1 Exam-3 Exam-2 
Exam-5 Exam-4 
Exam-6 


Exam-7 


Figure 5.5.6: A solution to the problem that uses only three colors. 
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REMARK 


R14: The solution in Figure 5.5.6 uses only three colors (periods), rather than five, so 
it is clearly better in terms of the number of colors used. However, if we consider it as 
an examination timetabling problem, then a student who has to take Exam 1, Exam 2, 
and Exam 7 would (almost certainly) consider the solution in Figure 5.5.5 to be better. 
On the other hand, a university administrator might think that the solution in Figure 
5.5.6 is better than the one in Figure 5.5.5 because it gets the exams completed more 
quickly. 


FACTS 


F17: The node-coloring problem can be considered to be an underlying model of 
examination timetabling, but for realistic applications there are a number of other 
constraints, both hard and soft, that need to be considered. For instance, room capacity 
is a hard constraint, but avoiding having students take consecutive exams is a soft 
constraint. 


F18: In situations where no proper node-coloring exists, the objective might be to 
minimize the number of students having conflicts. But a further complication is weighing 
this consideration against other features of an examination timetable: how spread out 
the exams are, how many days are used, etc. 


DEFINITION 


D25: The quality of a solution to an examination timetabling problem can be 
defined as a measure of the level of satisfaction of the soft constraints (provided all the 
hard constraints are satisfied). 


The Breadth and Variation of Exam Timetabling Constraints 


In 1996, Burke, Elliman, Ford, and Weare [BuE]IFoWe96] published a paper that ana- 
lyzed and discussed the results of a questionnaire completed by examination-timetabling 
administrators from 56 British universities. The aim of this exercise was to deter- 
mine the nature of the problem as it occurred in British universities (circa 1995). The 
questionnaire concentrated upon 13 constraints but also asked administrators to in- 
clude other constraints thought to be important to their institution. An additional 19 
constraints were listed. The 32 constraints demonstrate the breadth and variation of 
requirements and priorities among British universities. 


REMARK 


R15: The message for developers of examination-timetabling decision-support software 
is that if a system is to be generic and widely applicable, then it has to be flexible in 
designating which soft constraints are important and which are not. It has to allow the 
user to weight soft constraints according to the needs and requirements of the user’s 
own institution. 
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Heuristic Methods 


Early approaches to solving the examination-timetabling problem [Br64], [Co64] em- 
ployed heuristic construction methods. As mentioned in the introduction, the analogy 
with graph coloring was observed by Welsh and Powell in 1967 [WePo67]. This ob- 
servation has underpinned the development of examination-timetabling methods. The 
survey paper [Ca86] and its sequel by Carter and Laporte [CaLa96] provide an excel- 
lent overview of the development of graph-coloring-based heuristic approaches for the 
examination-timetabling problem. 


FACTS 


F19: One of the basic approaches for solving the examination-timetabling problem is 
to construct the timetable by sequentially placing exams into periods according to some 
measure (heuristic) of how difficult the exams are to schedule (see [Ca86]). The early 
approaches mentioned above used this strategy. 


F20: The strategy of scheduling the most troublesome exams first corresponds to col- 
oring the nodes in the graph model that are expected to be the most difficult. Examples 
of four of the most common node-coloring heuristics used in examination-timetabling 
systems are presented below. 


HEURISTICS 


H1: Largest Degree: This heuristic takes the nodes with the largest degree (number 
of edges) and schedules them first. This corresponds to the exams that have the most 
conflicts with other exams. 


H2: Largest Weighted Degree: This heuristic is similar to Largest Degree except 
that the edges are weighted by the number of students who are involved in the conflict. 


H3: Color Degree: Here, we first schedule the exams that have the largest number of 
conflicts (degree) with the other exams that have already been placed into the timetable. 


H4: Saturation Degree: This heuristic chooses first those exams that have the least 
number of available periods in the timetable that can be selected without violating hard 
constraints. 


REMARKS 


R16: While these four heuristics do not form an exhaustive list, they do cover the 
key node-coloring-based heuristics used in examination-timetabling. Examples of these 
and other similar approaches include [Br64], [Co64], [WePo67], [Wo68], [Me81], [Me82], 
and [BuE]We94]. For further discussion, see the survey papers [de85-b], [Ca86], [Ba96], 
[Wr96], [BuJaKiWe97], [CaLa96], [CaLa98], and [Sc99]. 


R17: There is an obvious limitation with the simple timetable construction method 
outlined above (independent of the heuristics used). Exams scheduled early in the 
process might make certain other exams impossible to schedule later on. This can be 
addressed by adding a backtracking component to the process. An algorithm that gets 
stuck can unschedule or re-schedule exams. Examples of this kind of approach can be 
found in [CaLaCh94] and [CaLaLe96]. 


Section 5.5. Applications to Timetabling 547 


R18: Consistent with the philosophy of scheduling the most troublesome exams first, 
Carter and his colleagues investigated methods based on finding a maximum clique 
of the conflict graph. A maximum clique is a largest subgraph where each node is 
adjacent to every other node. See [CaJo01] for more details about the role of cliques in 
examination timetabling. 


Two Different Random-Selection Strategies 


Burke, Newall, and Weare [BuNeWe98-a] use a random element in the process of select- 
ing the next exam to schedule. This approach produced good results quickly and can 
be seen as a compromise between the relatively simple coloring-heuristic-based methods 
discussed earlier and the more complex meta-heuristic methods (discussed briefly be- 
low), which generally require much more computational time. The two randomization 
approaches that are considered in [BuNeWe98-a] are described in the next two examples. 


(1) A random subset of exams is selected, and the most difficult from within the 
subset is selected (according to some heuristic). 


(2) The x most difficult exams to schedule are selected (according to some heuristic), 
and then one of those x exams is selected at random. 


Hybrid Graph-Coloring /Meta-Heuristic Approaches 


Throughout the 1990s, meta-heuristic approaches, such as simulated annealing, evolu- 
tionary methods, and tabu search, were investigated and developed for various timetab- 
ling problems. Significant progress has been made by combining the more modern meta- 
heuristic methods with some of the older graph-coloring-based methods. For a discussion 
of the advantages and disadvantages of using such approaches for timetabling, see [de85- 
b], [Ca86], [Ba96], [Wr96], [BuJaKiWe97], [CaLa96], [CaLa98], and [Sc99]. To find out 
more about the meta-heuristics themselves, see [Gl1Ko03]. 


EXAMPLES 


E17: Dowsland and Thompson [ThDo96-a, ThDo96-b] implemented a simulated an- 
nealing /graph-coloring hybrid approach for solving the examination-timetabling prob- 
lem at the University of Wales Swansea. Their method works in two phases. The first 
phase satisfies the hard (binding) constraints: 


all exams to be scheduled within 24 time slots, 
no student clashes to be allowed, 


certain pairs of exams to be scheduled at the same time, 


certain groups of exams to be scheduled in order, 


) 
) 
) 
d) certain pairs of exams to be scheduled at different times, 
) 
) certain exams to be scheduled within time windows, 

) 


no more than 1200 students to be involved in any one session. 
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The second phase of this simulated annealing approach attempts to optimize the soft 
constraints of the problem: 


(a) minimize the number of exams with over 100 students scheduled after period 10, 


(b) minimize the number of occurrences of students having exams in consecutive 
periods. 


E18: Burke, Newall, and Weare in 1998 [BuNeWe98-b] used graph-coloring heuristics 
(Largest Degree, Color Degree, and Saturation Degree) to construct initial solutions that 
were then fine-tuned by memetic algorithms. Memetic algorithms refer to evolution- 
ary methods (often genetic algorithms) combined with local search (often hill-climbing). 
The memetic algorithm that they investigated was based upon one that had already 
been shown to work well on benchmark examination-timetabling problems [BuNeWe96]. 


E19: Burke and Newall used the heuristics outlined above in conjunction with a 
decomposition approach [BuNe99]. The authors investigated ways of decomposing 
large problems into smaller subproblems, which were then solved using memetic algo- 
rithms. However, the authors noted that the decomposition approach is independent 
of the method that is used to solve each of the subproblems. Decomposition had been 
previously addressed by Carter [Ca83]. 


E20: Di Gaspero and Schaerf [DiSc01] presented an approach, based on the work 
of Hertz and de Werra [Hede87], that combined graph-coloring heuristics and tabu 
search. They employed weights on the edges to represent the number of students who 
were involved in the conflicts between the corresponding pairs of exams, and they also 
employed weights on the nodes to indicate the number of students taking the exams. 
For a range of benchmark problems, their method was competitive with (and in some 
cases, better than) state-of-the-art methods in 2001. 


REMARK 


R19: A potential drawback with the decomposition approach described in Example 
E19 is that exams can be assigned time slots in earlier subproblems that then lead 
to the infeasibility of later subproblems. Burke and Newall employed graph coloring 
heuristics to build the subproblems in order to tackle this difficulty. For the problems 
they considered in [BuNe99], the approach that used the saturation-degree heuristic 
along with using a subproblem size of 50 exams for the smaller problems and 100 exams 
for the larger problems was the most effective one. They also employed a look-ahead 
approach to try and detect difficulties. It considered two subproblems together and 
fixed the solution to the ith one only after it had solved the (i + 1)th one. 


5.5.5 Sports Timetabling 


This section focuses on modeling and solving some basic problems occurring in the 
construction of season schedules for sports leagues. We show how the design of some 
round-robin tournaments can be modeled as an edge-coloring problem in a digraph. Such 
a model should then be extended to handle more general constraints that arise when a 
season schedule involves travel that should be optimized. This traveling-tournament 
problem (TTP) is described in [EaNeTr03]. Instead of discussing the general problem 
here, we concentrate on a simple model using elementary properties of graphs. We use 
the terminology of [Be83] for general graphs and that of [de81] for sports scheduling. 
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DEFINITIONS 


D26: A (single) round-robin tournament for a set of | teams is a collection of 
games such that each team plays each other team exactly once. Each game is played in 
one of the two teams’ home city. 


D27: If the game between teams i and 7 is played in the home city of team 7, then 
the game is a home game, H, for team 7 and an away game, A, for team i. 


D28: Given a sports league consisting of 1 teams, a basic sports timetable 
(schedule) (for a round-robin tournament) has two components for each pair of teams 
i and 7: 


e designating the day on which the game between 7 and 7 is played; 


e designating the home city for that game. 


REMARK 


R20: For the rest of this subsection, we assume that the sports league consists of 2n 
teams for some integer n. 


A Simple Graph Model 


The league of 2n teams is identified with the node set of a graph G, and an edge joining 
node 7 and node j corresponds to a game between team i and team j. Observe that if 
a round-robin tournament is to be scheduled, graph G is the complete graph Kop. 


NOTATION: An undirected edge between node i and node j is denoted [i, 7]. A directed 
edge from i to 7 is denoted (7,7) and indicates that the game is a home game for team 
j and an away game for team 2. 


DEFINITIONS 


D29: Let G be the 2n-node graph representing a league of 2n teams. An oriented 
d-coloring of graph G is a proper edge-d-coloring together with an assignment of a 
direction to each edge. This oriented d-coloring results in a digraph, each of whose arcs 
is assigned one of the d colors. This arc-colored digraph specifies a sports timetable 
using d days for the 2n teams: the arcs that are assigned color cx, correspond to those 
games that are scheduled for day k, and the arc (i, 7) indicates that the game between 
teams 7 and j is a home game for 7 and an away game for 7. 


NOTATION: The digraph created from an oriented coloring of a graph G' is denoted G. 


TERMINOLOGY: When a round-robin tournament is to be scheduled (i.e., G is Kon), the 
digraph G is a tournament. This family of digraphs is covered in detail in 83.3. 


D30: Let {c1,c2,...,ca} be the colors used for an edge d-coloring of a graph G. For 
each color cy, k = 1,2,...,d, the color class My, is the set of edges assigned color cx. 
For a given oriented d-coloring of graph G, M; denotes the set of arcs assigned color cx. 
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D31: A factor of a graph (digraph) G is a subset F’ of edges (arcs) such that every 
node of G is incident on exactly one edge (arc) in F’. 


D32: A d-factorization of a graph G is a partition, {F), F2,..., Fa}, of the edge-set 
of G such that each F; is a factor of G. A graph G is d-factorizable if there exists 
a d-factorization of G. A d-factorization {F\, Fo, ed ., Fy} of the arcs of a digraph is 
defined analogously. 


TERMINOLOGY NOTE: A factor in an undirected graph is also called a perfect matching 
and is actually a 1-factor, where an r-factor is an r-regular, spanning subgraph of G. 
Matchings are discussed in §11.3, regular graphs are introduced in §1.2, and graph 
factors and factorization are discussed in §5.4. 


FACTS 
F21: A d-factorization {F,,..., Fu} of graph G induces a proper edge d-coloring of G, 
obtained by assigning color cz to each of factor F,, k = 1,2,...,d. Thus, if a graph G 


is d-factorizable, then there exists a proper edge d-coloring of G. 


F22: Analogous to Fact F21, an oriented d-coloring of a graph G induces a d-factoriz- 
ation of the digraph G. 


F23: The d-day schedule that corresponds to a d-factorization of a graph G has the 
property that each team plays a game on each of the d days, i.e., no team has a day off. 


TERMINOLOGY: Sometimes, an oriented d-coloring, its induced d-factorization of the 
arc-set of the resulting digraph, and the corresponding schedule will all be regarded as 
the same thing. 


EXAMPLE 


E21: Figure 5.5.7 shows an oriented 5-coloring (using colors 1,2,...,5) of a complete 
graph G = Kg representing a league of 2n = 6 teams. 


5 4 


Figure 5.5.7: An oriented 5-coloring of Kg. 
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The corresponding 5-factorization, (Fy, Pes Psy: of the digraph G is shown in table below 
(Figure 5.5.8). The ith row of the table lists the arcs assigned color i. This factorization 
specifies the complete 5-day schedule for the six teams. In particular, the arcs in the 
ith row indicate the games scheduled on the ith day. For instance, on day 2, team 3 
plays team 1, and team 1 is at home. 


game 1 game 2 game 3 


F, (day1) / (1,6) (2,5) ~— (4,3) 
Fy (day2) { (6,2) (3,1) (5,4) 
F; (day3) (3, 6) (4, 2) (I, 5) 
F, (day4) | (6,4) (5,3) ~— (2,1) 
F's (day5) (5, 6) Gh 4) (3, 2) 
game 1 game2 game 3 
F, (dayl) / (1,6) (2,5) ~— (4,3) 
Fy (day2) { (6,2) (3,1) (5,4) 
Fs (day3) | (3,6) (4,2) ~— (4,5) 
F, (day4) | (6,4) (5,3) ~— (2,1) 
F's (day5) (5, 6) (ls 4) (3, 2) 


Figure 5.5.8: A compact schedule. 


Observe that each team plays a game on each of the five days, which illustrates Fact 
F23. This kind of compact schedule always exists for a round-robin tournament of 2n 
teams because the complete graph K2, has a (2n — 1)-factorization, where each factor 
has n edges. 


Profiles, Breaks, and Home-Away Patterns of a Schedule 


DEFINITIONS 


D33: Let S be a schedule for a league of 2n teams. The home-away pattern (HAP) 
associated with S' (see [de81]), denoted H(S), is a 2n x (2n — 1) array defined by 


A an away game 
hir(S) = 4 H > if team 7 has ¢ a home game ? on day k 
0 no game 


D34: For a given an (2n — 1)-day schedule S' for a league of 2n teams, the profile of 
team i is the ith row of H(S). Thus, the profile is the sequence of H’s and A’s indicating 
when team 7 is home and away for the 2n — 1 days. 


D35: For a given schedule S$, the profiles of two teams are complementary if for each 
day in the schedule, one of the teams is at home and the other is away. 


D36: For a given schedule S, team i has a break on day (k +1) if hix(S) = hin41(S). 
In other words, the profile of team 7 has two consecutive H’s or two consecutive A’s, 
with the second one falling on day (k + 1). 
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EXAMPLE 


E22: Figure 5.5.9 shows the HAP associated with the oriented 5-coloring of Figure 5.5.7 
and its corresponding schedule in Figure 5.5.8. The breaks are indicated by underlining. 
Notice that teams 4 and 5 have complementary profiles. 


days 


teams 


Figure 5.5.9: The HAP associated with the schedule of Figure 5.5.8. 


A Lower Bound on the Number of Breaks 


Often in round-robin tournaments, one tries to construct schedules in which for each 
team, home games and away games alternate as regularly as possible (i.e., the number 
of breaks is minimized). 


DEFINITION 
D37: A subset of mutually non-adjacent nodes in a graph G is called a stable (or 


independent) set. The independence number of G,, denoted a(G), is the maximum 
size of a stable set. Some other sections of the Handbook use ind(G) instead of a(G). 


FACTS 
F24: [de88] Let G be a d-factorizable graph on 2n nodes, and let (Hy Ps 
be a d-factorization arising from an oriented d-coloring of G. Then the corresponding 


schedule has at least 2(n — a(G)) breaks. 


F25: Since the independence number of a complete graph equals 1, any oriented 
(2n — 1)-coloring of Kon, (Fi,...,Fan—1), has at least 2n — 2 breaks. 


REMARK 


R21: Fact F25 implies that the schedule given in Figure 5.5.8 has a minimum number 
of breaks. 
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Irreducible and Compact Schedules 


DEFINITIONS 


D38: A schedule is irreducible if, whenever two teams play against each other, at 
most one of them has a break on that day. 


D39: A schedule is compact if each team plays one game on each day (i.e., its HAP 
has no ¢ symbols). 


FACTS 


F26: A compact d-day schedule corresponds to a d-factorization of the associated 
graph. 


F27: In a compact schedule, if there is a team with an A in its profile for days k and 
k +1, there must be another team with an H in its profile for days k and k+ 1. Thus, 
in a compact schedule, breaks occur in pairs. (In Figure 5.5.9, teams 2 and 3 and teams 
4 and 5 are two such pairs.) 


F28: By reversing the orientation of some arcs (corresponding to games with a break 
for each one of its teams), one may always generate from a schedule S' an irreducible 
schedule that does not have more breaks than S. For the rest of this section, we assume 
(without loss of generality) that the schedules we consider are irreducible. 


NOTATION: Fact F29 below uses the following notation. Given a compact schedule S 
constructed on a d-regular graph, b; denotes the ith day on which breaks occur, and 4; 
is the number of breaks occurring on day b; (where 2 < b) < bp <--- <b) <d). In 
addition, we define bb) = 1 and bp4; =d+1. 


F29: Let G be a d-regular graph. Then the following conditions are equivalent: 


(1) There exists a compact schedule S constructed on G, where 2-y; breaks occur on 
day b;,2=1,...,p. 
2) The edge-set of G can be partitioned into subsets Fy,..., #41 such that 
g P 
(a) The edge subset EF; induces a (b; — bj1)-regular bipartite graph with vertex 
bipartition {X;, X;} fori=1,...,p +1. 
(b) |Xin1 XG] = [XiN Xia] =% fori=1,...,p. 


EXAMPLE 


E23: For the compact schedule S in Figure 5.5.8 (and its corresponding HAP in Figure 
5.5.9), (bo, 01, be, 63) = (1,3,5,6) and y, = 72 = 1, and it is easy to see that condition 
(1) of Fact F29 is satisfied. 


To show that condition (2) is satisfied, let Ey = Fy U Fo, Ey = F3 U Fy, and F3 = Fs. 
Then the vertex bipartitions of the induced subgraphs are: 

Xe Sth 4 XG = 13,56} 

X» = {1,3,4}, X2 = {2,5,6} 

X3 = {1,3,5}, X3 = {2,4,6} 


It is now straightforward to verify that condition (2) is also satisfied. 
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Complementarity 


Another property of compact schedules that is of interest in practice is complementarity. 


DEFINITION 


D40: A compact schedule S for K2, has the complementarity property if the 2n 
teams can be grouped into n disjoint pairs T),...,7;, such that the two teams in each 
T; have complementary profiles. 


FACTS 


F30: [de88] If S is a compact schedule (for K2,,) such that each team has at most one 
break, then S has the complementarity property (by Fact F27). 


F31: If S is a compact schedule (for K2,,) with exactly 2n — 2 breaks, then S' has the 
complementarity property. 


F32: There are compact schedules with the complementarity property where some 
teams have more than one break. 


EXAMPLES 


E24: Consider the compact schedule S in Figure 5.5.8 and its corresponding HAP, 
given in Figure 5.5.9. The three pairs T; = {1,6}, To = {2,3}, and 73 = {4,5} show 
that S has the complementarity property. 


E25: Figure 5.5.10 shows a 3-factorization of G = K4 that corresponds to an irreducible 
compact schedule S. Its HAP shows that S does not have the complementarity property 
(team a has two breaks). 


a d ag —< od a da d 
b c bO—<—Oc b c b c 
Ky Fi Fy F3 


day 1 day 2 day 3 


team a A A A 
team b A A HH 
team c A A H 
team d A A A 


Figure 5.5.10: An irreducible compact schedule of K4. 
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Constructing a Compact Schedule with a Minimum Number of 
Breaks 


We restrict our attention to the most common case, when G = K2,,. Algorithm 5.5.1 
below gives a simple construction that produces an oriented coloring (and hence, a 
schedule) having exactly 2n — 2 breaks, which, by Fact F25, is the minimum. 
EXAMPLE 


E26: Figure 5.5.11 illustrates Algorithm 5.5.1 for Keg. Observe that the schedule 
reproduces the oriented 5-coloring given in Figure 5.5.7. 


ne 5 7 2 a 2 5 ss +. 
mA et Ne 


4—>3 
F, Fo Fs F4 Fy 


Figure 5.5.11: The 5-day schedule of Kg produced by Algorithm 5.5.1. 


Algorithm 5.5.1: A (2n—1)-Day Schedule of K2, With 2n — 2 Breaks 


Input: Complete graph Kon. 
Output: A (2n — 1)-day schedule with 2n — 2 breaks. 


Step 1. Construct a (2n — 1)-factorization of Kon: 
For z= 1 to 2n—-1 
PF, = {[2n, i} U {[i+k,i—k] (mod 2n—-1):k = 1,2,...,n—1} 
Step 2. Orient the edges: 
For 7 = 1 to 2n—-1 
If 7 is odd 
Orient edge [2n, 7] as (7, 2n) 
Else 
Orient edge [2n, 7] as (2n, 7) 
Fork =1ton-1 
If k is odd 
Orient edge [i + k,i — k] as (i+ k,i— k) 
Else 
Orient edge [i + k,i — k] as (i k,i+k) 


REMARK 


R22: The factorization specified in Step 1 of the algorithm is discussed in [Be83, 
Chapter 5]. It is called a canonical factorization [de88]. 


556 Chapter 5. Colorings and Related Topics 


An Alternate View of the Canonical Factorization 


Let Qn, 01, 21, @2, 82,03,---,An—1,; Pn—1, Pn denote the nodes 1,2,...,2n, respectively, 
and let (Fi,...,Fon-1) be the canonical factorization produced by Algorithm 5.5.1. 
Consider the partition (FE), E2,...,F,) of the edge-set of K2,, defined by EB, = Fon-1 
and EB; = fy, U Fa, = 1,...,n—1. 


FACTS 
F33: Fori=1,2,...,n—1, EF; defines a 2-regular bipartite graph on node sets 
X; = {Qi, it toes: »Qn, 21, Bo, ea ,Bi-1} 


and X; = sichewerees ace » Bn, 1,02, oe ae aan 


In addition, XiNXi41 = q; and Xi Xia = 6;. Thus, by Fact F29, it defines a schedule 
where nodes a; and 3; have a simultaneous break on day b; = 2i+1,i1=1,...,n—1. 


F34: Fact F33 implies that the canonical factorization produces a compact schedule 
having exactly 2n — 2 breaks and satisfying the complementarity property. 


EXAMPLE 


E27: For Kg, we have a; = 2,a9 = 4,a3 = 1,0, = 3, 8 = 5,83 = 6, and from the 
HAP of Figure 5.5.9, one sees that teams a; = 2 and 6; = 3 have a break on day 3, 
while teams a2 = 4 and (62 = 5 have a break on day 5, and teams a3 = 1 and $3 = 6 
have no break. 


Some Characterization Results 


FACTS 


F35: Let S; and $2 be two compact schedules for K2,,, each with exactly 2n —2 breaks. 
If both schedules have the same sequence 0, b2,...,bn—1 of days where breaks occur in 
pairs, then their HAPs, H(S,) and H(S2), are the same (up to a permutation of rows). 


F36: Equivalently, by setting bp) = 1 and b, = 2n, we could start with the sequence 
D = (b1 — bo, bo — b1,..., bn — bn-1), which is the sequence of degrees of the (b; — bj-1)- 
regular bipartite graphs appearing in the partition of the edge-set of K2,, defined in Fact 
F33. (For instance, the schedule of Figure 5.5.8 has D = (2,2,1) since (0p, b1, b2, b3) = 
(1, 3,5, 6).) 


F37: Given a sequence D = (dj,do,...,dn) with dj +--- +d, = 2n —1, we can 
reconstruct a unique HAP as follows: for i < n—1 the profile of a; starts with an A 
and has a unique break on day d; +---+d;+1; the profile of a, starts also with an A 
and has no break. For each 7 < n the profile of 6; is the complement of the profile of 
Ay. 
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EXAMPLE 


E28: The HAP in Figure 5.5.12 illustrates Fact F37 for 2n = 6 and D = (3,1,1). 


day 1 day 2 day 3 day4 day 5 


oa, [ H A H H A 
ao | H A H A A 
a3| H A H A H 
B,) A H A A H 
Bo| A H A H H 
Bs \ A H A H A 


Figure 5.5.12: A HAP corresponding to D = (3,1, 1). 


REMARK 


R23: A sequence D = (d,,...,d,,) of positive integers with d, +---+d, = 2n—1 does 
not in general give a HAP that corresponds to a compact schedule for K2, with 2n — 2 
breaks. For instance, the HAP in Figure 5.5.12 does not correspond to any compact 
schedule for Ke. 


Feasible Sequences 


DEFINITION 


D41: A sequence D = (di,...,d,) and its corresponding HAP are feasible if they 
correspond to a compact schedule for Ky, with 2n — 2 breaks. 


FACTS 


F38: [de88] If D = (di,d2,...,dn) is feasible for Ka, then D = (dn, dn—1,...,d,) and 
all sequences obtained by a cyclic permutation of D or D are also feasible. 


F39: No complete characterization of the feasible sequences has been obtained yet; 
however, for n < 13 the feasible sequences have been characterized (see [MilwMa02}). 


F40: Given a sequence (dj, d2,...,dn), we can reconstruct the associated HAP such 
that: the rows are ordered aj, @2,...,Q@n, 91, 82,.--,8n, where a; and {; have comple- 
mentary profiles; a; and 6; have their break on day d; +---+d;+1 (¢=1,...,n—1); 
and the profiles of a1,...,Q@, start with an A. 


NOTATION: For a given HAP and any subset T of teams, we define the quantity a;(T) 
for each day k by ax(T) = min {|{t € T|Ai, = A}, |{2 © Tlhin = H}I}. 
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F41: [MilwMa02] If a given HAP is feasible, then for any subset T of teams, a;,(T) 
is an upper bound on the number of games between teams in JT that can be scheduled 
at period k. Moreover, since all these teams have to play against each other over the 
2n — 1 days, we have 


2n—-1 


>> oe (F) > & 


k=1 


F42: [MilwMa02] Instead of checking explicitly all possible subsets T, it is sufficient to 
assume that the 2n teams are cyclically ordered (a1, @2,..-,Qn,(1,---;Bn,Q1,---), and 
to examine only subsets T that are intervals of at most n consecutive teams in the cyclic 
order. Thus, the number of inequalities to check is O(n”). Using these observations, the 
authors were able to eliminate, as infeasible, a number of sequences D = (d),...,dn); 
it turned out that for 2n < 26, all sequences D that were not eliminated did correspond 
to feasible HAPs. 


CONJECTURE 


[MilwMa02] The inequalities given in Fact F41 are necessary and sufficient conditions 
for a sequence D to correspond to a feasible HAP. 


REMARKS 


R24: There are also season schedules where each pair of teams has to meet several 
times. The schedule consists of rounds that have to satisfy additional requirements. 


R25: Also, there are often constraints that require more breaks in the schedule (for 
instance, some teams might have away games or home games on prespecified days), so 
we cannot use all the properties of schedules with a minimum number of breaks. 


R26: Some references on various types of sports-scheduling problems are given in 
[EaNeTr03]. 


R27: Finally, one should observe that canonical factorizations are not the only fac- 
torizations that should be considered for constructing the schedules. There are other 
types of factorization that are of interest (in particular when a league is divided into 
several subleagues in which internal games have also to be played). Such constraints 
are considered in [de82] and [de85-c]. 
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INTRODUCTION 


There are more than 1000 papers published on a wide variety of graph labeling 
methods. Many of these methods trace their origin to one introduced by Rosa [Ro67] 
in 1967. An extensive survey article on graph labelings that is periodically updated is 
available online at [Gall]. 


DEFINITIONS 


D1: Rosa called a function f a G-valuation of a graph G with q edges, if f is an 
injection from the vertices of G to the set {0,1,...,q} such that, when each edge «ry is 
assigned the label | f(x) — f(y)|, the resulting edge labels are distinct. 


D2: Golomb [Go72] subsequently called such labelings graceful labelings and this is 
now the popular term. 


REMARKS 


R1: Rosa introduced 6-valuations as well as a number of other labelings as tools for 
decomposing the complete graph into isomorphic subgraphs. In particular, G-valuations 
originated as a means of attacking the conjecture of Ringel [Ri64] that the complete 
graph Ko,+1 can be decomposed into 2n + 1 subgraphs that are all isomorphic to a 
given tree with n edges. 
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R2: Although an unpublished result of Erdés (see [GS80]) says that most graphs are 
not graceful, many graphs that have some sort of regularity of structure are graceful. 
Sheppard [Sh76] has shown that there are exactly q! gracefully labeled graphs with 
q edges. Rosa [Ro67] has identified essentially three reasons why a graph fails to be 
graceful: 

1. G has “too many vertices” and “not enough edges” ; 

2. G has “too many edges”; and 

3. G has “the wrong parity.” 


FACTS 


F1: The disjoint union of trees is a case where there are too many vertices for the 
numbers of edges it has. 


F2: An infinite class of graphs that are not graceful for the second reason is given by 
[BG86]. 


F3: [Ro67] If every vertex of a given graph has even degree, and if the number of edges 
is congruent to 1 or 2 (mod 4), then the graph is not graceful. In particular, the cycles 
C4n41 and C4n+2 are not graceful. These are examples of the third possible reason for 
non-gracefulness. 


F4: [Ach82] Every graph can be embedded as an induced subgraph of a graceful graph. 


F5: [Ach82] Every connected graph can be embedded as an induced subgraph of a 
graceful connected graph. 


The next three results demonstrate that there is no forbidden subgraph characteri- 
zation of various particular kinds of graceful graphs. 


F6: {ARA0O8] Every triangle-free graph can be embedded as an induced subgraph of a 
triangle-free graceful graph. 


F7: [ARA0O8] Every planar graph can be embedded as an induced subgraph of a planar 
graceful graph. 


F8: [ARAO8] Every tree can be embedded as an induced subgraph of a graceful tree. 


5.6.1 Trees 


DEFINITIONS 


D3: A caterpillar is a tree such that the deletion of all univalent vertices leaves a 
path. 


D4: A lobster is a tree such that the deletion of all univalent vertices leaves a cater- 
pillar. 
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CONJECTURES 

A conjecture by Ringel and Kotzig has spurred much research, which Kotzig [HKR82] 
has characterized as a “disease.” A special case of that conjecture was made by Bermond 
in 1979. 
C1: (Graceful Tree Conjecture) [Ringel—Kotzig] All trees are graceful. 


C2: [Be79] Lobsters are graceful. 


FACTS 

F9: [Fa] Trees with at most 35 vertices are graceful. 

F10: [Ro67] Caterpillars are graceful. 

F11: [HKR82, Zha89, JMW93] Trees with at most 4 end-vertices are graceful. 

F12: [Zha89, HHO1] Trees with diameter at most 5 are graceful. 

F13: [BeS76, PoS82] Rooted trees in which every level contains vertices of the same 
degree are graceful. 

REMARK 


R3: Methods for combining graceful trees to yield larger graceful trees are given in 
[StZa73, KR80, KRT81, KTR79b]. Recursive constructions to create graceful trees are 
provided in [Rog78, KTR79a]. 


5.6.2 Cycle-Related Graphs 


Cycle-related graphs have been a major focus of attention. 


FACTS 
F14: [Ro67] The n-cycle C,, is graceful if and only if n = 0 or 3 (mod 4). 
F15: [Fr79] Wheels W,, = C, + Ky are graceful. 


F16: [BNS03] The n-cone (also called the n-point suspension of Cm) Cm + Kn is 
graceful when m = 0 or 3 (mod 12). When n is even and m is 2, 6 or 10 (mod 12) 
Cm + Ky violates Rosa’s parity condition for a graceful graph. 


F17: [AF84] The helm graphs obtained from a wheel W,, by attaching a pendant 
edge at each vertex of the n-cycle are graceful. 


F18: [KLG96] The web graphs obtained by joining the pendant points of a helm to 
form a cycle and then adding a single pendant edge to each vertex of this outer cycle 
are graceful. 
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F19: [MF84b] The gear graphs obtained from the wheel W,, by adding a vertex 
between every pair of adjacent vertices of the n-cycle are graceful. 


F20: [LY95] The graphs obtained from a gear graph by attaching one or more pendant 
edges to each vertex between the vertices of the n-cycle are graceful. 


F21: [LY96] The graphs obtained when two or more vertices are inserted between 
every pair of vertices of the n-cycle of the wheel W,, are graceful. 


F22: [DMTS80, MF84a, KY85, PP87] The graphs obtained from a cycle by joining two 
nonconsecutive vertices of the cycle with a path of fixed length at least 3 are graceful. 


F23: [KP82, KRY80, GoL92] For 3 < p < n—T, the n-cycle with consecutive ver- 
tices V1, V2,.-.,Un to which the r chords v1, Up, V1Up41, ---,U1Up4r—1 have been added is 
graceful. 


Cycles that share a common edge or a vertex have received some attention. 


F24: [MuAr] Books with n pentagonal pages (i.e., n copies of Cs with an edge in 
common) are graceful when n is even and not graceful when n is odd. 


F25: [BBG78, BKT78] Graphs, oY that are the one-point union of t 3-cycles are 
graceful if and only if t = 0 or 1 (mod 4). 


F26: [Sh91] Graphs, c!), that are the one-point union of t 4-cycles are graceful for 
all t. 


CONJECTURE 


C3: [KRT79] A graph C! that is the one-point union of t n-cycles is graceful if and 
only if nt = 0 or 3 (mod 4). 


FACTS about 1-point unions 


F27: (see [Gall]) Conjecture C3 has been proved for n = 5,7,9,11,13 and when t = 2 
and n is even. 


F28: [BSW75] The one-point union of any two cycles is graceful when the number 
of edges is congruent to 0 or 3 modulo 4. (The other cases violate the necessary parity 
condition.) 


F29: [SeSe01b] For a variety of choices of points, the one-point union of any number 
of non-isomorphic complete bipartite graphs is graceful. The question of whether this 
is true for all choices of the common point is open. 


DEFINITIONS 
D5: A block of a graph is a maximal connected subgraph that has no cut-vertex. 


D6: The block-cutpoint graph of a graph G is a bipartite graph in which one partite 
set consists of the cut vertices of G, and the other has a vertex b; for each block B; 
of G. 


D7: A C,,-cactus is a connected graph all of whose blocks are C;,. 


Section 5.6. Graceful Labelings 567 


FACTS about cacti 


F30: [Mo89] Triangular cacti are graceful if and only the number of blocks of the graph 
is 0 or 1 (mod 4). 


F31: [Sek02] The C,,-cacti whose block-cutpoint graphs are paths are graceful when 
n = 0 (mod 4) (n > 8) and when n = 2 (mod 4) and the number of C,, is even. 


F32: [Sek02] The graphs obtained from C,, by appending a path P, to each vertex 
are graceful. 


F33: [Qi, KLG96] The graphs formed by adding a single pendant edge to arbitrarily 
many vertices of a cycle are graceful. 


CONJECTURE 


The graphs in the previous two facts are special cases of a long standing conjecture. 


C4: [Tr84] All unicyclic graphs except C,, for n = 1 or 2 (mod 4) are graceful. 


REMARK 
R4: For given cycle C,, with n = 0 or 3 (mod 4) and a family of trees 
T = {T1,To,...,Tn}, 


let wu; and v;,1 <7 <n, be fixed vertices of C;, and T;, respectively. Figueroa-Centeno, 
Ichishima, Muntaner-Batle, and Oshima [FMO] provided two construction methods 
that generate a graceful labeling of the unicyclic graphs obtained from C;, and 7 by 
amalgamating them at each u; and v;. Their results encompass all previously known 
results for unicyclic graphs whose cycle length is 0 or 3 (mod 4) and considerably extend 
the known classes of graceful unicyclic graphs. 


5.6.3. Product-Related Graphs 


Graphs that are Cartesian products and related graphs have been the subject of 
many papers. 


FACTS 
F34: [AG81, Mah80] Planar grids, P,, x Pn, are graceful. 
F35: [FG88] The graphs C;,, x P: are graceful. 


F36: [JR92] The graphs C,, x P, are graceful when m and n are even or when m = 0 
(mod 4). 


REMARK 


R5: The graphs C,, x P, can be viewed as grids on cylinders. 
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FACTS 


F37: [YW92, YW94] The graphs C4n+42 X Pam+3, Cn X Po, and Cg x Pm (m > 2) are 
graceful. 


F38: [Sin92] The graphs C3 x P,, are graceful. 


F39: [HuS94] The graphs C,,, x P, are graceful for all n when m is even and for all n 
with 3 <n < 12 when m is odd. 


F40: [JR92] The torus grids C,, x C,, are graceful when m = 0 (mod 4) and n is even. 


REMARKS 


R6: Jungreis and Reid [JR92] also investigated the existence of a stronger form of 
graceful labeling called an a-labeling (see Section 5.1.7) for graphs of the form P,, x 
Pn;Cm X Pn, and Cm, x Cy (see also [Gal1]). 


R7: The graceful parity condition is violated for Cy, x C, when m and n are odd. A 
complete determination of which torus grids are graceful will most likely involve a large 
number of cases. 


FACTS about prism graphs 


F41: [GPW93] The graphs C,,, x P2 with a single vertex deleted or single edge deleted 
are graceful. 


F42: [Ga89] Mobius ladders obtained from the ladder P,, x P2 by joining the opposite 
endpoints of the two copies of P,, are graceful. 


F43: [Rop90] The graphs C,, x P: with a single pendant edge at each vertex are 
graceful. 


F44: [Rop90] The graphs C,, x P2 with a single pendant edge at each vertex of one of 
the m-cycles are graceful. 


F45: [Mah80] The graphs S2,, x Pz (book graphs) where S,, is the star with n +1 
vertices are graceful. 


F46: [De80] The books S441 x P2 are graceful. 


REMARK 
R8: The books S443 x P. do not satisfy the graceful parity condition. 


FACTS 

F47: [Mah80] The graphs (P,, x P2) x P2 and (Som x P2) x P» are graceful. 
F48: [K75, Mah80] The n-cubes Kz x Kz x --- x K2 (n copies) are graceful. 
F49: [GJ88] The graphs S2,, x P, (stacked books) are graceful. 


REMARK 


R9: Whether the graphs S2,,4, x P, are graceful is an open question. 
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5.6.4 Complete Graphs 


Complete graphs and variations have been well studied. 


FACTS 
F50: [Go72, Si74, BH02] The complete graphs K,, are graceful if and only if n < 4. 
F51: [Ro67, Go72] The bipartite complete graphs K,,,,,, are graceful. 


F52: [AM, Gn91, BH02] The graphs Kymin, Kijymn, and Kom n are graceful. 


CONJECTURE 

C5: [BH02] The graphs Kimn,Kamn;Kij,mn, and Km» are the only complete 
multipartite graphs that are graceful. 

REMARK 

R10: Beutner and Harborth [BH02] have verified their above conjecture for graphs 
with up to 23 vertices via computer. 

FACTS 


F53: [Jir03, AK] The graphs Ky, with a pendant edge attached to each vertex are 
graceful. 


F54: [SeEl01] The graphs Ky, with a pendent edge attached to each vertex are 
graceful. 


F55: [BH02] The graphs K,, with an edge deleted are graceful only if n < 5. 

F56: [BH02] The graphs K,, with two or three edges deleted are graceful only if n < 6. 
F57: [KRY80] A necessary condition for the graphs consisting of m copies of K,, with 
a vertex in common, K&™ (n > 3), to be graceful is that n = 4 or 5. 

REMARK 


R11: The gracefulness of 1. is equivalent to the existence of a (12m+1, 4, 1)-perfect 
difference family, which is known to exist for m < 1000 (see [HuS94, AB04, WC10], and 
[GMS10]). 


CONJECTURE 


C6: [Be79] The graphs K. me) are graceful for all m > 4. 


FACTS 


F58: [BKT78] The graphs K&), nS. and Ke are not graceful. 
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F59: [KRT79, RaPu] The graphs that are the one-point union of t copies of Ky,n are 
graceful. 


F60: [SeSe01a] The one-point union of graphs of the form K2m, for i = 1,2,...,n, 
where the union is taken at a vertex from the partite set with exactly two vertices, are 
graceful if at most two of the m; are equal. 


CONJECTURE 


C7: [SeSe01la] The restriction in Fact F60 that at most two of the m; are equal is not 
necessary. 


FACT 


Bermond [Be79] raised the question: “For which m,n, and r is, B(n,r,m), the graph 
consisting of m copies of K,, with a K, in common (n > r) graceful? 


F61: For r > 1 the graphs B(n,r,m) are graceful in the following cases: 
e n=3,r =2,m> 1 [KRL79]; 
en=4, r=2, m>1 [De80]; 
en=4, r=3, m> 1 [Be79, KRL79]. 


REMARK 


R12: Combining results of Bermond and Farhi [BF82] and of Smith and Puget 
[SmPul0] shows that B(n,2,2) is not graceful for n > 5. 


5.6.5 Disconnected Graphs 


For any graph G the graph mG denotes the disjoint union of m copies of G. In 
1984 Kotzig [Ko81] investigated the gracefulness of rC, as well as of graphs that are the 
disjoint unions of odd cycles. For graphs of the latter kind he gives several necessary 
conditions. His paper concludes with an elaborate table that summarizes what was then 
known about the gracefulness of rC,. When rs = 1 or 2 (mod 4), these graphs violate 
the gracefulness parity condition. 


FACTS 


F62: [Ko73] For r = 3 and 4k > 4, rC'4;, has a stronger form of graceful labeling called 
a-labeling (see Section 5.1.7) whereas when r > 2 and s = 3 or 5, rC, is not graceful. 


F63: [He95] The graphs 2C2,, and graphs obtained by connecting two copies of Com, 
with an edge are graceful. 


F64: [KT76] The graphs mK, are graceful if and only ifm =1 and n < 4. 
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REMARKS 


R13: Bu and Cao [BuC95] give some sufficient conditions for the gracefulness of graphs 
of the form Ky,» UG and they prove that Ky,,,U P; and the disjoint union of complete 
bipartite graphs are graceful under some conditions. 


R14: In 1985 Frucht and Salinas [FS85] conjectured that C, U P, is graceful if and 
only if s+ > 7 and proved the conjecture for the case that s = 4. Between 1985 and 
2012 more than a dozen authors proved many special cases. Building on partial resuilts 
in a paper by Buratti and Traetta [BT], the Frucht-Salinas conjecture was proved by 
Traetta [Tr] in 2012. He used his result to get a complete solution to the well-known 
two-table Oberwolfach problem: given an odd number of people and two round tables, 
determine when is it possible to arrange a series of seatings so that each person sits next 
to each other person exactly once during the series. The t-table Oberwolfach problem 
OP(ni,72,-..,n+) asks to arrange a series of meals for an odd number n = 55>; of 
people around ¢ tables of sizes n1,n2,...,n4 so that each person sits next to each 
other exactly once. A solution to OP(n1,72,...,nz) is a 2-factorization of K,, whose 
factors consists of t cycles of lengths n1,n2,...,n;. The A-fold Oberwolfach problem 
OP,(n1,n2,...,n4) refers to the case where K,, is replaced by AK,. Traetta used 
his proof of the Frucht and Salinas conjecture to provide complete solutions to both 
OP(2r + 1,2s) and OP(2r + 1,s,s), except possibly for OP(3, s,s). He also gave a 
complete solution of the general A-fold Oberwolfach problem OP) (7, s). 


FACTS 

F65: [SeYo2] The graphs K5 U Kin, Kmjyn U Kp (m,n, p,¢ > 2), Km U Kp,q U 
Ky,s (m,n, p, 49,7, 8 = 2, (p,q) x (2, 2)), and PK mn (m,n = 2, (m,n) # (2, 2)) are 
graceful; the graphs Cy U Ky, (n 4 2) are not graceful. 

F66: [CK4, Kis96] The graphs C, U Kj,» are graceful for s > 7. 

F67: [LQW88] The graphs P,; U Ky1,, are graceful. 


F68: [AK96] The graphs C, U C, are graceful if and only if p+ q = 0 or 3 (mod 4). 


F69: [Zho93] The graphs K,, UK, (n > 1,m > 1) are graceful when {m,n} = {4,2} 
or {5,2}, and only then. 


F70: [BD96] The graphs C4; U Ky 44-1 and Car43 U Ky4e42 are graceful. 


REMARK 


R15: Subsection 5.6.7 includes numerous families of disconnected graphs that have a 
stronger form of graceful labelings. 
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5.6.6 Joins of Graphs 


Various joins of graphs have been shown to be graceful. 


FACTS 

F71: [Ach82] If G is a connected graceful graph, then G + K,, is graceful. 

F72: [Ba05] If G is a graceful graph of order n and size n—1, then G+ K,, is graceful. 
F73: [Re03] The graphs C,, + Ky (double cones) are graceful for n = 3,4,5,7,8,9, 11. 


That C,, + Ko is not graceful for n = 2 (mod 4) follows from Rosa’s parity condition. 


F74: [SeYo4] The join of any two stars and the join of any path and any star are 
graceful. 


F75: [You03] If G is a graceful graph with p vertices and q edges with p = q+ 1, then 
G+ Kip, is graceful. 


F76: [BG86] The graphs 2K2 + K;, are not graceful. 


F77: [You03, Ma86] The graphs mK, + K,, are graceful if m = 0 or 1 (mod 4) and 
mKy+ Ky, is not graceful if n is odd and m = 2 or 3 (mod 4). 


REMARK 


R16: Balakrishnan and Sampathkumar [BS96] ask for which m > 3 is the graph 
mK y+ K,, graceful for all n. 


5.6.7 a-labelings 


In 1966 Rosa [Ro67] defined an a-labeling (or a-valuation) as a graceful labeling 
with the additional property that there exists an integer k so that for each edge zy either 
f(a) <k < f(y) or f(y) < & < f(x). (Other names for such labelings are balanced, 
interlaced, and strongly graceful.) It follows that such a k must be the smaller of 
the two vertex labels that yield the edge labeled 1. Also, a graph with an a-labeling is 
necessarily bipartite and therefore cannot contain a cycle of odd length. 

Graphs with a-labelings have proved to be useful in the development of the theory 
of graph decompositions. Rosa [Ro67], for instance, has shown that if G is a graph with 
q edges and has an a-labeling, then for every natural number p, the complete graph 
Kogp+1 can be decomposed into copies of G in such a way that the automorphism group 
of the decomposition contains the cyclic group of order p. In the same vein El-Zanati 
and Vanden Eynden [EV96] proved that if G has q edges and admits an a-labeling then 
Kgm,qn can be partitioned into subgraphs isomorphic to G for all positive integers m 
and n. Although a proof of the graceful tree conjecture has withstood many attempts, 
examples of trees that do not have a-labelings are easy to construct (see [Ro67]). 


FACTS 


F78: [Wu] A necessary condition for a bipartite graph with n edges and degree sequence 
d,,d2,...,dp to have an a-labeling is that the ged(d1, do,...,dp,n) divides n(n — 1)/2. 
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F79: [Ko73] Almost all trees have a-labelings. 
F80: [Ro67] All paths have a-labelings. 
F81: [Ro67] The n-cycle has an a-labeling if and only if n = 0 (mod 4). 


F82: Other familiar graphs that have a-labelings include the following: 
e caterpillars [Ro67], 
e the n-cube [Ko65], 

Mobius ladders M,, when n is odd [Pas10], 

books with 4n + 1 pages [GJ88], 

© Com U Cam and Cam U Cam U Cam for all m > 1 [Ko73], 

© Cam U Cam U Can for all (m,n) 4 1,1) [ECO1], 

© P, x Qn [Mah80], 

© Kin X Qn [Mah80], 

© Cam U Cam U Cam U Cm [LV87], 

© Cim U Canto U Cap+2, Cam U Can U C1, when m+n <r [AK96], 

© Crm U Can U Cay U Cy, when m >n+r+s [ACE], 

© Cym U Can U Cap42 U Cis42 when m >n+r+s6+1 [ACE], 

© ((m+1)2 +1)Cy for all m [Zhi98}, 

e k?C4 for all k [Zhi98], and 

© (k2 +k)Cy for all k [Zhi98]. 


F83: [Es97] With the exception Cy UC, U C4, every 2-regular bipartite graph with 3 
components has an a-labeling if and only if the number of edges is a multiple of four. 


F84: [AK92] The graphs kC, have an a-labeling for 4 < k < 10. 

F85: [AK92] If &Cy has an a-labeling then so do (444+1)C4, (544+1)C4, and (9k+1)C4. 
F86: [Es02] The graphs 5C4, have a-labelings for all k. 

F87: [Le] The graphs Cg x P2:+1 have a-labelings. 

F88: [FMB03] If m = 0 (mod 4) then the one-point union of 2, 3, or 4 copies of Cy, 
admits an a-labeling, and if m = 2 (mod 4) then the one-point union of 2 or 4 copies 


of Cy, admits an a-labeling. 


F89: [Zhi98] The connected graphs all of whose blocks are C4, and whose block- 
cutpoint graph are paths have a-labelings but the connected graphs all of whose blocks 
are C,,, and whose block-cutpoint graph are paths do not have a-labelings when m is 
odd. 


CONJECTURE 


C8: The one-point union of n copies of C,, admits an a-labeling if and only if mn = 0 
(mod 4). 
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REMARKS 


R17: [EC01] Eshghi and Carter show several families of graphs of the form C4, U 
Can, U++:UC4n, have a-labelings. 


R18: Jungreis and Reid [JR92] investigated the existence of a-labelings for graphs of 
the form Py, X Pa, Cm X Pn, and Cm x Cr (see also [Gal1]). Of course, the cases involving 
Cin with m odd are not bipartite, so there is no a-labeling. The only unresolved cases 
among these three families are C4mi2 Pony, and C4ym+42 X Can. All other cases 
result in a-labelings. 


FACTS 
F90: [Ro67, Ba3] The graphs Ky,» have a-labelings. 


F91: [Sel02] The one-point unions of the following forms have an a-labeling: Kmmn, 
and Kinng} Kmijni, Kmsjng, and Km3ngz where my < mg < m3 and ny < no < 
m3; Km,n; Kmon, and Kmsz.n where my < m2 < m3 < 2n. 


F92: [Ba3] For n even the graphs obtained from the wheel W,, by attaching a pendant 
edge at each vertex have a-labelings. 


F93: [Sn2] Compositions of the form G[K,,] have an a-labeling whenever G does. (The 
composition G1 [G2] is the graph having vertex set V(G1) x V(G2) and edge set 
{(r1, yi), (@2, y2)| 1%2 € E(G1) or x1 = x2 and yiy2 € E(G2)}.) 


F94: [Qi] For n even graphs obtained from an n-cycle by adding one or more pendant 
edges at some vertices have a-labelings as long as at least one vertex has degree 3 and 
one vertex has degree 2. 


F95: [SeH11] All gear graphs have an a-labeling (see also [Le]), all graphs obtained 
by joining an endpoint of a path to a cycle of order n = 0 (mod 4) have an a-labeling, 
and the graphs obtained by identifying an endpoint of a star S,, (m > 3) with a vertex 
of C4, have an a-labeling. 
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Glossary for Chapter 5 


achromatic number — of a graph G: largest number of colors in a proper vertex- 
coloring such that the union of any two color classes induces at least one edge. 

almost regular factor (or semiregular factor) — of a graph G: a factor of G of type 
[k,k + 1], for some integer k > 0. 

antifactor set (or l-barrier) — in a graph G: a set S C V(G) such that co(G—S) > |S]. 

approximation (or approximate) algorithm: an algorithm that typically makes 
use of heuristics in reducing its computation but produces solutions that are not 
necessarily optimal. 

r(n)-approximation algorithm — for x: for every n and every input graph G with n 
vertices, the algorithm outputs an integer & such that y(G) < k < r(n)x(G) (where 
r(n) : N > R® is a given function). 

arboricity — of a graph G: the minimum number of edge-disjoint acyclic subgraphs 
whose union is G. 

k-assignment — on the vertices (edges) of a graph: a list assignment L where |L,| =k 
(|Le| = k) for every vertex v (every edge e). 

asymptotically almost surely (or a.a.s.): an event E concerning a graph G € Gn,» 
is said to hold asymptotically almost surely (or a.a.s.) if limp... Prob E = 1. 

1-barrier (or antifactor set) — in a graph G: a set S C V(G) such that co(G—S$) > |S]. 

6-valuation — of a graph G: a synonym for graceful labeling. 

bicritical graph: a graph G in which G — x — y has a 1-factor for every choice of two 
different vertices x and y € V(G). 

bi-hypergraph: a mixed hypergraph with C = D. 

binding number bind (G) — of a graph G: defined to be 

min {|N(X)|/|X| | 0#X CV(G), and N(X) ¢ V(G)} 

bipartite graph: a graph G whose vertex set V can be partitioned into two sets Vj 
and V2 such that every edge of G connects a vertex in V; with a vertex in V9. 

block — of a graph: a maximal connected subgraph that has no cut-vertex. 

block-cutpoint graph — of a graph G: a bipartite graph in which one partite set 
consists of the cut vertices of G, and the other has a vertex 6; for each block B; 
of G. 

book graph: the graph $2, x P2, where Sg, is the star with 2m + 1 vertices. 

___, Stacked: the graph So, x Py. 

C,,-cactus: a connected graph all of whose blocks are Cy. 

caterpillar: a tree such that the deletion of all univalent vertices leaves a path. 

choice number — of a graph G: the smallest nonnegative integer k such that G is 
k-choosable. Denoted by ch(G). 

(f,g)-choosable graph — for two functions f,g: V > N: if |Z,| = f(v) for every 
vertex, then one can choose subsets C, C Ly such that |Cy| = g(v) and C, NC, = 0 
for all wv € E. 
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k-choosable graph: L-colorable for every k-assignment L. 

f-choosable graph — for a function f : V — N: L-colorable for every list assignment 
L with |L,| = f(v) for allue V. 

chordal graph: a graph in which every circuit of length > 4 has a chord (i.e., an edge 
joining non-consecutive vertices on the circuit). 

k-chromatic graph: has precisely & as the smallest number of colors in a proper vertex 
coloring = k-colorable but not (k — 1)-colorable. 

chromatic index — of a graph G: smallest number of colors in a proper edge-coloring 
of G; same as the chromatic number of the line graph. Denoted by /(G). 

chromatic number — of a graph G: the minimum number of colors in a proper vertex 
coloring of G. Denoted by y(G). 

chromatic polynomial — of a graph G: for every natural number k, its value is the 
number of proper k-colorings of G; denoted P(G, X). 

chromatic sum — of a graph G: smallest sum of colors in a proper vertex-coloring with 
natural numbers; denoted }(G). 

Class 1/2: graph G is of Class 1 if its chromatic index is A(G), and of Class 2 if 
yV'(G) = A(G) +1. 

claw — of a graph G: an induced subgraph of a graph G isomorphic to the bipartite 
graph ky 3. 

claw-free graph G: a graph G containing no Ky3 as an induced subgraph. 

clique cover — of a graph G: a collection of cliques of G that contains every vertex of 
G. 

clique hypergraph — of a graph G = (V,£): the hypergraph on V whose edges are 
the vertex subsets inducing inclusionwise-maximal complete subgraphs in G, other 
than isolated vertices. Its chromatic number is denoted yo(G). 

clique number — of a graph G, denoted w(G): the size of the largest clique in G. 

clique partition number — of a graph G: the smallest number cp(G) such that there 
exists a set of cp(G) cliques in G such that the cliques form a partition of E(G). 

clique; — in a graph G: a subset of vertices in G that are mutually adjacent to one 
another (caution: non-uniform definition). 

clique — in a graph G: a maximal mutually adjacent set of vertices (caution: non- 
uniform definition). 

color class — in a vertex (edge) coloring: set consisting of all vertices (edges) having 
the same color. 

color cost: same as chromatic sum. 

k-colorable graph: has a proper vertex coloring with at most k colors. 

colorable mixed hypergraph: a mixed hypergraph that has at least one strict col- 
oring. 

L-colorable — graph G, with respect to list assignment LD: if G admits a proper vertex 
coloring y such that y(v) € Ly for all v. 

coloring number — of a graph G: smallest integer k such that every subgraph of G 
contains a vertex of degree less than k; denoted col(G). 

H-coloring — of a graph G: homomorphism from G to H. 

k-coloring: coloring with at most k colors. 

comparability graph: a graph whose edges can be directed so that directed adjacency 
becomes a transitive relation, that is, whenever there exist directed edges (a,b) and 
(b,c) there must also exist the directed edge (a,c). 

complement — of a graph G: the graph G° = (V, E*) which is related to graph G = 
(V, £) as follows: it has the same vertex set V as G and edges defined by (a, y) is in 
E* if and only (a, y) is not in E. 
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complete r-uniform hypergraph: its edges are all the r-element subsets of the vertex 

set; denoted KY}, (n is the number of vertices). 

n-cone: the graph C,, + Ky (also called the n-point suspension of C;,). 

conflict graph: a graph in which the nodes represent events (e.g., courses, exams) and 

an edge between two nodes indicates that the two events cannot be scheduled in the 
same time slot. 

construction heuristic: a heuristic that produces a feasible solution without any 

attempt to improve it. 

cost chromatic number — of a graph G, with given cost set C: smallest number of 

colors in a minimum-cost coloring. 

cost set: associates a positive real cost with each color. 

k-critical graph: k-chromatic graph whose chromatic number decreases to k — 1 when- 

ever an edge is deleted. 

n-cube: the graph Ky x K2 x --- x Kg (n copies). 

cyclically k-edge-connected graph: a graph in which at least k edges must be 

deleted in order to leave two components, each containing a cycle. 

H-decomposition problem: the problem defined as follows: given a fixed graph H, 

can the edge-set of an input graph G be partitioned into copies of H? 
divisible by t: A graph G which admits a partition of its edge-set into t isomorphic 
subgraphs is said to be divisible by t. 

edge k-colorable graph: has a proper edge-coloring with at most k colors. 

edge choice number — of a graph G: the choice number of the line graph of G; denoted 

ch'(G). 

edge, C-, D-edge — of a mixed hypergraph: see strict coloring. 

edge-chromatic number — of a graph G: the minimum number of different colors 

required for a proper edge-coloring of G; denoted y’(G). 

edge-coloring: assignment of colors to the edges (each edge gets one color). 

___, proper: edge-coloring where any two edges sharing a vertex have distinct colors. 
exact algorithm: an algorithm that solves a certain optimization problem to optimal- 
ity. 

factor — of a graph G: a spanning subgraph of G. 

, (1, f)-odd-: a spanning subgraph F' of G in which degp(v) € {1,3,..., f(v)}, 
where f is a function from V(G) to the odd positive integers. 

, (g, f)-: a spanning subgraph F' of G such that g(v) < deg,(G) < f(v) for all 
veEV(G). 

__, [a, bJ-: a factor of G for which a < deg,,(v) < 8, for all v € V(G), where a and b 

are integers such that 1<a <b. 

__, 1-: a set of vertex-disjoint edges in G which together span V(G). 

__, F-: a spanning subgraph of G in which each component is a single edge or an 
odd cycle. 

, f- — of multigraph G (possibly with loops): a spanning subgraph H of G such 
that degy(v) = f(v), for all v € V(G), where f, a non-negative, integer-valued 
function on V(G). 

__, G- — of a graph A: a set {G),...,Ga} of subgraphs of H such that each G; is 
isomorphic to G and such that the sets V(G) collectively partition V(G). 
__, k-: a k-regular spanning subgraph. 
G-factor recognition problem F'ACT(G): defined by 
INSTANCE: A graph H. 
QUESTION: Does H admit a G-factor? 
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factorization — of a graph G: a set of factors {F\, Fo,... Ff} of G such that the 
edge-disjoint union of factors F\, Fo,...F, is E(G). 

all (g, f)-factors: a graph G is said to have all (g, f)-factors if and only if G has an 
h-factor for every h such that g(v) < h(v) < f(v) for all v € V(G). 

feasible solution: a solution that satisfies all hard constraints. 

fractional chromatic number — of a graph G: smallest ratio p/q such that there 
exist p independent sets that cover each vertex precisely g times; denoted \*(G). 

fractional vertex-coloring: function from the family of independent vertex sets to 
R=°, such that the sum over the sets containing vertex v is at least 1, for each v. 

fractional (g, f)-factor — of a graph G: a vector x = (x-) with |E(G)| real components 
such that 0 < xe < ce and g(v) < deg,(v) < f(v). Here deg,(v) = }> uv, where 
the sum is over all edges incident with vertex v and c, is the “capacity” or perhaps 
the “multiplicity” of edge e. 

gear graph: obtained from the wheel W, by adding a vertex between every pair of 
adjacent vertices of the n-cycle. 

graceful labeling — of a graph G with q edges: an injection f from the vertices of G to 
the set {0,1,...,q} such that, when each edge xy is assigned the label | f(a) — f(y)|, 
the resulting edge labels are distinct. 

__, strongly: synonym for a-labeling. 

(a, b|-graph: a graph G in which a < deg(v) < b, for every vertex v € V(G). 

graphical — degree sequence: a sequence of non-negative integers d,,...,d,, such that 
there exists a graph G of order n and degrees (in some order) dj,..., dy. 

Grundy number - of a graph G: largest number of colors in a proper vertex-coloring 
with natural numbers where each vertex v has a neighbor in each color smaller than 
the color of v. 

Hamming distance — between a pair u = (u1,...,Un), UV = (U1,.-.,Un) of binary 
vectors: the number of indices i for which u; 4 1. 

Hamming graph H(n,d): a graph whose vertices are all binary vectors with n co- 
ordinates. A pair u,v of vertices in H(n,d) are adjacent if the Hamming distance 
between them is at least d. 

hard constraint: a constraint that must be satisfied. 

helm graph: the graph obtained from a wheel by attaching a pendant edge at each 
vertex of the cycle. 

homomorphism — from (di)graph G to H: maps the vertices of G to vertices of H 
in such a way that the image of every edge is an edge of H. Notation if it exists: 
G —> H; if it does not exist: GA H. 

hypergraph: a pair H = (X,F) where X is a set (vertex set) and F is a set system 
on X (edge set). 

hypohamiltonian — graph G: a graph G which has no Hamilton cycle, but for which 
G — v does, for all v € V(G). 

hypotraceable — graph G: a graph G which has no Hamilton path, but for which G—v 
does, for all v € V(G). 

independence number: the number of vertices in a maximum-size independent set 
of a graph. 

independent set — of a graph G: a subset of vertices in G that are mutually non- 
adjacent. 

independent set: a mutually non-adjacent set of vertices. 

independent-set cover — of a graph G: a collection of independent sets of G that 
contains every vertex of G. 
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intersection graph — of a family F' of subsets of a given set: a graph G(F’) with a 
1-to-1 correspondence between subsets of F' and vertices of G such that two vertices 
of G are adjacent if and only if they correspond to two subsets of F with a non-empty 
intersection. 

interval graph: a graph for which there exists a family F' of intervals on a line such 
that G is an intersection graph, that is, there is a 1-to-1 correspondence between 
intervals of F' and vertices of G such that two vertices of G are adjacent if and only 
if they correspond to overlapping intervals of F’. 

proper: an interval graph with the property that there is an interval model F for G in 
which no interval of F is properly contained within another interval of F’. 

interval model — for an interval graph G: a family of intervals on the line for which 
G is an intersection graph. 

a-labeling: a graceful labeling with the additional property that there exists an integer 
k so that for each edge xy, either f(a) <k < f(y) or f(y) <k < f(x). (Other names 
for a-labelings are balanced, interlaced, and strongly graceful.) 

line graph — of a graph G: a graph, denoted L(G), with a vertex for each edge of G 
and two vertices of L(G) joined by an edge if and only if they correspond to two 
edges in G with a common endpoint. 

k-linear arboricity — of a graph G: the minimum number of k-linear forests which 
partition E(G). 

k-linear forest: a forest in which all components are paths of length at most k. 

list assignment L — on the vertex set of a graph G: associates a set L, of “allowed” 
colors with each vertex v of G. 

list chromatic index or list edge chromatic number: same as edge choice number. 

list chromatic number: same as choice number. 

list colorable graph: L-colorable with respect to a list assignment LD that is under- 
stood. 

lobster: a tree such that the deletion of all univalent vertices leaves a caterpillar. 

local search: a heuristic that starts from a feasible solution and in each iteration, until 
termination, chooses the next solution from a neighborhood of solutions that are, in 
some prescribed sense, close to the current solution. 

__, improvement: a local search in which we choose only a solution that is better 
than the current one and stop if we cannot find one. 

lower chromatic number — of a colorable mixed hypergraph: the smallest number of 
colors in a strict coloring; denoted x(H). 

matching number of G: the size of a largest matching in G; denoted v(G). 

maximum clique: the number of vertices in a maximum-size clique of a graph. 

mixed hypergraph: has two sets of edges, C and D. (Cf. strict coloring.) 
___, C-: mixed hypergraph without D-edges. 
__, D-: mixed hypergraph without C-edges. Same as hypergraph. 
__, uncolorable: does not have any strict coloring. 
___, uniquely colorable: has just one strict coloring, apart from renaming the colors. 

Mobius ladders: obtained from the ladder P, x P2 by joining the opposite endpoints 
of the two copies of P,,. 

neighborhood complex — of a graph G: simplicial complex whose vertices are the 
vertices of G, and the simplexes are the vertex subsets having a common neighbor 
in G. 

nowhere-zero k-flow — on an oriented graph: weight function from the edge set to 
{0,1,...,4—1}, such that the in-flow (sum of weights on the edges oriented toward 
a vertex) equals the out-flow (sum on the out-going edges), at each vertex. 
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odd-cycle property: the property which states that every pair of odd cycles either 
has a vertex in common or is joined by an edge. 


on-line coloring: receiving the vertices v,,..., Un of a graph G one by one, a color has 
to be assigned to each successive vertex v; only after its neighbors in {v1,...,v;-1} 
are known. 


oriented d-coloring — of a graph G: a proper edge-d-coloring together with an as- 
signment of a direction to each edge; used to model the schedule of home and away 
games in sports timetabling. 

partial vertex/edge coloring: assignment of colors to a subset of the vertices/edges. 

partition number — of a graph G, denoted p(G): the size of the smallest clique cover 
of G. 

path factor — of a graph G: spanning subgraph of G each component of which is a 
path. 

permanent — of a matrix A: the matrix function defined by 

per A= S- 1 7(1)42n(2) °° * Inr(n) 

where the sum extends over all permutations 7 of the set {1,...,n}; denoted per A. 

Perron root \(G): the largest eigenvalue of Ag. 

planar dual — of a plane graph G: the dual vertices are the faces of G, and the 
endpoints of a dual edge are the faces whose boundary contains the original edge; 
denoted G* = (V*, E*). 

plane graph: planar graph imbedded in the plane. 

precoloring extension or PrExt problem: asks whether a given partial coloring can 
be extended to a proper coloring of the entire graph, using at most a given number 
k of colors. 

t-PrExt: Precoloring Extension where each color occurs on at most t vertices in the 
given partial coloring. 

proper vertex-coloring — of a hypergraph H: vertex partition where no partition 
class contains any edge of H. 

random graph G’,,,: Let 1,...,n be a labeling of the vertices and let {e;;},1 <i < 
j <n, be an array of independent random variables, where each e;; assumes the 
value 1 with probability p and 0 with probability 1— p. This array determines a 
random graph on {1,...,} where each (ij) is an edge if and only if e,; = 1. This 
probability space (or random graph) is denoted by Gn,p. 

ranking number — of a graph G: smallest number of colors in a vertex-coloring such 
that each path with the same color on its endpoints contains a vertex of larger color; 
denoted x,.(G). 

t-rational graph G: a graph G which is divisible by t or else t 1 |E(G)|. 

t-rational Problem: The problem defined by: Given a graph G and a positive integer 
t, is G t-rational? 

snark: 3-regular graph of chromatic index 4, which is also cyclically 4-edge-connected 
and has girth at least 5. 

soft constraint: a constraint that it is desirable, but not necessary, to satisfy. 

square — of a graph G: obtained from G by joining the vertex pairs at distance 2; 
denoted G?. 

stability number — of a graph G, denoted a(G): the size of the largest independent 
set in G. 

standard simplex: A = {x € R": x; >0, i=1,...,n, San Pp SeL 

strength — of a graph G: cost chromatic number of G where the cost set is N. 
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strict k-coloring — of a mixed hypergraph: vertex-coloring with exactly k colors, such 
that every C-edge has two vertices with a common color and every D-edge has two 
vertices with different colors. 

n-point suspension — of the cycle graph C,,: the graph C,, + Ky. 

Szekeres—Wilf number: same as coloring number. 

tabu search: a local search in which solutions that are worse than the current one can 
be chosen provided that they are not in any of the so-called tabu lists. 

timetabling problem: the assignment of times and resources to meetings so as to 
satisfy a set of constraints as best as possible. 

total coloring: assignment of colors to the vertices and to the edges (each vertex and 
each edge gets one color). 

__, proper: total coloring where no two adjacent or incident vertices/edges have the 
same color. 

total graph — of G = (V,E): its vertex set is VU EF, and a,y € V UE are adjacent if 
they are incident or adjacent in G; denoted T(G). 

toughness — of a graph G: defined to be +co when G is complete and otherwise to be 


min{|S|/c(G — S)|S C V(Q)}, 


where the minimum is taken over all subsets S C V(G) and c(G — S) denotes the 
number of components of G — S. Denoted tough (G). 

triangulation: graph imbedded in a surface, with all faces being cycles of length 3. 

r-uniform hypergraph: every edge has precisely r vertices. 

uniquely (vertex-)colorable graph: has just one proper coloring with the minimum 
number of colors, apart from renaming the colors. 

uniquely edge-colorable graph: has just one proper edge-coloring with the minimum 
number of colors, apart from renaming the colors. 

upper chromatic number — of a colorable mixed hypergraph: the largest number of 
colors in a strict coloring; denoted X(H). 

vertex cover — of a graph G: a subset C C V(G) such that every edge of G has at 
least one endpoint in C. 

k-vertex-critical graph: k-chromatic graph whose chromatic number decreases to 
k — 1 whenever a vertex is deleted. 

vertex-coloring: assignment of colors to the vertices (each vertex gets one color). 

___, proper: coloring where no two adjacent vertices have the same color. 

vertex-covering number — of a graph G: size of any smallest vertex cover in G; 
denoted 7(G). adjacency matrix — of a graph G: a 0-1 matrix with a row and a 
column for each vertex and entry (7,7) is 1 if and only if the ith vertex is adjacent 
to the jth vertex. 

vertex-induced subgraph: a subgrapn H = (V’, E’) of a graph G = (V, E) with the 
property that V’ C V and for any two vertices x,y € V’, (x,y) € E” if and only if 
(x,y) ¢ B. 

vertex-weighted graph: a graph G in which every vertex x is assigned a non-negative 
weight w(x). 

web graph: the graph obtained by joining the pendant points of a helm graph to form 
a cycle and then adding a single pendant edge to each vertex of this outer cycle. 
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INTRODUCTION 


An automorphism of a graph is a permutation of its vertex set that preserves inci- 
dence of vertices and edges. Under composition, the set of automorphisms of a graph 
forms a group that gives much information about both the local and the global structure 
of the graph. It may determine the graph’s connectivity structure (Section 4.2) and the 
kinds of surfaces in which it may be embedded (Sections 7.1, 7.5). It is indispensable for 
counting the number of essentially “distinct” graphs with a variety of different proper- 
ties (Section 6.3). In this section one will also encounter infinite graphs. It will usually 
be assumed that the infinite graphs are locally finite, i.e., all valences will be presumed 
to be finite, although they may be arbitrarily large. 


6.1.1 The Automorphism Group of a Graph 

NOTATION: We need to use the formal notation {u,v} (rather than uv) to represent the 
edge incident with vertices u and v for reasons that become evident in Subsection 6.1.3. 
DEFINITIONS 

D1: Given a graph X, a permutation a of V(X) is an automorphism of X if 


{u,v} € E(X) © {a(u), a(v)} € E(X), for all u,v € V(X). 
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D2: The set of all automorphisms of X, together with the operation of composition of 
functions, forms a subgroup of the symmetric group on V(X) called the automorphism 
group of X. 


NOTATION: The automorphism group of X is denoted by Aut(X). The identity of any 
group of permutations is denoted by v. 


D3: A graph is called asymmetric if 1 is its only automorphism. 


D4: A ray is a one-way infinite path; a double ray is a two-way infinite path. (The 
vertices of a ray admit a natural indexing with N; the vertices of a double ray admit a 
natural indexing with Z.) 


REMARKS 


R1: While all vertices in the same orbit of Aut(X) must have the same valence, there 
exist asymmetric graphs all of whose vertices have the same valence. 


R2: If Y is a subgraph of X, even an induced subgraph, then except in special cases 
there is little relationship between Aut(X) and Aut(Y). 


R3: When one speaks of Aut(X) being isomorphic to a group G, it is ambiguous 
whether the implied isomorphism is between abstract groups or between permutation 
groups (see Subsection 6.1.2). In the examples immediately below, the automorphism 
groups Aut(X) are abstractly isomorphic to the given groups G. 


EXAMPLES 


El: Let V(K4) = {a,b,c,d} and let X = K, — {a,c}. Then Aut(X) = {v,a,8,a} 
where a interchanges a and c but fixes both b and d, while ( fixes a and c but inter- 
changes b and d. Thus Aut(X) = Z2xZz. Strictly speaking, one should say that Aut(X) 
is isomorphic to Zz x Zz, but for brevity we abuse language in this way throughout this 
section. 


E2: The automorphism group of a circuit of length n is the dihedral group D,, with 
2n elements. 


E3: [Fr37] The automorphism group of the Petersen graph is Sym(5). 


E4: The automorphism group of a double ray is the infinite dihedral group Dy, but a 
ray is asymmetric. 


E5: The automorphism group of the underlying graph of the regular square tiling of 


the plane is generated by D,, (acting on a major axis) together with a rotation of 1/2 
about the origin. 


FACTS 


F1: For any graph X, the automorphism group of X and that of its complement X 
are identical. 
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F2: Given any finite tree, either there is a unique vertex or there is a unique edge that 
is fixed by all automorphisms. 


F3: Let the components of X be X1,...,Xx. If no two of the components are iso- 
morphic subgraphs, then Aut(X) is the direct product Pes Aut(X;). If all of the 
components are mutually isomorphic subgraphs, then A(X) is the wreath product 
Sym(k) ? Aut(X}). 


6.1.2 Graphs with Given Group 


D5: A group G of permutations of a set S acts transitively or is transitive on S 
if for every x,y € S, there exists a € G such that a(x) = y. 


D6: A graph X is said to be vertex-transitive if Aut(X) acts transitively on V(X). 
(Intuitively speaking, a vertex-transitive graph looks the same no matter from what 
vertex it is viewed.) 


D7: A group G of permutations of a set S acts doubly transitively on S if, for any 
two ordered pairs of distinct elements (x1, 72), (y1, y2) € S x S, there exists a € G such 
that a(a1) = y, and a(a2) = yo. 


D8: For i = 1,2, let G; be a group of permutations of the set S;. We say that G 
and G2 are isomorphic as permutation groups if there exist a group-isomorphism 
® : G, — G2 and a bijection f : S$; — Sg such that 


f(a(x)) = [®(a)](f(x)) for alla € Gy, « € $4, 


i.e., the diagram in Figure 6.1.1 commutes. 


Si 


Sy 


Figure 6.1.1: Isomorphism of permutation groups. 


In this case, |G| is the order of G and |S| is the degree of G. 


D9: An edge-isomorphsm from a graph X to a graph Y is a bijection 7 : E(X) > 
E(Y) such that edges e; and ez of X are incident with a common vertex of X if and only 
if n(e1) and (ez) are incident with a common vertex of Y. An edge-isomorphsim is 
an edge-automorphism from a graph to itself. 


D10: The set of edge-automorphisms of a graph X, together with the operation of 
composition of functions, forms a subgroup of the symmetric group on E(X) called the 
edge-group of X. 
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EXAMPLES 


E6: Let n denotes any cardinal, finite or infinite. The automorphism group of the 
complete graph K,, is Sym(n), and the implied isomorphism is between permutation 
groups. 


E7: For no graph X on n vertices is Aut(X) ever isomorphic as a permutation group 
to the alternating group Alt(n). This is because the permutation group Alt(n) acts 
doubly transitively on an n-set. The only graphs on n vertices whose automorphism 
group acts doubly transitively are K,, and its complement, the edgeless graph K,,, but 
the automorphism group of these two graphs is Sym(n). 


E8: The edge-groups of the 3-circuit C3 and of Ky,3 are isomorphic as permutation 
groups to each other and to Aut(C3) but are abstractly isomorphic to Aut(f1,3). 


FACTS 


F4: Every automorphism a of a graph X induces a unique edge-automorphism 79; 
namely, if {u,v} € E(X), then na({u, v}) = {a(u), a(v)}. The converse is not true. 


NOTATION: From here on, unless stated otherwise, one understands the term “isomor- 
phic” between groups to mean “abstractly isomorphic”. 


F5: [HarPa68] The permutation group induced by Aut(X) on E(X) is isomorphic to 
Aut(X) if and only if X has at most one isolated vertex and K is not a component of 
Xx, 


F6: Frucht’s Theorem [Fr38]: Given any group G, there exist infinitely many con- 
nected graphs X such that Aut(X) is isomorphic to G. Moreover, X may be chosen to 
be 3-valent [Fr49]. 


This result was extended by G. Sabidussi [Sa57] as follows: 


F7: In addition to having Aut(X) isomorphic to a given group G, one may further 
impose that X 
e has connectivity « for any integer K > 1, or 


has chromatic number c for any integer c > 2 (see Section 5.1), or 
e be r-valent for any integer r > 3, or 
e be spanned by a graph Y homeomorphic to a given connected graph Y. 


For most of the graphs X constructed by Frucht and Sabidussi, the degree of Aut(X), 
namely, |V(X)|, is several times as large as its order |A(X)|. Consequently, given a 
group G, there has been interest in seeking the smallest graph X such that A(X) is 
isomorphic to G. (The situation where the order and degree of A(X) are equal and 
A(X) acts transitively on V(X) is the subject of Subsection 6.1.6.) 


NOTATION: For a finite group G, let ~4(G) denote the least |V(X)| such that Aut(X) is 
isomorphic to G. 


F8: The asymmetric graph with the fewest edges is obtained from a path of length 
5 by adjoining a new edge to a vertex at distance 2 from an end-vertex of the path, 
yielding a tree on seven vertices. Thus pu({}) = 7. 
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F9: [Bab74] If G is a nontrivial finite group different from the cyclic groups of orders 
3, 4, and 5, then u(G) < 2|G]. 


F10: p(Z3) = 9; u(Z4) = 10; u(Zs) = 15. (See [Sa67].) 


FURTHER READING 
The automorphism groups of all the generalized Petersen graphs are presented in 
detail in [FrGraWa71]. 


6.1.3. Groups of Graph Products 


In this subsection, we use the symbol & to indicate an arbitrary graph product X&Y 
of graph X by graph Y, where we define a graph product of X by Y to be the graph 
with vertex set V(X) x V(Y) and whose edge set is determined in a prescribed way by 
(and only by) the adjacency relations in X and in Y. It has been shown (see [Im1z75]) 
that there exist exactly 20 graph products that satisfy this definition. One is generally 
interested in products that are associative, in the sense that, for all graphs W, X,Y, the 
graphs (W&xX)&Y and W&(X&Y) are isomorphic. 


DEFINITIONS 
The four most commonly used associative graph products are now defined. 


D11: Let Z be a graph product of arbitrary graphs X and Y. Let 21,22 be (not 
necessarily distinct) vertices of X, and let y1,y2 be (not necessarily distinct) vertices 
of Y. Suppose that {(x1, y1), (v2, y2)} € E(Z) if and only if 


e [{x1, 02} € E(X) and y = ye] or [v1 = x2 and {y1, y2} € E(Y)]. Then Z is the 
cartesian product of X by Y, and we write Z = XHY; 


{21,02} € E(X) and y, = yo] or [a, = x and {y1, yo} € E(Y)] or [{x1, 22} € 
E(X) and {y1,y2} € E(Y)]. Then Z is the strong product of X by Y, and we 
write Z= XY; 


{x1,t2} € E(X) and {yi,y2} € V(Y). Then Z is the categorical product 
or weak product of X by Y (also called the Kronecker product or direct 
product), and we write Z = X x Y; 


e {x1, 22} € E(X) or [x, = x and {y, y2} € E(Y)]. Then Z is the lexicographic 
product of X by Y, and we write Z = X[Y]. 


These four products are illustrated in Figure 6.1.2 wherein both X and Y denote 
the path of length 2. 


D12: A graph X is a divisor of a graph Z (with respect to a product &) if there 
exists a graph Y such that Z = X&Y or Z=Y&X. 


D13: A graph Z is prime (with respect to a given product &) if Z has no proper 
divisor, i.e., no divisor other than itself and the graph consisting of a single vertex. 


D14: Graphs X and Y are relatively prime (with respect to a given product &) if 
they have no common proper divisor. 


Section 6.1. Automorphisms 595 


Xoy XBY XxY X[Y] 
e @ @ e @ @ eo _——_e_® eo _—_e—_® 
4 x x x 


Figure 6.1.2: The four products of the 2-path by the 2-path. 


D15: Let G and H be groups of permutations of sets S and T, respectively. We define 
the wreath product G2? H to be the group of permutations of S x T with the property 
that, for each 7 € G2? H, there exists a € G and for each s € S there exists 8; € H 
such that m(s,t) = (a(s), Bas)(t)) holds for all (s,t) € S x T. The group operation is 
componentwise composition. 


GENERAL FACTS 


F11: The lexicographic product is the only one of these four products that is not 
commutative. In fact, if X[Y] = Y[X], then either both X and Y are complete, or both 
are edgeless, or both are powers (with respect to the lexicographic product) of the same 
graph W. (See [Im69].) 


F12: The lexicographic product is the only one of these four products that is self- 
complementary, in the sense that for any graphs X and Y, we have X [Y] = X [Y]. 


FACTS ABOUT CONNECTEDNESS 


F13: The cartesian (respectively, strong) product of two graphs is connected if and 
only if each factor is connected. 


F14: [Weic62] Let X and Y be graphs with at least one edge. Then X x Y is connected 
if and only if both X and Y are connected and not both X and Y are bipartite. 


F15: The lexicographic product X[Y] is connected if and only if X is connected. 


FACTS ABOUT FACTORIZATION 


F16: (Sa60] Every connected graph has a unique prime factorization with respect to 
the cartesian product and with respect to the strong product. 


F17: Finite nonbipartite graphs have a prime decomposition into nonbipartite divisors 
with respect to the categorical product. 


F18: [Im72] Any prime decomposition with respect to the lexicographic product can 
be transformed into any other by transpositions of edgeless or complete divisors. Thus 
if a graph has such a prime decomposition without such divisors, then it is unique. 
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FACTS ABOUT AUTOMORPHISM GROUPS OF PRODUCTS 


F19: Each of the four product graphs X&Y named in D11 is vertex-transitive if and 
only if both X and Y are vertex-transitive. 


F20: [Sa60] If X is connected, then Aut(X) is generated by the automorphisms of its 
prime divisors with respect to the cartesian product and the transpositions interchanging 
isomorphic prime divisors. 


An important corollary is the following: 


F21: Let X be the cartesian product X = X, Xx of pairwise relatively prime 
connected graphs. Then Aut(X) is the direct a deh ines , Aut(X;). 


The following notation is needed in order to characterize the group of the lexico- 
graphic product. 


NOTATION: For a subgraph Y of X, let 
OY) = {a € V(X)\V(Y) : {a,y} © E(X) for some y € V(Y)}. 


If Y consists of a single vertex Y = {y}, we write simply O(y). 
NOTATION: We define binary relations R(X) and S(X) on V(X) by 


e (u,v) € R(X) & Au) = Av); 
e (u,v) € S(X) > Ou) U {u} = (wv) U {ov}; 
e (u,v) EC A(X) Su=v. 


F22: [Sa61] Let X and Y be graphs that are not edgeless. Then Aut(X[Y]) = Aut(X)? 
Aut(Y) if and only if the following two conditions hold: 

(i) R(X) 4 A(X) = Y is connected, and 

(ii) S(X) 4 A(X) = Y is connected. 


FURTHER READING 
For a comprehensive and up-to-date treatment of all graph products, see [ImK100]. 


6.1.4 Transitivity 


DEFINITIONS 


D16: If G is a group of permutations of a set S and x € S, then the stabilizer of x 
(in G) is the subgroup G, = {a € G: a(x) = x}. 


NOTATION: The stabilizer of a vertex u € V(X) is denoted by Aut,,(X). 


D17: A graph X is edge-transitive if, given e1, eg € E(X), there exists a € Aut(X) 
such that a(e,) = a(e2). 


D18: A graph X is arc-transitive if given ordered pairs (u1, v1), (u2, v2) of adjacent 
vertices, there exists a € Aut(X) such that a(u1) = v1 and a(ug) = vo. 
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D19: A graph that is vertex-transitive and edge-transitive but not arc-transitive is 
half-transitive. 


D20: A graph with constant valence that is edge-transitive but not vertex-transitive 
is semisymmetric. 


The following definition is useful when discussing infinite graphs. 


D21: Given a graph X, let d(k) denote the number of vertices at distance & from some 
given vertex, and let a be some real number such that a > 1. Then the growth of X is 
defined to be gr(X) = liminf,_,..[d(k)/a*). 


e If gr(X) > 0, then X has exponential growth. 
e if gr(X) =0, then the growth is subexponential. 
e Ifr >1and0 < lim infyses [Do d(j)/k"] < co, then X has polynomial growth 
of degree r. 
FACTS 
F23: If X is vertex-transitive, then for any u,v € V(X) we have: 
e Auty(X) and Aut,(X) are conjugate subgroups of Aut(X); 
e |Auty(X)| = |{a € Aut(X) : a(u) = v}|; 
e If X is finite, then |Aut(X)| = |Aut,,(X)] -|V(X)]. 
F24: If a vertex-transitive graph is not connected, then its components are pairwise 
isomorphic and each is vertex-transitive. 


F25: If a graph X is edge-transitive but not vertex-transitive, then it is bipartite. 
In this case, Aut(X) induces exactly two orbits in V(X), namely, the two sides of the 
bipartition. 


F26: [Fo67] The smallest semisymmetric graph is 4-valent and has 20 vertices. 
F27: Ifa graph is arc-transitive, then it is both vertex-transitive and edge-transitive. 


F28: Tutte’s Theorem [Tut66]: Every finite half-transitive graph has even valence. 


Finite half-transitive graphs do indeed exist, though they are not plentiful: 


F29: [Bouw70] For every positive integer n, there exists a half-transitive graph on 9-6” 
vertices with valence 2(n + 1). 


F30: [Ho81] The smallest half-transitive graph has 27 vertices and is 4-valent. 


F31: [ThWa8s9] Every infinite half-transitive graph of subexponential growth has even 
valence. However, there exist half-transitive graphs with exponential growth that have 
odd valence. 
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F32: [Tr85] Let X be a connected vertex-transitive, infinite, locally finite graph. Then 
the following are equivalent: 


e X has polynomial growth; 


e There is a system S of imprimitivity of Aut(X) on V(X) with finite (possibly 
singleton) blocks such that Aut(X)/S is a finitely generated nilpotent-by-finite 
group and stabilizers in Aut(X)/S of vertices in X/S are finite. 


Combining F32 with the result of H. Bass [Bas72] that the rate of growth of a nilpo- 
tent group is always polynomial yields the following: 


F33: If the growth of a connected locally finite vertex-transitive graph X is not 
greater than that of any polynomial, then X has polynomial growth, ie., if 0 < 
lim Tica pea d(i)/k"| < co for some real number r, then r must be a positive 
integer. 


EXAMPLES 


E9: Kin.» when m ¥ n is edge-transitive but not vertex-transitive. 


E10: The graph P,, of the n-sided prism is the cartesian product C,U Kk. If n = 3 or 
n > 5, then Aut(P,) = D, x Za. The graph is vertex-transitive but it has two edge- 
orbits. Thus, although Aut(x) and the edge-group are (abstractly) isomorphic, they are 
not isomorphic as permutation groups. However, P, is the graph of the 3-dimensional 
cube; it is arc-transitive and |Aut(P,)| = 48. 


E11: The lexicographic product of an n-circuit by the edgeless graph on k vertices is 
arc-transitive. 


E12: Let D = D, be a double ray, and for n € N define Dy41 = D,UD or Dy+i = 
D, & D. Then D,, has polynomial growth of degree n for all n € N. 


E13: The cartesian, strong, or lexicographic product of the (infinite) r-valent tree for 
r > 3 with any finite connected graph has exponential growth. 


E14: An infinite half-transitive graph may be constructed by taking a two-way infinite 
sequence of copies of Folkman’s graph (see F26) with bipartition {V;, V2} and identifying 
the vertices in V2 of the nth copy with the vertices in V; in the (n+ 1)st copy for n € Z 
(cf. [ThWa8s9]). This graph has linear growth. 


FACTS 


F34: [Gri85] There exist r-valent graphs for small values of r that have subexponential 
growth but grow faster than any polynomial. (This is called intermediate growth.) 
They are Cayley graphs (see Section 6.2) of groups with the same rate of growth. These 
groups are generated by a set of four elements, and all elements have finite order, but 
these groups are not finitely presentable. 


F35: [Se91] Finitely generated groups with intermediate growth cannot act vertex- 
transitively on connected locally finite graphs of polynomial growth. 


FURTHER READING 
For a detailed list of conditions for the existence and non-existence of semisymmetric 
graphs, see [Iv87]. 
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6.1.5 s-Regularity and s-Transitivity 


In the next three subsections we consider some refinements of transitivity of auto- 
morphism groups of graphs and begin by reviewing two more notions from the theory 
of permutation groups. 


DEFINITIONS 


D22: If G is a group of permutations of a set S, then G acts semiregularly if 
G, = {v} for alla eS. 


D23: A permutation group acts regularly or is regular if G acts both transitively 
and semiregularly. 


D24: For s > 0, an s-arc in a graph X is a directed walk of length s in which 
consecutive edges are distinct. 


D25: A graph X is s-transitive if it contains at least one s-arc and Aut(X) acts 
transitively on the set of all s-arcs. The terms 1-transitive and arc-transitive are syn- 
onymous. 


D26: A graph X is s-regular if it contains at least one s-arc and Aut(X) acts regularly 
on the set of all s-arcs. 


D27: An m-cage is a smallest 3-valent graph with girth m. 


FACTS 


F36: Let G be a group of permutations of a finite set S. Then the following three 
statements are equivalent: 


1. G acts regularly on S. 
2. For all x,y € S, there is a unique a € G such that a(x) = y. 


3. G acts transitively on S and |G| = |S]. 


F37: [Tut66] Let X be a connected s-transitive graph with no pendant vertex that is 
not a circuit. Then 


e X is r-transitive for 1 <r< s; 

es < $7(X) +1, where 7(X) denotes the girth of X; 

e if X is not s-regular, then X is (s + 1)-transitive. 
F38: [Weis74] Let X be a finite (1 + p’n)-valent graph, where p is prime, r > 1, and 
1<n<p. If Aut(X) contains a subgroup that acts regularly on the s-arcs of X, then 


s <7and s #6. (This result has been generalized to infinite graphs of polynomial 
growth. See [Se91].) 


EXAMPLES 


E15: A circuit is s-transitive for all s > 0. 
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E16: For all r > 2, the infinite r-valent tree is s-transitive but not s-regular for all 
s>0. 


E17: The graphs of the cube and the dodecahedron are 2-regular. 


E18: Ky, for n > 2 is 2-transitive but not 1-transitive. It has pendant vertices (cf. 
F37). 


E19: Kz, is the unique 3-cage; K3,3 is the unique 4-cage. 
E20: The Petersen graph is 3-regular. It is the unique 5-cage. 
E21: The unique 6-cage is the 4-regular Heawood graph H, defined as follows. Let 


V(H) be the cyclic group Z14. For each j = 0,...,6, let the vertex 27 be adjacent to 
the three vertices 27 — 1, ,27 +1, and 27 +5. 


6.1.6 Graphical Regular Representations 


DEFINITIONS 


D28: Given a group G, a graph X such that Aut(X) is isomorphic to G and acts 
regularly on V(X) is called a graphical regular representation, or GRR, of G. 


D29: A generalized dicyclic group is a (finite or infinite) group G with the following 
properties: 


e G contains an abelian subgroup A of index 2; 
e There exists b € G\A such that bab! = a7! for all a € A; 
e There exists an element ap € A of order 2m where m > 2; 


e San. 


FACTS 

F39: The 8-element quaternion group Q is the smallest generalized dicyclic group. 
F40: If X is a GRR of G, then |V(X)| = |G]. (In the finite case, this would follow 
directly from F23.) It will be seen in Section 6.2 that X must be a Cayley graph of 
G. Of importance is that the automorphism group of every Cayley graph contains a 


subgroup that acts regularly on its vertex set. If the Cayley graph is a GRR, that 
subgroup is exactly the whole automorphism group of the graph. 


EXAMPLES 


E22: Ko isa GRR of Zo. 
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E23: The dihedral group D,, has the presentation 
(a; 6: a =b* = (ab)? Si): 


The Cayley graph Cay(D,,,.9) with connection set S = {a,a~!,b, ba, ba®} (see Subsec- 
tion 6.2.1) is a GRR of D, when n > 6. If 3 <n <5, then the vertex-stabilizers have 
order 2. 


FACTS 
The first several facts answer questions as to which abstract groups do and which 
do not admit a GRR. 


F41: [Ch64, Sa64, Im70] The only abelian groups that admit a GRR are the elementary 
abelian groups of order 2” forn = 1 andn>5. 


F42: [No68, Wa71] No generalized dicyclic group admits a GRR. 
F43: The following ten groups do not admit a GRR, namely: 
e the dihedral groups D,,, for n = 3,4,5 [Wa71]; 
e Alt(4) [Wa74]); 
e QxZ, for n = 2,3, 4 [Wa72]; 
e the nonabelian group of order 27 and exponent 3 [NoWa72b]; 
e the nonabelian group of order 18 and exponent 6, another group of order 16, and 
another group of order 24 [NoWa72a, Wa72]. 
F44: [He76] Every finite solvable group which is non-abelian, non-generalized dicyclic, 
and not one of the ten groups listed in F43 admits a GRR. 


F45: [Go81] Every finite non-solvable group admits a GRR. 


F46: [ImWa76] Let G be a finite abelian group. Then there exists a graph X such 
that Aut(X) contains a regular subgroup of index < 2 isomorphic to G unless G is one 
of the seven groups 


Ts U5, Da Igy La We, Te, Tend Ze. 


F47: [BabGo82] Let G be a nilpotent non-abelian group of odd order g. Let C(G) be 
the set of all graphs whose automorphism group contains a regular subgroup isomorphic 
to G. Then almost all the graphs in C(G) are GRR’s of G, i.e., there are only 0(29-))/?) 
exceptions as g —> 00. 


We close this subsection with a result about infinite GRR’s. 
NOTATION: If n is a positive integer, let [n] = {7 € Z:1< 7 <n}. 


F48: [Wa76] Let n > 2, and let {G; : j € [n]} be a family of pairwise-disjoint groups 
such that r(G;) < No, where r(G;) denotes the cardinality of a smallest generating set 
for G;. Then the free product G = [| ¢;,) Gj admits a GRR. If ))(¢),) 7(G;) is finite, 
then G admits a locally finite GRR. 
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6.1.7 Primitivity 
DEFINITIONS 


D30: Let G be a group of permutations of a set S. A subset B C S is called a block 
(of imprimitivity with respect to G) if for all a € G, either a(B) = B or a(B)N B= 9. 


e Clearly @,S, and the singleton subsets of V are blocks; they are called trivial 
blocks. 


e If G acts transitively and admits no block other than the trivial blocks, then G is 
primitive on S. 


e If G acts transitively but admits nontrivial blocks, then G is imprimitive on S. 


e If G is imprimitive on S and B is a nontrivial block, then the set of images of B 
under G forms a partition of S, called a system of imprimitivity. 


D31: A graph is said to be primitive if its automorphism group acts as a primitive 
permutation group on its vertex set. 


D32: A lobe of a graph is a maximal 2-connected subgraph. 


EXAMPLES 


E24: An n-circuit (n > 3) is primitive if and only if n is prime. On the other hand, if 
d is a proper divisor of n, then a nontrivial block is obtained by starting at any vertex 
v and selecting all vertices lying at distance a multiple of d from v. In this case, there 
exists a system of imprimitivity consisting of d blocks, each of size n/d. 


E25: kK, is primitive for all n, but K,,,, is never primitive; the two sides of the 
bipartition form a system of imprimitivity of V(Kp jn). 


E26: The Petersen graph is primitive. 


E27: Except for the tetrahedron, the graphs of all the Platonic solids are imprimitive. 
The blocks of imprimitivity are the pairs of antipodal vertices. 


FACTS 


F49: The vertex sets of the components of a vertex-transitive graph X are blocks 
of imprimitivity of Aut(X). Hence primitive graphs with nonempty edge-sets are con- 
nected. 


F50: In a primitive graph, the connectivity & must equal the valence. Otherwise the 
graph has nontrivial atoms (see §4.2.3), and the family of vertex sets of the atoms is a 
system of imprimitivity. 


F51: [GraWa88] Let X be a finite planar graph. Then X is primitive if and only if it 
is Ko, Kg, a circuit of prime length, or an edgeless graph. 


We state five primitivity results for infinite graphs. 
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F52: [JuWa89] Let n be a nonnegative integer. There exists an infinite, locally finite, 
primitive graph with connectivity « = n if and only if n F 2. 


F53: [JuWa77b] Let X be a vertex-transitive graph with connectivity K(X) = 1. A 
necessary and sufficient condition for X to be primitive is that it have no cut-edge and 
that its lobes themselves be primitive and pairwise isomorphic. 


F54: [WaGra04] An inifinite, locally finite, planar graph X is primitive if and only if 
«&(X) = 1, X has no cut-edge, and there exists an integer m > 2 such that every vertex 
of X is incident with exactly m lobes. Moreover, all of the lobes are isomorphic to K4 
or all are p-circuits for some fixed odd prime p. 


F55: [Sm10] Let A be an infinite, imprimitive directed graph with connectivity «(A) = 
1. The associated (undirected) graph of A is primitive if and only if it has no cut-edge 
and, for some odd prime p, the lobes of A are pairwise-isomorphic directed p-circuits. 


F56: [GolmSeWaWo89] Let X be a locally finite, connected, vertex-transitive graph 
of polynomial growth. Then X is not primitive. 


6.1.8 More Automorphisms of Infinite Graphs 


The overall “shape” of an infinite graph can be described effectively with the aid of 
the concept of the ends of the graph. This notion, previously used in describing infinite 
groups, was first applied to graphs by R. Halin [Hal64]. In this subsection, the symbol 
X will always denote an infinite graph. 


NOTATION: Given a graph X, let R(X) denote the set of all rays in X. 
DEFINITIONS 


D33: Let Ri, Ro € R(X). We say that R, and Ry are end-equivalent and write 
R, ~ Rez if there exists Rg € R(X) such that both V(R3M R,) and V(R3M R2) are 


infinite. 
It is not hard to show that ~ is an equivalence relation on the set R(X). 


D34: The equivalence classes with respect to ~ are the ends of X. 


NOTATION: €(X) will denote the set of ends of X, and we define the cardinal «(X) = 


|E(X)]. 


D35: If Y isa subgraph of X and R € R(X), then R is contained in Y if Y contains 
a subray of R. If Ry, Ro € R(X), then Y separates R, and Rz if these two rays are 
contained in distinct components of X —Y. 


D36: An automorphism a of X is a translation if it fixes no finite nonempty subgraph, 
ie., if a(Y) = Y implies that V(Y) is empty or infinite. 


D37: A subgroup G < Aut(X) acts almost transitively on V(X) if its action induces 
only finitely many orbits. We say that X is almost transitive if Aut(X) acts almost 
transitively. 
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D38: A torsion subgroup of an infinite group G is a subgroup all of whose elements 
have finite order. 


FACTS 


F57: K6nig’s “Unendlichkeitslemma” [K36]: Let $1, 55,... be a (countably infi- 
nite) sequence of finite, nonempty, pairwise-disjoint subsets of V(X). Suppose that for 
each positive integer n, each vertex in S;,,1 is adjacent to some vertex in S;,,. Then X 
contains a ray with vertices 71,2%2,..., where x, € S, for alln EN. 


F58: Ifa graph Y contains no ray at all, then either Y is finite or is infinite but not 
locally finite. In this case e(Y) = 0. 


F59: [Hal64] Let Ri, Ro € R(X). The following statements are equivalent: 
e Ri ~ Ro; 
e no finite subgraph of X separates R, and R2; 
e X contains infinitely many pairwise-disjoint (finite) R,; R2-paths. 


F60: When X is locally finite, then «(X) equals the supremum of the number of infinite 
components of X — S as S' ranges over all finite subsets of V(X). 


Clearly Aut(X) induces a group of permutations of €(X). 


F61: Given an automorphism a of a connected locally finite graph, either all orbits 
of @ are infinite (in which case a is a translation) or all orbits of a are finite (although 
they may be arbitrarily large). 


F62: [Hal73] Every translation of a connected locally finite graph fixes some double 
ray and at most two ends. 


F63: [Hal73] If X is connected and locally finite, and if Aut(X) contains a translation, 
then «(X) = 1, 2, or o. 


By combining this result with a theorem of H. A. Jung [Ju81], we obtain the following 
important classification of almost transitive infinite graphs. 


F64: If X is connected, locally finite, and almost transitive, then e(X) = 1, 2 or 2%. 


F65: Suppose that X is almost transitive. If X has linear growth, then e(X) = 2 (cf. 
“strips” below). If X has polynomial growth of degree d > 2, then e(X) = 1. If X has 
exponential growth, then €(X) = 1 or 2%°, 


F66: [SeTr97] Ifa graph X is almost transitive and has quadratic growth, then Aut(X) 
contains a subgroup isomorphic to Z @ Z that acts on V(X) with finitely many orbits. 


F67: [SeTr97] There are only countably many nonisomorphic almost transitive graphs 
with linear or quadratic growth. 


F68: [Se91] Let X be connected, locally finite, and vertex-transitive. Then Aut(X) 
is uncountable if and only if it contains a finitely generated subgroup of exponential 
growth that acts transitively on V(X). 
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F69: [Hal68] Every end of a graph contains a family of pairwise disjoint rays of 
maximum cardinality, i.e., if an end of a graph contains arbitrarily large finite families 
of pairwise disjoint rays, then that end contains an infinite family of pairwise disjoint 
rays. In this same sense, every graph contains a family of pairwise disjoint double rays 
of maximum cardinality. 


F70: [BabWa80] Suppose that X is connected and locally finite. If Aut(X) contains 
a torsion subgroup T that acts almost transitively on V(X), then e(X) = 1. If T acts 
transitively on V(X) and X is r-valent, then X has connectivity «(X) > 3(r +1) and 
edge-connectivity \(X) =r. 


REMARK 


R4: In F70 it is not known whether, under the stated hypotheses, the lower bound 
3(r +1) is attainable, although examples X exist for which K(X) = 4(r +1). More 
generally, if X is merely locally finite and vertex transitive, then it is known that 
K(X) > (r+ 1), and this bound is indeed sharp (see [JuWa77a]). 


EXAMPLES 


E28: The complete bipartite graph K,,x,, where n € N, has zero ends, because it 
contains no rays. Of course, it is not locally finite. Since it has finite diameter, its 
growth is not defined. 


E29: Let D be a double ray, and let Y be any connected graph. If Y is infinite, then 
the cartesian product DUY has exactly one end. However, if Y is a finite, then DAY 
has exactly two ends. In fact, DUIY is a strip (see below). 


E30: Let V(X) =ZxZ and let E(X) consist of all edges of the forms {(m,n), (m,n+ 
1)} and {(m, 0), (m+1,0)} for m,n € Z. Then X has quadratic growth and e(X) = No. 
By F64, X cannot be almost transitive. In fact, Aut(X) clearly has No orbits. 


E31: The cartesian, strong, or lexicographic product of the (infinite) r-valent tree for 
r > 3 with any finite connected graph has 2*° ends. 


E32: There are three so-called regular tessellations of the Euclidean plane, namely, 
1. six congruent equilateral triangles meeting at each vertex, or 
2. four congruent squares meeting at each vertex, or 
3. three congruent regular hexagons meeting at each vertex. 


Their underlying graphs all have quadratic growth and exactly one end. 


E33: The regular tessellations of the hyperbolic plane (e.g., four congruent pentagons 
meeting at every vertex) also have exactly one end, but their growth is exponential. 
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Strips 
A special class of 2-ended graphs is of some interest. 
DEFINITIONS 


D39: A connected graph X is called a strip if there exists a connected subgraph Y 
of X and an automorphism a € Aut(X) such that O(Y) and Y — a(Y) are finite and 
a(Y UA(Y)) CY. 


D40: The infinite connectivity of X, denoted k.(X), is the minimum cardinality 
of a set S C V(X) such that X — S has at least two infinite components. If no such 
minimum exists, then k..(X) = oo. 


REMARK 


R5: In §4.2.3, the notions of fragment and atom are presented with respect to the 
connectivity « of graph. These terms may also be defined with respect to the more 
restrictive parameter of infinite connectivity with very similar results. In particular, 
distinct Ko-atoms are disjoint. (See [JuWa77a].) 


FACTS ABOUT STRIPS 


F71: Let X be a connected infinite graph. Then the following statements are equiva- 
lent: 


e X isa strip. 


e X is locally finite and Aut(X) contains an automorphism with finitely many orbits 
[JuWas4]. 


e X is locally finite, e(X) = 2, and Aut(X) contains a translation [ImSe88}. 


F72: [ImSe88] Let X be connected, locally finite, and vertex-transitive. Then X has 
linear growth if and only if X is a strip. 


F73: [JuWa84] Suppose that X is connected and that Aut(X) contains an abelian 
subgroup H that acts transitively on V(X). Then either 


© Koo(X) = 00, Le, e(X) = 1, or 
e X isastrip and H = Z6 F for some finite abelian group F. 
F74: [Wa91] If a strip is edge-transitive, then all vertices have even valence. 


F75: [Wa91] Let S be a planar edge-transitive strip with connectivity K(.S) = k > 3. 
Then V(S) = (Z x Z)/p where p = {{(a,-«), (—x +k,a +k)}: a € Z} and the vertex 
(x, y)p is adjacent to (x,y+1)p and (2 +1,y)p for all (2, y) € Z x Z. (In this case, $ 
admits a regular embedding on an infinitely long circular cylinder obtained by “rolling 
up” the Euclidean plane at a 45° angle to the major axes.) 


The next result says that strips can be found as subgraphs of multi-ended graphs. 


F76: [Ju94] Let X be locally finite and suppose that a subgroup G < Aut(X) fixes 
some 2-subset of €(X). If G contains a translation, then there is a G-invariant induced 
subgraph S of X that is a strip (with respect to G), and OY is finite for every component 
Y of X —S. 
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Automorphisms and Distance 


DEFINITIONS 


D41: An automorphism a € Aut(X) is bounded if there exists M > 0 such that for 
all v € V(X), the distance d(v, a(v)) < M. 


D42: A path or ray or double ray is said to be geodetic if it contains a shortest path 
joining any two of its vertices. A geodetic double ray is a geodesic. 


D43: Let R be a ray or a double ray in X. The straightness o(R) of R is defined to 
be 


o(R) = liminf ah) ; 
dr(u,v)00 dr(u, v) 


where u,v € V(R) and dr(u,v) is the length of the subpath of R joining u and v. 
D44: A ray or double ray R is metric if o(R) > 0. 

D45: Ifa € Aut(X), then a ray R is a-essential if a"(R) C R for some positive 
integer n, and a is of metric type if there exists a metric a-essential ray. 

FACTS 


F77: [JuWa84] The set of bounded automorphisms of a graph X forms a normal 
subgroup of Aut(X). 


The following result extends Konig’s Unendlichkeitslemma: 


F78: [Wa86] If X has infinite diameter, then for each vertex u € V(X) and for each 
end of X, there exists a geodetic ray R belonging to that end that originates at u. 


F79: 0 <o(R) < 1 holds for any ray or double ray R. If R is a geodesic, then o(R) = 1, 
but not conversely. 


F80: [PoWa95] If a translation 7 € Aut(X) fixes some metric double ray, then every 
double ray fixed by a nonzero power of 7 is also metric. 


F81: [PoWa95] Suppose that a translation t € Aut(X) fixes some metric double 
ray Do. If o(Do) < 1, then some power of 7 fixes a metric double ray D, such that 
a(D,) > o(Do). If o(Do) = 1, then Do is a geodesic. 


F82: [JuNi94] If 7 € Aut(X) is a translation of metric type, then 


sup{o(R) : R is r-essential} = 1. 


EXAMPLE 


E34: Let V(X) = Z,, x Z where each vertex (x, y) is adjacent to vertices (x+1,y) and 
(x,y +1), the first coordinate being read modulo m. Let r(x, y) = (a +1,y +1) for all 
(x,y) € V(X). Let D be the double ray with edges of the forms {(#, x), (%,2+1)} and 
{(z,v+1),(a+1,2+1)}. Then o(D) = 1/2, and so D is metric but not geodetic. Note 
that all rays with vertex set {(z,y) : y € Z} are r-essential, as they are fixed by 7”. 
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6.1.9 Distinguishability 
DEFINITIONS 


D46: Given a group G of permutations of a set V and d € N, we say that the action of G 
on V is d-distinguishable if there exists a partition of V into d cells (or “color classes” ) 
such that the only element of G that fixes (setwise) every cell in the partition is the 
identity v. Such a partition is called a distinguishing coloring. The distinguishing 
number of this group of permutations is the least number d such that a distinguishing 
coloring with exactly d cells exists. 


D47: A graph X is d-distinguishable if there exists a partition of V(X) into d cells 
such that the only element of Aut(X) that fixes (setwise) every cell in the partition is 
the identity v. The distinguishing number of X is the least number d such that X is 
d-distinguishable. 


One may think of the distinguishing number of a graph as the least number of colors 
in a coloring of its vertex set such that every nonidentity automorphism maps some 
vertex of one color to some vertex of a different color. 


D48: The distinguishing chromatic number of a graph X is the least number 
of color classes in a distinguishing partition of X that is at the same time a proper 
vertex-coloring of X. 


NOTATION: The distinguishing number of a graph X is denoted by D(X), and the 
distinguishing chromatic number of X is denoted by D,(X). Clearly for any graph X 
with chromatic number x(X), one has D(X) > max{x(X), D(X)}. 


FURTHER READING 

The main topic of the remainder of the subsection is the distinguishing number 
of finite and infinite graphs. See [ColHovTre09] for more information concerning the 
relationship between this parameter and the distinguishing chromatic number. The 
reader interested in the distinguishing number of a permutation group acting on other 
combinatorial objects as well as on certain graphs may consult [Chan06] and [ConTuc11]. 


D49: Given a map M (see §7.6.1), the distinguishing number of M is the distin- 


guishing number of the group of map-automorphism (see §7.6.5) acting on the vertices 
of M. 


EXAMPLES 


E35: Since a graph and its complement have the very same automorphism group, 
D(X) = D(X) holds for all graphs X. 


E36: D(K,) =n for all n. 
E37: D(Kinn) =n ifm <n but D(Ky») =n+1. 


E38: If X is the graph Ko, with the edges of a 1-factor deleted, then its complement X 
is the disjoint union of n copies of Ky. Thus Aut(X) = Sym(n)?Z2, and so D(X) = (5). 
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E39: Let V(C,) = {20,21,.--,@n—1} be labeled in the natural way. If n > 6, then 
D(C,) = 2 with S = {xo, 11,23} being one color class and its complement being the 
other. However, for 3 <n < 5, every subset of V(C,,) is invariant under some reflection, 
but a third color can “break” that reflection, in which case D(C,,) = 3 (cf. E23). 


E40: The vertices of the Kneser graph K(n,k) are the k-subsets of an n-set (1 << k < 
n/2), and two such vertices are adjacent when their corresponding subsets are disjoint. 
All Kneser graphs K(n,k) have distinguishing number 2 except for the K(5,2), which 
is the Petersen graph, and K(n,1), which is complete [AlBout07]. In particular, the 
distinguishing number of the Petersen graph is 3 [AlCol96]. 


E41: Here is an example of a locally finite graph with infinite distinguishing number. 
Start with a ray R, where V(R) = {xo, 11, X2,...} indexed in the natural way. For each 
n €N, append to zx, exactly n pendant edges. 


FACTS 


F83: [AlCol96] For any finite group G there exists a graph X such that Aut(X) =G 
and D(X) = 2 (cf. F7). 


F84: [AlCol96] Let X be a graph and let G be a finite group and let m be the number 
of groups in a longest ascending chain of proper subgroups of G. If Aut(X) = G, then 
D(X) < m. 


F85: [AlCol96] If a nontrivial group G has the property that all its subgroups are 
self-conjugate (this includes abelian groups), and if Aut(X) = G, then D(X) < m. 


F86: [ColTre06] For any finite graph X, D(X) < A(X) +1, where A indicates the 
maximum valence. Equality holds if and only if X is of one of the graphs Ky, Kn n, or 
Cs. 


An extension of F86 to infinite graphs is the following: 


F87: [ImKI1Tr07] If X is a connected infinite graph with no vertex of valence greater 
than m, then D(X) < m. 


F88: [Tucll] There exist only finitely many maps M on orientable surfaces with 
D(M) > 2. Only four maps exist for which the distinguishing number of the subgroup 
of orientation-preserving automorphisms equals 3. 


Distinguishing Number of Graph Products 


NOTATION: The symbol m will denote any cardinal. If & denotes one of the graph 
products defined in D11, then the product of m copies of a graph X will be denoted 
by X£™. In particular, KY™ is the hypercube of dimension m, which is conventionally 
denoted by Qin. 


F89: D(Qm) = 2 for all m > 4. (For finite m, see [BogCow04]; for transfinite m, see 
[ImK1Tr07].) 
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F90: [ImKI1Tr07] If m and n are infinite cardinals, then 
DOR) SD Rey Se, 
but ifn > 2™, then D( Ky, O Ky) > m. 


F91: [ImK106] Let X be any connected graph, and let k > 2 be any integer. Then 
D(X"*) = 2 with the three exceptions of Q2, Q3, and K3, which have distinguishing 
number 3. 


Analogous results hold for the strong product and the categorical product as follows, 
but with some restrictions. 


F92: [ImK106] Let X be a connected graph that satisfies the following property: for 
no two distinct vertices u,v € V(X) does it hold that every vertex other than u and v 
is adjacent to either both u and v or to neither u nor v. Then for all k > 2, one has 
D(X™r) = 2. 


F93: [ImK106] Let X be a connected, nonbipartite graph with the property that no 
two vertices have the same closed neighborhood (see §5.3.4). Then for all k > 2, one 
has D(X**) = 2. 


More Distinguishing Number Results for Infinite Graphs 


DEFINITIONS 
D50: The n-sphere of a vertex u is the set of vertices at distance n from u. 


D51: [SmTucWal2] A graph X satisfies the Distinct Spheres Condition if there 
exists a vertex « € V(X) such that every pair y, z of vertices equidistant from x have 
distinct n-spheres. 


FACTS 


F94: [SmTucWal2] If a connected denumerable graph satisfies the Distinct Spheres 
Condition, then it is 2-distinguishable. 


F95: [SmTucWal2] A connected denumerable graph X satisfying any of the following 
conditions also satisfies the Distinct Spheres Condition and hence is 2-distinguishable: 


e X isa primitive graph with infinite diameter; 


e X is a denumerable connected graph such that the deletion of any vertex leaves 
at least two infinite components; 


e X is the cartesian product of two connected denumerable graphs of infinite diam- 
eter. 


F96: [WaZ07] Let T be an infinite tree. 


e If T has no 1-valent vertices and all valences are finite or countably infinite, then 
D(T) = 2. 


If T is locally finite and D(T) = Do < oo, then T contains a finite subtree Tp such 
that D(To) = Do. 


F97: [ImK1Tr07] The countable random graph has distinguishing number 2. 
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INTRODUCTION 

There are frequent occasions for which graphs with a lot of symmetry are required. 
One such family of graphs is constructed using groups. These graphs are called Cayley 
graphs and are the subject of this section. 


6.2.1 Construction and Recognition 


We restrict ourselves to finite graphs, which means we use finite groups, but the basic 
construction is the same for infinite groups. While Cayley graphs on finite groups and 
Cayley graphs on infinite groups share a variety of features, there are aspects of Cayley 
graphs on finite groups that do not carry over to Cayley graphs on infinite groups, and 
vice versa. 


DEFINITIONS 


D1: Let G be a finite group with identity 1. Let S be a subset of G satisfying 1 ¢S 
and S = S~!; that is, s € S if and only if s-! € S. The Cayley graph on G with 
connection set S, denoted Cay(G;S), satisfies these rules: 

e the vertices of Cay(G;S) are the elements of G; 

e there is an edge joining g,h € Cay(G;S) if and only if h = gs for some s € S. 


We note here that it is standard to use additive notation when G is an abelian group 
and multiplicative notation for nonabelian groups. Thus, for abelian groups, we have 
S = —S and g = h+s. Cayley graphs on the cyclic group Z,, are called circulant 
graphs and we use the special notation Cire(n; S). 


D2: The set of all Cayley graphs on a group G is denoted Cay(G). 
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EXAMPLES 


E1: The hypercube Q,, may be realized as a Cayley graph on the elementary abelian 
2-group Z5 using the standard generators €1,€2,...,€n for the connection set, where 
e; has a 1 in the i-th coordinate and zeroes elsewhere. We note here that Q,, may be 
realized in other ways as a Cayley graph, but the realization just given is the common 
one. 


E2: The complete graph K,, is a Cayley graph on any group G of order n, where the 
connection set is the set of non-identity elements of the group. We get the complement 
of K,, by using the empty set as the connection set. 


E3: The complete multipartite graph K,,,,, with m parts each of cardinality n, is 
realizable as a circulant graph of order mn with the connection set being all the elements 
not congruent to zero modulo n. 


E4: The graph formed on the finite field GF'(q), gq = 1(mod 4), where the connection 
set is the set of quadratic residues in GF(q), is called a Paley graph. Paley graphs 
have many interesting properties. 


E5: The circulant graph of even order n with connection set S = {+1,n/2} is known 
as the Mébius ladder of order n. 


E6: The Cayley graphs on the group Z?, where @ 2 3 and the connection set is the set 
of standard generators of the group, are of interest in computer science. 


Baa 


Figure 6.2.1: Two drawings of the Mobius ladder of order 8. 


DEFINITIONS 


D3: When G is a finite group and g € G, define gr acting on G by gr(h) = gh for 
allh € G. Clearly, gz is a permutation of the elements of G. Define the group Gz by 
Gr ={gr:g €G}. 


D4: A bijection f on the vertex set V(G) of a graph G is an automorphism of the 
graph G if (u,v) is an edge if and only if (f(u), f(v)) is an edge. 


D5: The set of all automorphisms of a graph G forms a group under function compo- 
sition and is denoted Aut(G). 


D6: A graph G is said to be vertex-transitive if Aut(G) acts transitively on V(G). 


D7: Let G bea transitive permutation group acting on a finite set 2. If G satisfies any 
one of the following three equivalent conditions, then it is said to be a regular action: 
e the only element of G fixing an element of 2 is the identity permutation; 
e |G] = |Q\; 
e for any w1,w2 € 2, there is a unique element g € G satisfying wig = we. 
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FACTS 
F1: Every Cayley graph is vertex-transitive. 
F2: (Sabidussi [Sa58]) A graph G is a Cayley graph if and only if Aut(G) contains a 


regular subgroup. 


REMARK 


R1: Sabidussi’s Theorem above is the basis for all work on recognizing whether or not 
an arbitrary graph is a Cayley graph. It is an absolutely fundamental result. 


6.2.2 Prevalence 


The family of Cayley graphs provides us with a straightforward construction for 
vertex-transitive graphs. A natural question to pose is whether or not the family of 
Cayley graphs encompasses all finite vertex-transitive graphs. The Petersen graph is 
the smallest vertex-transitive graph that is not a Cayley graph, which suggests the topic 
of this section. 


DEFINITION 


D8: Let NC denote the set of integers n for which there exists a non-Cayley vertex- 
transitive graph of order n. 


Figure 6.2.2: A non-Cayley vertex-transitive graph of order 26. 


EXAMPLE 


E7: It is easy to see that if m € NC, then any multiple of n belongs to NC. This 
follows by taking the appropriate number of vertex-disjoint copies of a non-Cayley, 
vertex-transitive graph of order n. Thus, in order to determine NC, it suffices to find 
the minimal elements belonging to NC. 
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FACTS 


The following results reduce the problem of trying to characterize membership in 
NC to the consideration of square-free integers. 


F3: <A prime power p© € NC if and only e 2 4. 


F4: Any positive integer, other than 12, divisible by a square is in NC. 


FACTS 


F5: Let p and q be distinct primes with p < gq. Then pq € NC if and only if one of the 
following holds: 


e p’ divides g—1; 

° q=2p-1>30rq= 24); 

e q=2'+1 and either p divides 2 —1 or p= 2°! —- 1; 
e g¢=2'—1and p=2*!41; and 

e p=7andq=11. 


F6: Let p and q be odd primes satisfying p < q. Then 2pq € NC if and only if one of 
the following holds: 

e p’ divides g—1; 

e 


= 1(mod 4) or gq = 1(mod 4); 


Pp 

Oe ee ae eck 
p = q=3(mod 4), p divides q — 1, and p? does not divide gq — 1; 
p =q = 3(mod 4), and p = a and 

e p=T7andq=19. 


F7: Let p,q,r be distinct odd primes satisfying p < q < r. Then par € NC if and only 
if at least one of pq, pr or gr is a member of NC, or none of pq, pr and qr is a member 
of NC but one of the following holds: 


© pgr = (22 +1)(2?"" +1), for some t; 


e pgr = (24+! + 1)(24 — 1), for some prime d; 


© pg = 2r +1 or pg = (r +1)/2; 
© pq = (r? + 1)/2 or pr = (¢ +1)/2; 
e pq = (r? — 1)/242 or pr = (q? — 1)/24x, where x € {1, 2,5}; 


e ab = 2' +1 and c divides 2° — 1, where {a,b,c} = {p,q,r}; 


the largest power of p dividing q—1 is p? and the largest power of qg dividing r— 1 
is qf, 
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e g= (3p4+1)/2 and r = 3p+ 2, org = 6p—1 andr =6p+1,; 
e qg=(r—1)/2 and p divides r +1, where p > q when p = (r + 1)/2; 


ep = (kY/? 4 1)/(k +1), q = (bil? — 1)/(k— 1), r = (R41 — 1)/(b — 1), where 
k,d—1,d/2 are primes and p > q may be the case; 


p= (KAD? + AY /(k +1), (ROY? — 1)/(k — 1), r = (k4 — 1)/(k — 1), where 
k,d,(d—1)/2 are primes and p > q may be the case; 


e p=k?—k+1,q=(k®—-1)/(k—1), r = (k” —1)/(k— 1), where k is prime; 


e p=3, q= (24 +1)/3, r = 24-1, where d is a prime; 


e p= (274+ 1)/3, q= 24-1, r = 27442 +1, where d = 2' + 1 is prime; 
e p=5,q=11 andr = 19; and 
°e p=7,q=73 and r = 257. 


RESEARCH PROBLEM 


RP1: Is there a number k > 0 such that every product of & distinct primes is in NC? 
No characterization of the members of NC that are products of four distinct primes is 
known. 


6.2.3. Isomorphism 


Some of the most interesting and deepest work on Cayley graphs has revolved around 
the question of trying to determine when two Cayley graphs are isomorphic. 


DEFINITIONS 


D9: A Cayley graph Cay(G;S) on G is called a CI-graph, if Cay(G;S) = Cay(G; S’) 
implies there exists an a € Aut(G) satisfying S’ = a(S). 


D10: We say that a group G is a CI-group if every Cayley graph on G is a Cl-graph. 


EXAMPLE 
E8: For n = 25, let 
S = {1,4,5,6,9, 11, 14, 16, 19, 20, 21, 24} 


and 
S’ = {1,4,6,9, 10, 11, 14, 15, 16, 19, 21, 24}. 


The two circulant graphs Circ(25;S) and Circ(25;S’) are isomorphic since both are 
wreath products of a 5-cycle with a 5-cycle. On the other hand, it is easy to see there 
is no a € Z3, for which S’ = aS is satisfied. Thus, Z25 is not a Cl-group. 
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FACTS 


F8: Let G be a Cayley graph on the finite group G. Then G is a Cl-graph if and only 
if all regular subgroups of Aut(G) isomorphic to G are conjugate in Aut(G). 


F9: [Li99] If G is a Cl-group, then G is solvable. 


F10: [Mu97] The cyclic group Z, is a Cl-group if and only if n = 2°m, where m is 
odd and square-free and e € {0,1,2}, or n € {8,9, 18}. 


F11: Let pa prime. The elementary abelian p-groups Z; are Cl-groups for 1 S e S 4. 
On the other hand, for e 2 2p -1+ easy Z,, is not a Cl-group. The latter inequality 
is sharp for p= 2. 


RESEARCH PROBLEM 


RP2: For an odd prime p, determine the values of e for which Zy, is a Cl-group. 


REMARK 


R2: One motivation for classifying Cl-groups is that the Cayley graphs on a Cl-group 
may be enumerated in a straightforward way using Pdlya’s enumeration theorem [Br64]. 
The next two theorems illustrate this for circulant graphs. 


DEFINITIONS 
D11: Let Z* denote the multiplicative group of units in the ring Zp. 


D12: Let n = pi'ps?---p;* be the factorization of n as a product of distinct prime 
powers. For any m satisfying 0 S m < n, let m(n) = (m1, mo,..., mz) be the t-tuple 
satisfying m; = m(mod p;') and 0 S m; < pi', i= 1,2,...4. 


D13: Let T(n) = {m(n): 0S m <n}. For a subset RC {1,2,...,t}, let Tr(n) be 
the elements of T(n) for which m; 4 0 if and only if i € T. When the value n under 
discussion is clear, we use the notation J and Tr. 


D14: Ifa = (a1, a2,...,a4) € Z, then the order-type of a is the t-tuple (di, d2,..., ds), 
where dj is the order of a; in Z}-. Note that a € Z, implies a; 4 0, i = 1,2,...,¢. 


D15: Let (di, do,...,d,) be the order-type of some a € Z*, and let R C {1,2,...,t}. 
If each d;, 1 € R, has the form d; = 2°b, where e 2 1 and b is odd, then let lem*(R) = 
Icm(R)/2. In all other cases, let leom*(R) = lcm(R), where lem(R) denotes the least 
common multiple of the d; terms in the coordinates corresponding to the elements of R. 


FACTS 


F12: If pis an odd prime, then the number of isomorphism classes of vertex-transitive 
graphs of order p is 
2 
ae @(q)Q(?-1)/2d 
pa Le | 


where the summation runs over all divisors d of (p — 1)/2 and ® denotes the Euler 
totient function. 
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F13: If n = pipo-:- pe is a product of distinct primes, or if pj = 4 and po,p3,..., Dt 
are distinct odd primes, then the number of isomorphism classes of circulant graphs of 


order n is 
—— S> Bd) ®(dy)--- (d,) T] 2171/2), 
n 
(di ,da,...,dz) R 


* 


where the sum is taken over all possible order types of a € Z*, and the product is 
taken over all non-empty subsets R of {1,2,...,t} unless p; = 4, in which case R = 9 
is included and |7g| is taken to be 2. 


EXAMPLE 


E9: We illustrate the preceding theorem for n = 20. The possible order-types are 


(1,2), (1,2), (1,4), (2,1), (2,2), and (2,4) 

There are, for example, two automorphisms of order type (1,4). Since p; = 4, the term 
corresponding to R = @ appears in the product, and the contribution is 2 since |7g| = 2 
and we consider lem*() = 1. For R = {1}, we have lem*(R) = 1 and |Tr| = 2 making 
a contribution of 2 to the product. For R = {2}, we have lem*(R) = 2 and |TpR| = 8, 
thereby contributing 27. Finally, when R = {1,2}, we have lem*(R) = 4 and |TR| = 8 
so the contribution is 2. Altogether the term in the product is 2°. We do the same 
thing for all possible order-types and find that there are 336 non-isomorphic circulant 
graphs of order 20. 


6.2.4 Subgraphs 


There are interesting results and questions regarding subgraphs of Cayley graphs. 
Some of the results we mention hold for all vertex-transitive graphs and we state them 
accordingly. It is not always clear just what impact vertex-transitivity has on the 
existence of certain subgraphs. 


DEFINITIONS 


D16: A connection set S is said to be a quasi-minimal connection set if the 
elements of S can be ordered 81, 2,..., 8; so that 
e if |s;| > 2, then 3° is either s;_1 or 5;4 1, and 
e if S; denotes the set {51, s9,...,5;}, then for each i such that |s;| = 2, (S;) is a 
proper supergroup of (S;-1), and for each 7 such that |s;| > 2 and o = $j-1, 
(S;) is a proper supergroup of (S;_2). 


FACTS 


F14: A graph G is said to be a Hamilton-connected graph if for any two vertices 
u,v of G, there is a Hamilton path whose terminal vertices are u and v. A bipartite 
graph G with parts A and B is said to be a Hamilton-laceable graph if for any u € A 
and v € B, there is a Hamilton path whose terminal vertices are u and v. 


F15: Let G be a connected vertex-transitive graph. If G has even order, then it has a 
1-factor. If G has odd order, then G — v has a 1-factor for every vertex v € G. 
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F16: IfG is aconnected vertex-transitive graph of degree d, then G is d-edge-connected. 


F17: [Ma71, Wa70] If G is a connected vertex-transitive graph of degree d, then y/d, 
where y denotes the vertex connectivity of G, strictly exceeds 2, Furthermore, for every 
€ > 0, there exists a connected vertex-transitive graph H of some degree d for which 
y/d<et 3. 


F18: If S is a quasi-minimal generating set of the group G, then the Cayley graph 
Cay(G;S) has vertex connectivity |S]. 


F19: For every positive integer m, there exists a Paley graph containing all graphs of 
order m as induced subgraphs. 


F20: Let G be a connected Cayley graph on a finite abelian group. If G is bipartite 
and has degree at least 3, then G is Hamilton-laceable. If G is not bipartite and has 
degree at least 3, then G is Hamilton-connected. 


F21: [Wi84] Every connected Cayley graph on a group of order p*, p a prime and 
e 2 1, has a Hamilton cycle. 


F22: [KMMMS12] Let p,q,r be distinct primes. A connected Cayley graph of any of 
the following orders is hamiltonian: kp with k < 32 and k 4 24, kpq with k < 5, kp? 
with k < 4, pqr, p® and 2p’. 


6.2.5 Factorization 


DEFINITIONS 
D17: A 1-factorization of a graph is a partition of the edge set into 1-factors. 


D18: We say the connection set S is a minimal generating Cayley set for G if S 
generates G, but S — {s,s~1} generates a proper subgroup for every s € S. 


D19: A Hamilton decomposition of a graph G is a partition of the edge set into 
Hamilton cycles when the degree is even, or a partition into Hamilton cycles and a 
1-factor when the degree is odd. 


D20: An isomorphic factorization of a graph G is a partition of the edge set of 
G so that the subgraphs induced by the edges in each part are pairwise isomorphic. 


FACTS 


F23: [St85] Every connected Cayley graph on the group G has a 1-factorization if one 
of the following holds: 


e |G| = 2* for an integer k; 
e G is an even order abelian group; or 
e G is dihedral or dicyclic. 
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F24: Cayley graphs whose connection sets are minimal generating Cayley sets have 
1-factorizations whenever the group is one of the following: 

e an even order nilpotent group; 

e the group contains a proper abelian normal subgroup of index 2”; or 

e the group has order 2”p" for a prime p satisfying p > 2”. 


F25: <A cubic Cayley graph G whose automorphism group has a solvable subgroup 
acting transitively on the vertex set of G has a 1-factorization. 


F26: [Li96, Lil3] If G = Cay(G,S) is a connected Cayley graph on an abelian group 
G, and if S is a minimal generating Cayley set, then G has a Hamilton decomposition 
with one small exception unsettled. Namely, when |G] is even, for each s € S, we must 
have that 2s does not lie in the subgroup generated by S — {+s}. 


F27: [Fi90] If T is any tree with n edges, then the n-dimensional cube Q,, has an 
isomorphic factorization by J. Furthermore, there is an isomorphic factorization so 
that each copy of T is an induced subgraph. 


RESEARCH PROBLEM 


RP3: Let C be one of the classes of circulant graphs, or Cayley graphs, or vertex- 
transitive graphs. Is it the case that for every G € C, whenever d divides |E(G)|, then 
there is an isomorphic factorization of G into d subgraphs? 


6.2.6 Miscellaneous 


Space limitations preclude discussion of several topics and we mention them briefly. 
We also include recommended further reading. 


EMBEDDINGS 

There is a long history and an extensive literature about embedding graphs on 
orientable and non-orientable surfaces. See Chapter 7 in this volume. The books 
[GrTu87, Ri74, Wh01] and a recent excellent survey [RSJTW05] provide a good starting 
point for this topic. 


APPLICATIONS 

There are a variety of meaningful applications of Cayley graphs and we mention only 
three. Circulant graphs appear in the study of circular chromatic number. For a recent 
survey see [Zh01]. 

Cayley graphs occur frequently in the literature on networks. A recent book on this 
topic is [Xu01] and a fundamental paper is [AkKr89]. 

Cayley graphs play a central role in the work on expanders. Two excellent references 
are [A195, Lu95]. 


FURTHER READING 


A survey on Cayley graph isomorphism is provided in [Li02]. A good general discus- 
sion about vertex-transitive graphs and Cayley graphs is [Ba95]. A good starting point 
for reading about NC is [IrPr01]. 
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INTRODUCTION 


It is often important to know how many graphs there are with some desired property. 
Computer scientists can use such numbers in analyzing the time or space requirements 
of their algorithms, and chemists can make use of these numbers in organizing and cata- 
loging lists of chemical molecules with various shapes. Indeed, any time that graphs are 
used to model some form of physical structure, the techniques of graphical enumeration 
can be extremely valuable. 

Many of the techniques for counting graphs are based on the master theorem in the 
historic 1937 work of George Pélya. See [PoRe87] for an English translation. Frank 
Harary [Ha55] and others exploited this master theorem in counting simple graphs, 
multigraphs, digraphs, and similar graphical structures. 

Tree counting began with Arthur Cayley [Ca57, Ca89], who was the first to use 
the word “tree” for these structures. Methods for counting trees representing chemi- 
cal compounds were developed by Blair and Henze [BlHe3la, BlHe31b]. Generic tree 
counting methods were advanced by Polya [PoRe87], Richard Otter [Ot48], Harary and 
Prins [HaPr59] and many others. 

An exhaustive survey of results in graphical enumeration, far beyond what can be 
included here, can be found in [HaPa73]. Alternatively, if you know the first few terms of 
a graph-counting sequence, you can quite likely find more terms, references, and further 
information in the On-Line Encyclopedia of Integer Sequences [OEIS]. 


6.3.1 Counting Simple Graphs and Multigraphs 


When counting graphs it is important to distinguish between the enumeration of 
labeled graphs and that of unlabeled graphs. Labeled graphs are relatively easy to count, 
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usually requiring only factorials, exponentials, and binomial coefficients. Unlabeled 
graphs require rather sophisticated counting techniques, often utilizing permutation 
group theory and generating functions. 


Labeled Graphs 


DEFINITION 


D1: A labeled graph is a graph with distinct labels, typically v1, v2,..., Un, assigned 
to its vertices. Two labeled graphs with the same set of labels are considered the same 
only if there is an isomorphism from one to the other that preserves the labels. 


EXAMPLES 


E1: Figure 6.3.1 shows the three isomorphically distinct simple graphs with 4 vertices 
and 3 edges. There are 4 essentially different ways to label each of the first two and 
12 ways to label the third. Thus there are 20 different labeled simple graphs with 4 
vertices and 3 edges. Only the last two graphs shown are connected. 


Figure 6.3.1: Simple graphs with 4 vertices and 3 edges. 


E2: Figure 6.3.2 shows the three isomorphically distinct loopless multigraphs that 
together with the graphs in Figure 6.3.1 form the six different multigraphs with 4 vertices 
and 3 edges. There are 6 essentially different ways to label the first and third graphs in 
Figure 6.3.2 and 24 ways to label the middle graph. Thus the graphs in Figures 6.3.1 
and 6.3.2 represent the total of 56 labeled loopless multigraphs with 4 vertices and 3 


edges. 
e e e o—_e 
Figure 6.3.2: Additional loopless multigraphs with 4 vertices and 3 edges. 


FACTS 


F1: The number of labeled simple graphs with n vertices and m edges is the binomial 


coefficient ( z ). These numbers form sequence A084546 in [OEIS]. See Table 6.3.1. 
m 


F2: For m> (5) /2, the number of labeled simple graphs with n vertices and m edges 
is the same as the number of labeled simple graphs with n vertices and C) —m edges. 
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F3: The total number of labeled simple graphs with n vertices is ofa), This is sequence 


A006125 in [OEIS]. See Table 6.3.1. 


WAM Us Dee Be = 5 6 7 8 
Oi Wie Hae Ai, 1 1 1 1 

1 ae ne 10 15 21 28 

2 3 15 45 105 210 378 

3 1 20 120 455 1,330 3,276 

4 15 210 1,365 5,985 20,475 

5 6 252 3,003 20,349 98,280 

6 1 210 5,005 54,264 376,740 

és 120 6,435 116,280 1,184,040 

8 45 6,435 203,490 3,108,105 

9 10 5,005 —-:293,930 6,906,900 

10 1 3,003 = 352,716 ~—-13,123,110 

11 1,365 352,716 ~—-.21,474,180 

12 455 293,930 30,421,755 

13 105 203,490 37,442,160 

14 15 116,280 40,116,600 
Total | 1 2 8 64 1,024 32,768 2,097,152 268,435,456 


Table 6.3.1: Labeled simple graphs with n vertices and m edges. 


F4:; [Gi56] The number K,, of connected labeled simple graphs with n vertices can be 
determined from the recursive formula 


= & 9) Lyme 
Aa), and Kp So — = Dok)? 27K, forn>1. 
heal 


This is sequence A001187 in [OEIS]. See Table 6.3.2. 


MW ake. De 83 4 5 6 7 8 


—= 


K, | 1 1 4 38 728 26,704 1,866,256 251,548,592 


Table 6.3.2: Connected labeled simple graphs with n vertices. 


F5: Asymptotically, most labeled simple graphs are connected. Thus the sequence K; 
satisfies ™ 
RK, ~ 2). 


F6: The number of labeled loopless multigraphs with n vertices and m edges is the 


binomial coefficient (er (3)-1 . When n = 1 this expression should be interpreted as 1 
when m = 0 and 0 otherwise. See Table 6.3.3. These numbers form sequence A098568 
in [OEIS]. 
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mG || b- 25 -3 4 5 6 7 8 
Ose Ae SE 1 1 1 1 1 
1 1. ie 6 10 15 21 28 
2 L3G, 72H 55 120 231 406 
3 1 10 56 220 680 171 4,060 
4 1 15 126 715 3,060 10,626 31,465 
5 1 21 252 2,002 11,628 53,130 —- 201,376 
6 1 28 462 5,005 38,760 230,230 1,107,568 


Table 6.3.3: Labeled loopless multigraphs with n vertices and m edges. 


Unlabeled Graphs 
DEFINITIONS 


D2: The symmetric group S, is the group of all n! permutations y acting on the 
set X, = {1,2,...,n}. 


D3: The order of a permutation group is the number of permutations it contains. 
The degree of a permutation group is the number of objects being permuted. The 


symmetric group S,, has order n! and degree n. 


D4: The cycle index Z(G) of a permutation group G of order m and degree d is a 


polynomial in variables aj, a2,...,@q given by the formula 
1 i 
Z(G@)=— > JJ at, 
yEG k=1 


where j,(y) is the number of cycles of length k in the permutation 7. For example, for 
G = $3 = {(1)(2)(3), (123), (132), (1)(23), (2)(13), (3)(12)}, the symmetric group of 
order 6 and degree 3, the cycle index is 


Z(G3) = = (af + 2a3 + 3a,a2). 


Dire 


D5: The pair permutation y°) induced by the permutation y acting on the set X;, 
is the permutation acting on unordered pairs of distinct elements of X,, defined by the 
rule 


y) ({x1, 22}) = {7(21), ¥(a2)}. 


D6: The symmetric pair group Ss?) induced by the symmetric group S$, is the 
permutation group {-y(2) | vE Sat This group, used in counting graphs, has order n! 
and degree n(n — 1)/2. 
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FACTS 


F7: The cycle index Z (s?)) of the symmetric pair group, used in counting graphs 
with n vertices, is 


(2) i 1s jak ,RI2k-+1 gcd(r,8)jrjs 
4(Sn°) ~ nl “Te ke ‘(aya ‘) *2k+1 “em(r,s) 


r<s 


where lcm(r, 5) and gcd(r, s) are the least common multiple and greatest common divisor 
of r and s, respectively. The sum is taken over all partitions (j) = j1,j2,---,jn of the 
integer n as an unordered sum of parts, where j;, is the number of parts of size k. For 
example, in the partition of 7 as 2+2-+3 we have jg = 2, 73 = 1, and 7) = j4 = J5 = 
je = j7 = 0. Explicit formulas for Z (s??) for small values of n are: 


Z(sP)) = 1 

Z(S;?) = a 

2(S) = 3 (a} +3a,a) +2a,) 

Z(s?) = ala + 933 + 8a3 + 6a20,) 

Z(s®?) = atat? 10ata3 + 20a, a3 + 15a3a$ + 30aa% + 20,0305 + 2402) 
Z(s?) = (als L5a7a4 + 40a3a4 + 60a?aS + 180a,a,a3 + 120a,a,a2a¢ 


car) 


+ 144a3 + 40a3 + 120a30§) 


F8: [Ha55, PoRe87] Let G,,, denote the number of simple graphs with n vertices and 
m edges, and let g,(x) be the generating function for n-vertex simple graphs, so that 


(5) 
= S- Grimx™ 
m=0 


Pélya’s enumeration theorem states that this generating function gp, (a ’) can be obtained 


from the cycle index Z(S;; iS )) by replacing each variable a; with 1+‘. See Table 6.3.4. 
These numbers form sequence A008406 in [OEIS]. 


F9: For m > (5)/2, the number of simple graphs with n vertices and m edges is the 
same as the number of simple graphs with n vertices and i) —m edges. 
F10: The total number G,, of simple graphs with n vertices is obtained from the cycle 


index Z (Ss?) by replacing each variable a; with the number 2. See Table 6.3.4. This is 
sequence A000088 in [OEIS]. 


F11: Asymptotically, the sequence Gy, satisfies Gy ~ (2 2) | n!. 
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HT || Le 2s Be A 6 ‘3 8 
Ob a De at 1 1 1 

1 Lt ste Oe al 1 i 1 

2 Ly Be 22 2 2 2 

3 i 3 4 5 5 5 

4 Be "6 9 10 11 

5 T. © 15 21 24 

6 te B ~-21 41 56 

7 4 24 65 115 

8 2 24 97 221 

9 1 21 131 402 

10 > gap 148 663 

i 9 148 980 

12 5 ti: 4319 

13 2 97 1,557 

14 1 65 1,646 
Total | 1 2 4 11 34 156 1,044 12,346 


Table 6.3.4: Simple graphs with n vertices and m edges. 


631 


F12: [Ca71] The enumeration of connected simple graphs requires an auxiliary sequence 


A, defined recursively by 


n-1 
Ay=1, and A,=nG,—S > Ag-Gn—~ forn>1. 
k=1 


This sequence 1, 3 ,7, 27, 106, 681, 5972, 88963, ... is sequence A003083 in [OEIS]. 
The number K,, of connected simple graphs with n vertices can then be computed as 


i 
Kn = 7 S7 u(d) Ansa, 
d| 


where the sum is over all divisors of n and ps is the Mobius function defined by 


1 ifn =0 
p(n) =< 0 if m?|n for some m > 1 
(-1)* ifn is the product of k distinct primes. 


See Table 6.3.5. The sequence K,, is sequence A001349 in [OEIS]. 


n|}]1l 2 3 +4 5 6 7 8 
Kk, |} 1 1 2 6 21 112 853 11,117 


Table 6.3.5: Connected simple graphs with n vertices. 


F13: Asymptotically, most simple graphs are connected. Thus the sequence K,, satis- 


fies Ky, ~ 22) in), 
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F14: [Ha55, PoRe87] Let M,,,, denote the number of loopless multigraphs with n 
vertices and k edges, and let m,,(a) be the generating function for n-vertex loopless 
multigraphs, so that 

(3) 


Mn(x) = a Mn px”. 
m=0 


Pélya’s enumeration theorem states that this generating function m,,(x) can be obtained 
from the cycle index Z (s)) by replacing each variable a; with the infinite series 1 + 
vi +a 47%! +.--. See Table 6.3.6. Column n = 3 is sequence A001399 in [OEIS]. 
Column n = 4 is sequence A003082; column n = 5 is sequence A014395; and column 


n = 6 is sequence A014396. 


m\n 1 2 3 4 5 6 
0; 1 1 1 1 1 1 
1 1 1 1 1 1 
2 1 2 3 3 3 
3 IE 3 6 7 8 
4 1 4 11 17 21 
5 1 5 18 35 52 
6 1 7 32 76 132 
vA 1 8 48 149 313 
8 1 10 75 291 741 
9 1 12 111 539 ~— 1,684 

10 1 14 160 974 ~~ 3,711 


Table 6.3.6: Loopless multigraphs with n vertices and m edges. 


6.3.2 Counting Digraphs and Tournaments 
Labeled Digraphs 


DEFINITIONS 


D7: A labeled digraph is a digraph with distinct labels, typically v1, v2,...,Un, 
assigned to its vertices. Two labeled digraphs with the same set of labels are considered 
the same only if there is an isomorphism from one to the other that preserves the labels. 


D8: A tournament (or round-robin tournament) is a digraph in which, for each 
pair u, v of distinct vertices, either there exists an arc from u to v or an arc from v to 
wu but not both. 


D9: A digraph is strong (or strongly connected) if for each pair u, v of vertices, 
there exist directed paths from u to v and from v to u. A strong tournament is also 
called an irreducible tournament. 
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EXAMPLES 


E3: Figure 6.3.3 shows the four isomorphically distinct simple digraphs with 3 vertices 
and 3 arcs. The last two are tournaments. There are 6 essentially different ways to 
label each of the first three digraphs and 2 ways to label the fourth. Thus there are 20 
different labeled simple digraphs with 3 vertices and 3 arcs. Only the last digraph is 
strong—an irreducible tournament. 


&& AA 


Figure 6.3.3: The four simple digraphs with 3 vertices and 4 arcs. 


E4: Figure 6.3.4 shows the four isomorphically distinct tournaments with 4 vertices. 
There are 24 essentially different ways to label the first and last tournaments, and 8 
ways to label each of the middle two. Thus there are 64 different labeled tournaments 
with 4 vertices. Only the last tournament is strong. 


Xt Xt dt 


Figure 6.3.4: The four tournaments with 4 vertices. 
FACTS 


F15: The number of labeled simple digraphs with n vertices and m arcs is the binomial 
coefficient Ceres See Table 6.3.7. These numbers form sequence A123554 in [OEIS]. 


m 


m\ n 1 2 3 4 5 
0 1 1 1 1 1 

1 2 6 12 20 

2 tr 15 66 190 

3 20 220 1,140 

4 15 495 4,845 

5 6 792 15,504 

6 1 924 38,760 

7 792 77,520 

8 495 125,970 

9 220 167,960 

10 66 184,756 
Total | 1 4 64 #£444,096 1,048,576 


Table 6.3.7: Labeled simple digraphs with n vertices and m arcs. 
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F16: For m > n(n—1)/2, the number of labeled simple digraphs with n vertices 
and m arcs is the same as the number of labeled simple digraphs with n vertices and 
n(n — 1) — m arcs. 


F17: The total number of labeled simple digraphs with n vertices is 2“\"—)). See 
Table 6.3.7. This is sequence 4053763 in [OEIS]. 


F18: The number of labeled tournaments with n vertices is gfe). the same as the 
number of labeled simple graphs with n vertices. See Table 6.3.8. This is sequence 
A006125 in [OEIS]. 


F19: [MoMo62] The number Si, of strong labeled tournaments with n vertices can be 
computed from the recursive formula 


n-1 
— S n rae — 
S,=1, and sn = 2 57 (fa! Ee. forn > 1. 


See Table 6.3.8. This is sequence A054946 in [OEIS]. 


n | Labeled Tournaments | Strong Labeled Tournaments 
1 1 1 
2 2 0 
3 8 2 
4 64 24 
5 1,024 544 
6 32,768 22,320 
7 2,097,152 1,677,488 
8 268,435,456 236.522,496 


Table 6.3.8: Labeled tournaments and strong labeled tournaments with n vertices. 


F20: Asymptotically, most labeled tournaments are strong. Thus the sequence Sn 


counting strong labeled tournaments satisfies Sn ~ (2), 


Unlabeled Digraphs 


DEFINITIONS 


D10: The ordered pair permutation y"! induced by the permutation ¥ acting on the 
set X, = {1,2,... ,n} is the permutation acting on ordered pairs of distinct elements 
of X,, defined by the rule 


y) ((a1,22)) = (7(#1),1(#2))- 


D11: The reduced ordered pair group sl induced by the symmetric group S,, is 
the permutation group { yl? | ve Sn}. This group, used in counting digraphs, has 
order n! and degree n(n — 1). 
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FACTS 


F21: The cycle index Z (sl? 1) of the reduced ordered pair group, used in counting 
digraphs with n vertices, is 


(k—-1)jn+2k(2 cd(r,8)jrds 
Sn => qemu ld baal Py dieteae 


r<s 


where lcm(r, 5) and gcd(r, s) are the least common multiple and greatest common divisor 
of r and s, respectively. The sum is taken over all partitions (j) = j1,J2,---,jn of the 
integer n as an unordered sum of parts, where j, is the number of parts of size k. See 
Fact F7 for an example of a partition. Explicit formulas for 7(S;, ? 1) for small values of 
n are: 


+a) 


6 438 + 2a3) 


029 + 10aSa3 + 20a2a8 + 15a}? + 3005 + 20aza2a2 + 2404) 


a2? + 15al7a2 + 40a%a? +.4502a)* + 90a7a! + 120aza5az 


Ql Ale Bl Rp S)e [rR 7 


( 
( 
(aj? + 6aja3 + 8a3 + 3a$ + 6a}) 
( 
( 


+ 144a$ + 15a}° + 90a2aj + 40a5° + 120a8) 


F22: [Ha55, PoRe87] Let Dy», denote the number of simple digraphs with n vertices 
and m arcs, and let d,(x) be the generating function for n-vertex simple digraphs, so 


that 
n(n—1) 


= y Diet 
m=0 


Pélya’s enumeration theorem states that this generating function d,, (a ’) can be obtained 


from the cycle index Z ($7, P 1) by replacing each variable a; with 1+ x‘. See Table 6.3.9. 
These numbers form sequence A052283 in [OEIS]. 


F23: For m > n(n —1)/2, the number of simple digraphs with n vertices and m arcs 
is the same as the number of simple digraphs with n vertices and n(n — 1) — m arcs. 


F24: The pila nemRE D,, of simple digraphs with n vertices is obtained from the 


cycle index Z (si 21) by replacing each variable a; with the number 2. See Table 6.3.9. 
This is sequence A000273 in [OEIS]. 


F25: Asymptotically, the sequence D,, satisfies Dy ~ 2™"—) /nl. 
F26: [Da54] The number T,, of tournaments with n vertices is given by the formula 


D(j) 


/ 4 
es ees Tha! 
(3) k 


636 Chapter 6. Algebraic Graph Theory 


m\n 1 2 3 4 5 
0o;1 1 1 1 1 

1 1 1 1 1 

2 1 4 5 5 

3 4 13 16 

4 4 27 61 

5 1 38 154 

6 ol 48 379 

7 38 707 

8 27 ~=-:1,155 

9 13 ~—-:1,490 

10 5 1,670 
Total | 1 3 £4.16 218 9,608 


Table 6.3.9: Simple digraphs with n vertices and m arcs. 


where the sum is over all partitions (j) of n into odd size parts, and where 


D(j) a ; (>: oe ged(r, S8)IrJs 7 > x) . 


r=1 s=1 k=1 
See Table 6.3.10. This is sequence A000568 in [OEIS]. 


F27: [Wr70] The number S,, of strong tournaments with n vertices can be determined 
by the recurrence relation 


n-1 
S,=1, and S,=T,- S- To. Se for nS bt, 
k=1 


where TJ), is the number of tournaments from Fact F26 above. See Table 6.3.10. Note 
that there are no strong tournaments with exactly two vertices. This is sequence 
A051337 in [OEIS]. 


F28: Asymptotically, most tournaments are strong. Thus the sequence S,, counting 
strong tournaments satisfies S;, ~~ 2”"—) /nl. 


6.3.3. Counting Generic Trees 


When counting generic trees, we must be careful to distinguish among labeled trees, 
rooted trees, unlabeled trees, and various other species. While labeled trees can be 
counted easily, unlabeled trees, both rooted and unrooted, are counted using generating 
functions. 
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n Tournaments Strong Tournaments 
1 1 1 
2 1 0 
3 2 1 
4 4 1 
5 12 6 
6 56 35 
7 456 393 
8 6,880 6,008 
9 191,536 178,133 

10 9,733,056 9,355,949 
11 903,753,248 884,464,590 
12 | 154,108,311,168 152,310,149,735 


Table 6.3.10: Tournaments and strong tournaments with n vertices. 


DEFINITIONS 


D12: A labeled tree is a tree in which distinct labels, typically v1, v2,..., Un, have been 
assigned to the vertices. Two labeled trees with the same set of labels are considered 
the same only if there is an isomorphism from one to the other that preserves the labels. 


D13: A rooted tree is a tree in which one vertex, the root, is distinguished. Two 
rooted trees are considered the same only if there is an isomorphism from one to the 
other that maps the root of the first to the root of the second. 


D14: A rooted labeled tree is a labeled tree in which one vertex, the root, is distin- 
guished. Two rooted labeled trees with the same set of labels are considered the same 
only if there is an isomorphism from one to the other that preserves the labels and maps 
the root of the first to the root of the second. 


D15: A reduced tree (or homeomorphically reduced tree) is a tree with no vertices 
of degree 2. These trees are sometimes called irreducible trees. 


EXAMPLES 
E5: Figure 6.3.5 shows the three isomorphically distinct trees with 5 vertices. There 


are 60 essentially different ways to label each of the first two and 5 essentially different 
ways to label the third. Thus there are 125 different labeled trees with 5 vertices. 


Figure 6.3.5: The three trees with 5 vertices. 
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E6: There are 3 essentially different ways to root the first tree in Figure 6.3.5, 4 
essentially different ways to root the second, and 2 essentially different ways to root the 
third. Thus there are 9 rooted (unlabeled) trees with 5 vertices. 


E7: Each of the 125 labeled trees discussed in Example E5 can be rooted at any of its 
five vertices, yielding 625 possible rooted labeled trees. 


E8: The third tree in Figure 6.3.5 is the only reduced tree with 5 vertices. 


FACTS 


F29: Cayley’s formula [Ca89]|: The number of labeled trees with n vertices is n"~?. 
See Table 6.3.11. This is sequence A0000272 in [OEIS]. 


F30: The number of rooted labeled trees with n vertices is n”~!. 
This is sequence A000169 in [OEIS]. 


See Table 6.3.11. 


n Labeled Trees Rooted Labeled Trees 
1 1 1 
2 1 2 
3 3 9 
4 16 64 
5 125 625 
6 1,296 7,776 
7 16,807 117,649 
8 262,144 2,097,152 
9 4,782,969 43,046,721 

10 100,000,000 1,000,000,000 

11 2,397,947,691 25,937,424,601 

12 61,917,364,224 743,008,370,688 

13 1,792,160,394,037 23,298 ,085,122,481 

14 56,693,912,375,296 793,714,773,254,144 

15 1,946,195,068,359,375 29,192,926,025,390,625 

16 | 72,057,594,037,927,936 1,152,921 504,606,846 ,98032 


Table 6.3.11: Labeled trees and rooted labeled trees with n vertices. 


F31: [Ca57] Let R, denote the number of (unlabeled) rooted trees with n vertices, 
and let r(a) be the generating function for rooted trees, so that 
r(z) = > Raw” =x 40° +203 + 4a + 90° + 2008 $+. 
n=1 


The coefficients R, of this generating function can be determined by means of the 


recurrence relation 
[oe 


r(x) =2 [[c = ah" Pe, 


k=1 
See Table 6.3.12. This is sequence A000081 in [OEIS]. 
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F32: 


n Rooted Trees ‘Trees Reduced Trees 
1 1 1 1 
2 1 1 1 
3 2 1 0 
4 4 2 1 
5 9 3 1 
6 20 6 2 
7 48 11 2 
8 115 23 4 
9 286 47 5 

10 719 106 10 

11 1,842 235 14 

12 4,766 551 26 

13 12,486 1,301 42 

14 32,973 3,159 78 

15 87,811 7,741 132 

16 235,381 19,320 249 

17 634,847 48,629 445 

18 1,721,159 123,867 842 

19 4,688,676 317,955 1,561 

20 12,826,228 823,065 2,988 

21 35,221,832 2,144,505 5,671 

22 97,055,181 5,623,756 10,981 

23 268,282,855 14,828,074 21,209 

24 743,724,984 39,299,897 41,472 

25 2,067,174,645 104,636,890 81,181 

26 5,759,636,510 279,793,450 160,176 

Qf. 16,083,734,329 751,065,460 316,749 

28 45,007,066,269 2,023,443,032 629,933 

29 126,186,554,308 5,469,566,585 1,256,070 

30 354,426,847,597 14,830,871,802 2,515,169 

31 997,171,512,998 40,330,829,030 5,049,816 

32 2,809,934,352,700 109,972,410,221 10,172,638 

33 7,929,819,784,355 300,628,862,480 20,543,579 

34 22,409,533,673,568 823,779,631,721 41,602,425 

35 63,411,730,258,053 2,262,366,343,746 84,440,886 

36 179,655,930,440,464 6,226,306,037,178 171 794,492 

37 509,588,049,810,620 | 17,169,677,490,714 350,238,175 

38 1,447,023,384,581,029 | 47,436,313,524,262 715,497,037 


Table 6.3.12: Rooted trees, trees, and reduced trees with n vertices. 


and let t(x) be the generating function for trees, so that 


Co 
t(z) = S > T,2” =2+0? +203 + 2a* + 32° + 60°+---. 


n=1 


639 


Otter’s formula [Ot48]: Let T,, denote the number of trees with n vertices, 
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The coefficients T;, of this generating function t(x) can be determined from the gener- 
ating function r(x) for rooted trees in Fact F31 above by using the formula 


t(x) = r(x) — § (r(x) — r(x?)). 
See Table 6.3.12. This is sequence A000055 in [OEIS]. 
F33: Counting reduced trees requires an auxiliary sequence Q@,, with generating func- 
tion q(x), so that 
q(t) => Qua® =a +0 +04 + 2x° + 30° + 6x7 +1028 +---. 
k=1 
The coefficients Q; of this generating function can be determined from the recurrence 
relation 


a2) = [Ja-2)*. 


+2 
This is sequence A001678 in [OEIS]. 
F34: [HaPr59] Let H,, denote the number of reduced trees with n vertices, and let 


h(a) be the generating function for reduced trees, so that 


[oe) 
h(c) = 0 Hyw” =a +4+0° +044 0° + 20° 4 227 + 4n% +... 


n=1 


The coefficients H,, of this generating function h(a) can be determined from the auxiliary 
function g(a) in Fact F33 above by using the formula 


nie) = (1+ a)ate) - ($*) eee) + (5) ates 


See Table 6.3.12. Note that there are no reduced trees with exactly 3 vertices. This is 
sequence A000014 in [OEIS]. 


6.3.4 Counting Trees in Chemistry 


DEFINITIONS 
D16: A 1-4 tree is a tree in which each vertex has degree 1 or 4. 


D17: A 1-rooted 1-4 tree is a 1-4 tree rooted at a vertex of degree 1. 


REMARKS 


R1: The 1-4 trees model many types of organic chemical molecules such as saturated 
hydrocarbons or alkanes. These molecules have the chemical formula C,H2n+2 and 
consist of n carbon atoms of valence 4 and 2n + 2 hydrogen atoms of valence 1. 


R2: The l-rooted 1-4 trees model the monosubstituted hydrocarbons such as the 
alcohols with the chemical formula C,H2,410H and consisting of n carbon atoms, 2n+1 
hydrogen atoms, and an OH group. 
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Figure 6.3.6: The three 1-4 trees with 5 vertices of degree 4. 


EXAMPLES 


E9: Figure 6.3.6 shows the three different 1-4 trees with 5 vertices of degree 4 and 12 
vertices of degree 1. 


E10: The first 1-4 tree in Figure 6.3.6 can be rooted at a vertex of degree 1 in 3 
essentially different ways, the second in 4 essentially different ways, and the third in 
essentially only 1 way. Thus there are 8 different 1-rooted 1-4 trees with 5 vertices of 
degree 4. 


FACTS 
F35: A 1-4 tree with n vertices of degree 4 always has 2n + 2 vertices of degree 1. 


F36: [BlHe3la] Let A, denote the number of l-rooted 1-4 trees with n vertices of 
degree 4, and let a(x) be the generating function for the number of 1-rooted 1-4 trees, 
so that 


a(x) = S> Ana” =1+a+a° + 20° + 4e* + 80° +1708 +---. 


n=0 


The coefficients A, of this generating function a(x) can be determined from the recur- 
rence relation 
a(z)=1+4 (a3(z) + 3a(x)a(x7) + 2a(x*)). 


See Table 6.3.13. This is sequence A000598 in [OEIS]. 


F37: Counting (unrooted) 1-4 trees requires first counting 1-4 trees rooted at a vertex 
of degree 4. Let G,, be the number of 4-rooted 1-4 trees with n vertices of degree 4, and 
let g(x) be the generating function for the number of 4-rooted 1-4 trees, so that 


g(x) = S°Gpx” =a +27 + 20° + 404 + 90° + 182% +--+ 
n=1 
The coefficients G', of this generating function g(x) can be obtained by using the formula 
g(x) = & (a4(x) + 6a? (x)a(x?) + 8a(x)a(x?) + 3a?(x?) + 6a(x*)), 


where a(x) is the generating function for 1-rooted 1-4 trees from Fact F36. 
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1-Rooted 1-4 Trees 1-4 Trees 

n (Alcohols) (Alkanes) 
1 1 1 
2 1 1 
3 2 1 
4 4 2 
5 8 3 
6 17 5 
7 39 9 
8 89 18 
9 211 35 
10 507 75 
11 1,238 159 
12 3,057 355 
13 7,639 802 
14 19,241 1,858 
15 48,865 4,347 
16 124,906 10,359 
17 321,198 24,894 
18 830,219 60,523 
19 2,156,010 148,284 
20 5,622,109 366,319 
21 14,715,813 910,726 
22 38,649,152 2,278,658 
23 101,821,927 5,731,580 
24 269,010,485 14,490,245 
25 712,566,567 36,797,588 
26 1,891,993,344 93,839,412 
27 5,034,704,828 240,215,803 
28 13,425,117,806 617,105,614 
29 35,866,550,869 1,590,507,121 
30 95,991,365,288 4,111,846,763 
31 257 332,864,506 10,660,307,791 
32 690,928,354,105 27,711,253,769 
33 1,857,821,351,559 72,214,088,660 
34 5,002,305,607,153 188,626,236,139 
35 13,486,440,075,669 493,782,952,902 
36 36,404,382,430,278 | 1,295,297,588,128 
37 98,380,779,170,283 | 3,404,490,780,161 
38 | 266,158,552,000,477 | 8,964,747,474,595 


Table 6.3.13: 1-Rooted 1-4 trees and 1-4 trees with n vertices of degree 4. 


F38: [BlHe31b] Let B, denote the number of (unrooted) 1-4 trees with n vertices of 
degree 4, and let b(a) be the generating function for 1-4 trees, so that 


B(x) = 5° Ba” =1+e+e27 42° +204 + 30° + 508+... ; 


n=0 
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The coefficients B,, of this generating function b(x) can be determined from the functions 
a(az) and g(a) from facts F36 and F37, respectively, by using the formula 


b(a) = g(s) + a(x) — 3 (a2(w) — a(e*)). 


See Table 6.3.13. This is sequence A000602 in [OEIS]. 


6.3.5 Counting Trees in Computer Science 


DEFINITIONS 


D18: An ordered tree is recursively defined as consisting of a root vertex and a 
sequence t,,to,..., tm of m > 0 principal subtrees that are themselves ordered trees. 
The root vertex of an ordered tree is joined by an edge to the root of each principal 
subtree. 


D19: A binary tree consists of a root vertex and at most two principal subtrees that 
are themselves binary trees. Each principal subtree must be specified as either the left 
subtree or the right subtree. 


D20: The children of the root vertex of an ordered tree or a binary tree are the roots 
of the principal subtrees. 


D21: A left-right tree is a binary tree in which each vertex has either 0 or 2 children. 


EXAMPLES 


E11: Figure 6.3.7 shows the 5 ordered trees with 4 vertices. 


oe he 


Figure 6.3.7: The 5 ordered trees with 4 vertices. 


E12: Figure 6.3.8 shows the 5 binary trees with 3 vertices. 


f<* F% 


Figure 6.3.8: The 5 binary trees with 3 vertices. 
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E13: Figure 6.3.9 shows the 5 left-right trees with 7 vertices. 


Pan’ p 


Figure 6.3.9: The 5 left-right trees with 7 vertices. 
REMARKS 


R3: In computer science, trees are usually drawn with the root at the top. 


R4: Ordered trees are used to represent structures such as family trees, showing all 
descendants of a person represented by the root. The roots of the principal subtrees 
represent the children of the root person, in order of birth. 


R5: Binary trees are some of the tree structures most easily represented in a computer. 
Other types of trees are often converted into binary trees for computer representation. 


R6: Left-right trees are frequently used to represent arithmetic expressions, in which 
the leaves correspond to numbers and the other vertices represent binary operations 
such as+,—,X,or-+. 


FACTS 


F39: The Catalan numbers C,, can be written as 


(2n)! 


for n > 0. 


1 2n 
va aa) ( 


n+1)!n! 


See Table 6.3.14. This is sequence A000108 in [OEIS]. 


n Catalan Number n Catalan Number 
1 1 17 129,644,790 
2 2 | 18 477,638,700 
3 5 | 19 1,767,263,190 
4 14 | 20 6,564,120,420 
5 42 | 21 24,466,267 ,020 
6 132 | 22 91,482,563,640 
7 429 | 23 343,059,613,650 
8 1,430 | 24 1,289,904,147,324 
9 4,862 | 25 4,861,946,401,452 

10 16,796 | 26 18,367,353,072,152 

11 58,786 | 27 69,533,550,916,004 

12 208,012 | 28 263,747,951,750,360 

13 742,900 | 29 1,002,242,216,651,368 

14 2,674,440 | 30 3,814,986,502,092,304 

15 9,694,845 | 31 14,544,636,039,226,909 

16 35,357,670 | 32 | 55,534,064,877,048,198 


Table 6.3.14: The Catalan numbers. 
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F40: The number of ordered trees with n vertices is the Catalan number C,,_1. See 
Table 6.3.14. 


F41: The number of binary trees with n vertices is the Catalan number C;,,. See Table 
6.3.14. 


F42: The number of left-right trees with 2n + 1 vertices is also C),. See Table 6.3.14. 
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INTRODUCTION 


Electrical circuit theory is one of the earliest applications of graph theory to a prob- 
lem in physical science. The dynamic behavior of an electrical circuit is governed by 
three laws: Kirchhoff’s voltage law, Kirchhoff’s current law, and Ohm’s law. Each ele- 
ment in a circuit is associated with two variables, namely, the current variable and the 
voltage variable. Kirchhoff’s voltage law requires that the algebraic sum of the voltages 
around a circuit is zero, and Kirchhoff’s current law requires that the algebraic sum 
of the currents across a cut is zero. Thus, circuits and cuts define a linear relation- 
ship among the voltage variables and a linear relationship among the current variables, 
respectively. It is for this reason that circuits, cuts, and the vector spaces associated 
with them have played a major role in the discovery of several fundamental properties 
of electrical circuits arising from the structure or the interconnection of the circuit ele- 
ments. Several graph theorists and circuit theorists have immensely contributed to the 
development of what we may now call the structural theory of electrical circuits. The 
significance of the results to be presented in this section goes well beyond their applica- 
tion to circuit theory. They will bring out the fundamental duality that exists between 
circuits and cuts and the influence of this duality on the structural theory of graphs. 
Most of the results in this section are also relevant to the development of combinatorial 
optimization theory as well as matroid theory. 
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6.4.1 Basic Concepts and Definitions 


Although the terms node and oriented graph are commonly used in electrical circuit 
theory, we use the terms vertex and directed graph along with all the other basic termi- 
nology of graph theory established in Chapter 1. For the sake of completeness, we begin 
with a review of certain basic concepts and definitions. For concepts not discussed here, 
the reader is referred to [GrYe06] and [ThSw92]. 

NOTATION: Unless otherwise specified, G = (V, £) is a graph (or digraph) with n ver- 
tices, V = {v1,v2,..-,Un}, and m edges, E = {e1,e€2,...em}. 

NOTATION: If vertices v; and v; are the endpoints (or end vertices) of an edge then, 
when there is no ambiguity, we denote that edge by the ordered pair (v;, v;). 


DEFINITIONS 


D1: A graph is called a trivial graph if it has only one vertex and no edge. A graph 
with no edges is called an empty graph. A graph with no vertices and hence no edges 
is called a null graph and will be denoted by @. 


REMARK 


R1: In this section we consider only graphs in which all edges have two distinct 
endpoints (i.e., no self-loops). 


EXAMPLE 


E1: Examples E1 through E9 in this section refer to the graph shown in Figure 6.4.1. 


Figure 6.4.1: A graph used in several examples. 


Subgraphs and Complements 


DEFINITIONS 


D2: A graph G’ = (V’, E’) is called a subgraph of graph G = (V,E)if V’ CV,E’ CE 
and V’ contains all the endpoints of all the edges in E’. 


D3: Each subset E’ C E defines a unique subgraph G’ = (V’, E’) of graph G = (V, E), 
where V’ consists of only those vertices which are the endpoints of the edges in E’. The 
subgraph G’ is called the induced subgraph of G on the edge set E’. Note that an 
edge-induced subgraph will not have isolated vertices. 
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D4: Each subset V’ C V defines a unique subgraph G’ = (V’, E’) of graph G = (V, E), 
where E’ consists of those edges whose endpoints are in V’. The subgraph G’ is called 
the induced subgraph of G on the vertex set V’. Note that a vertex-induced 
subgraph may have isolated vertices. 


D5: Given a subgraph G’ = (V’,E’) of graph G = (V,E), the subgraph G” = 
(V, E — E’) is called the (edge-)complement of G' in G. 
EXAMPLES 


E2: For the set E’ = {e1, e3, eg}, the corresponding edge-induced subgraph of graph G 
in Figure 6.4.1 is shown in Figure 6.4.2(a). For the set V’ = {v1,v2, va}, the correspond- 
ing vertex-induced subgraph of G is shown in Figure 6.4.2(b). 


v1 v1 
eal 3 ei 
v2 V3 V2 
Vs o—.®: @v. 
(a) An edge-induced subgraph of the graph G (b) A vertex-induced subgraph of the graph G 


Figure 6.4.2: An edge-induced subgraph and a vertex-induced subgraph. 


E3: The complement of the subgraph G’ of Figure 6.4.3(a) in the graph G of Figure 
6.4.1 is shown in Figure 6.4.3(b). 


V1 Vi 
3 ial 
v2 V3 v2 @” 
4 e7 = 
eo 
es 
V5 V4 Vs V4 
(a) Subgraph G (b) Complement of G' in the graph G 


Figure 6.4.3: A subgraph G’ and its complement in G. 


Components, Spanning Trees, and Cospanning Trees 


DEFINITIONS 


D6: A closed trail is a closed walk with no repeated edges. 
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TERMINOLOGY: A closed trail is also called a circ, which we formally state in Defini- 
tion D20 and use thereafter. 


D7: A circuit is a closed trail with no repeated vertices except the initial and terminal 
ones. 


TERMINOLOGY: Several authors use the term cycle instead of circuit. In electrical circuit 
literature, the term circuit is commonly understood as defined in Definition D7. 


D8: A graph G is connected if there is a path between every pair of vertices of G. 


D9: A maximal connected subgraph of a graph is called a component of the graph. 
An isolated vertex is by itself considered a single component. 


D10: A tree of a graph G is a connected subgraph containing no circuits. If a tree of 
a connected graph G contains all the vertices of G then it is called a spanning tree of 


G. The complement of a spanning tree T in G is called a cospanning tree of G. 


D11: A spanning forest of a non-connected graph G with p components is a collection 
of p spanning trees, one for each component. 


D12: The edges of a spanning tree T are called the branches of T. The edges of a 
cospanning tree are called the chords of the spanning tree. 


D13: Let G be an n-vertex graph with m edges and p components. The rank p(G) 
and nullity (G) of G are given by p(G) = n—p and u(G) = m—n-p. 
EXAMPLE 


E4: A spanning tree T and the corresponding cospanning tree of the connected graph G 
of Figure 6.4.1 are shown in Figure 6.4.4. 


V1 Vi 
a C3 
v2 V3 V2@ V3 
e2 
4 e7 6 
V5 V4 V5 V4 
es 
(a) A spanning tree T of G (b) The cospanning tree with respect to T 


Figure 6.4.4: A spanning tree and corresponding cospanning tree of graph G. 


FACTS 


F1: There is exactly one path between any two vertices of a spanning tree. 
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F2: A spanning tree of a connected n-vertex graph has n—1 branches and a cospanning 
tree has m—n+1 chords. A spanning forest of a graph having p components has n — p 
branches and m — n + p chords. 


REMARK 


R2: Unless stated otherwise, all graphs G considered in this section are connected. 


Cuts and Cutsets 


DEFINITIONS 


D14: Consider a connected graph G = (V, FE). Let Vi and V2 be two disjoint subsets 
of V such that V = Vi UV) (ie., Vi and V2 form a partition of V). Then the set of all 
those edges of G having one end vertex in V; and the other in V2 is called a cut of G. 
This cut is denoted as (Vi, V2). The set of edges incident on a vertex forms a cut, and 
is called an incidence set. 


D15: Removal of the edges in a cut from a connected graph G will disconnect the 
graph. In other words, the resulting graph will have at least two components. A cut 
of a connected graph is called a cutset if the removal of the edges in the cut results 
in a non-connected graph with exactly two components. Equivalently, a cutset of a 
connected graph is a minimal set of edges whose removal disconnects the graph. 


EXAMPLE 


E5: For the graph G in Figure 6.4.1, the cut (Vi, V2), where Vj = {v1, v3, U5} and 
V2 = {v2,v4} consists of the edges e1,e2,e4,e7, and eg, as shown in Figure 6.4.5(a). 
Removing these edges results in a non-connected graph with three components. So, 
(Vi, V2) is not a cutset. A cutset consisting of the edges e4,e5,eg, and e7 is shown 
in Figure 6.4.5(b). Removing these edges results in a non-connected graph with two 
components. 


(a) Accut of the graph G (b) A cutset of the graph G 


Figure 6.4.5: A cut and a cutset of the graph G. 
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Vector Space of a Graph under Ring Sum of Its Edge Subsets 
DEFINITIONS 


D16: Suppose the edge-set of a graph G is E = {e1,e2,€3,...,€m}. Then each subset 
E” of E can be represented by a binary m-vector in which the ith component is 1 
if and only if the edge e; is in £’. For example, the binary vector (1,0,0,1,1,1,0,0) 
represents the edge subset {e1, e4, e5,e6} of the graph G of Figure 6.4.1. 


D17: The ring sum (or symmetric difference) of two sets FE, andE2, denoted as 
Fi, ® Eg, is the set of those edges which belong to FE; or to Ey but not to both Fy 
and Ep. 


D18: The ring sum of two m-vectors 
X = (%1,%2,03,..-,Li,;---;%m) and Y = (y1, Y2,Y3,---5Yis--+5Ym) 
is the vector 
Z = (21, 22, 23, ++ +5 Ziy- ++) 2m); 


where z; = x; ® y;, and where ® is the logical exclusive-or operation (i.e., 1 ®0 = 
1, 0@1=1; 0@0=0; and1@1=0). 


FACT 


F3: The m-vector representing the ring sum of two subsets of edges is the ring sum of 
the m-vectors representing these edge subsets. The set of m-vectors representing all the 
2™ edge subsets of a graph G (including the null set) forms an m-dimensional vector 
space over GF(2), the field of integers modulo 2, under the ring sum operation ©. 


NOTATION: This vector space of edge subsets of a graph G (and hence of the corre- 
sponding edge-induced subgraphs of G') is denoted by U(G). 


REMARKS 

R3: Throughout this section all vectors are assumed to be row vectors. 

RA: In this section an edge subset is used to refer to the corresponding edge-induced 
subgraph. The vector space ¥(G) will be used to denote the vector space of all binary 
m-vectors as well as the vector space of all edge-induced subgraphs of G. Observe that 
the null set (or null graph @) is the 0-vector of U(G). 


R5: In electrical engineering literature, a cut is also referred to as a seg [Re61]. 


R6: Proofs of most results in this section may be found in standard texts [SeRe61], 
[Ch71b], [De74], [ThSw92], and [SwTh81]. 
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6.4.2 Circuit Subspace of an Undirected Graph 


DEFINITIONS 


D19: A graph is even if the degree of every vertex in the graph is even. Clearly, a 
circuit is an even graph. 


D20: A circ of a graph is a closed trail. The null graph is considered as a circ. 


NOTATION: The set of all circs of a graph G is denoted by C(G). In other words, C(G) 
is the set of all circuits and unions of edge-disjoint circuits of the graph G (including 
the null graph 0). 


FACTS 
F4: A subgraph of a graph is a circ if and only if it is even. 


F5: A circ is a circuit or union of edge-disjoint circuits. Thus, the edge set of an even 
graph can be partitioned into edge subsets such that each subset in the partition forms 
a circuit. 


F6: The ring sum of any two even subgraphs of a graph is even. Thus, the set C(G) 
is closed under ring sum. 


F7: C(G) is a subspace of the vector space U(G) and is called the circuit subspace 
of G. 


EXAMPLE 


E6: Two circs of the graph G of Figure 6.4.1 and their ring sum, which is clearly a 
circ, are shown in Figure 6.4.6. 


V1 


a 3 
Vv. V3 V. a2 V3 
e4 e7 e4 
Vs V4 Vs 
es 
Cire Gy Cire G, G19 G, 


Figure 6.4.6: Two circs of the graph G and their ring sum. 


REMARKS 
R7: Fact F5 is attributed to Veblen [Ve31]. 


R8: A connected, even graph G is eulerian, i.e., there exists a circ that contains all 
the edges of G (see §4.2). 
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Fundamental Circuits and the Dimension of the Circuit Subspace 
DEFINITION 


D21: Adding a chord c to a spanning tree T' of a connected graph G produces a unique 
circuit in G, called the fundamental circuit of G with respect to chord c. 


NOTATION: If e; is a chord of a spanning tree T, then C; will denote the fundamental 
circuit with respect to e;. 


FACTS 


F8: Given a connected graph G and a spanning tree T, there are m—n-+1 fundamental 
circuits, one for each chord of T. 


F9: The fundamental circuit with respect to chord c contains only one chord of the 
spanning tree T, namely, the chord c. The chord c is not present in any other funda- 
mental circuit with respect to T. 


F10: The (m—n-+1) fundamental circuits with respect to a spanning tree of a 
connected graph G are linearly independent in the circuit subspace C(G). 


F11: Ifa circ of a graph G contains the chords eg,ep,...,ex, then the circ can be 
expressed as the ring sum of the fundamental circuits Cy, Cb,..., Cr. 


F12: The fundamental circuits with respect to a spanning tree of a connected graph 
G constitute a basis for the circuit subspace C'(G), and hence, the dimension of C(G) 
is equal tom —n-+1, the nullity (G). 


F13: The dimension of the circuit subspace C (G) of a graph having p components is 
equal to un(G) =m-—n+p. 


EXAMPLE 


E7: The set of fundamental circuits with respect to the spanning tree T = {e1, e2, e4, e7} 
of the graph shown in Figure 6.4.1 is 


Chord e3 C3 = {e3, €1,€2} 
Chord és C5 = {e5,€1;e4} 
Chord e¢ Ce = {€6, €2, ea} 
Chord eg Cg = {ég, €2, €4, €7} 


It can be verified that the circ {e1, €4, 5, eg, e7,e€s}, which contains the chords es, e¢ 
and eg, is the ring sum of the fundamental circuits Cs, Cg, and Cg. This illustrates 
Fact F11. 
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6.4.3. Cutset Subspace of an Undirected Graph 


Recall from the definitions in §6.4.1 that a cutset is also a cut. Several facts that 
highlight the duality between cuts and circs will be presented next. 


DEFINITION 


D22: The collection of all cutsets and unions of edge-disjoint cutsets of a graph G 
is called the cutset subspace of G and is denoted by \(G). The null graph © is 
considered a cut and hence belongs to A(G). 


FACTS 


F14: Every cut of a connected graph G is the union of some edge-disjoint cutsets of G. 
Thus, A(G) is the collection of cuts of G. 


F15: The cutset subspace \(G) of a graph G is a subspace of the vector space U(G). 


F16: The ring sum of any two cuts of a graph G is also a cut of G; i.e., A(G) is closed 
under ring sum. 


EXAMPLE 


E8: Consider the graph in Figure 6.4.1 and the cuts $; = (Vi, V2) and S2 = (V3, V4) in 
Figure 6.4.5, where Vi = {U1, 03, U5}, V2 = {v2, va}, V3 = {U1, V2, v3}, and V4 = {U4, Us}. 
Then S$; = {e1, €2, €4, C7, es}, So = {ea, €5, €6, e7}, and S; @ Sy = {e1, eo, €5, €6, eg}. 


Moreover, it can be seen that S$; @ Sg = (AUD, BUC), where 


A = ViNV3 = {v1, v3} 
B= ViNV4 = {vs} 
C = VaNV3 = {v2} 
Doe Bava 


In fact, this illustration is also the basis of the proof of Fact F16. 


Fundamental Cutsets and the Dimension of the Cutset Subspace 


DEFINITIONS 


D23: Let T be a spanning tree of a connected graph G, and let b be a branch of T. If 
V, and V2 are the vertex-sets of the two components of T — b, then we can verify that 
the cut (Vi, V2) is a cutset of G. This cutset is called the fundamental cutset of G 
with respect to the branch b of T. 


NOTATION: If e; is a branch of a spanning tree T, then S; denotes the fundamental 
cutset with respect to the branch e;. 


D24: An incidence set of a vertex v in a graph G is the cut consisting of the set of 
edges of G that are incident on v. 
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FACTS 


F17: Given a connected graph G and a spanning tree T’,, there are n — 1 fundamental 
cutsets, one for each branch of T. 


F18: The fundamental cutset with respect to branch b of a spanning tree T contains 
only one branch, namely, the branch b. The branch 6 is not present in any other 
fundamental cutset with respect to T. 


F19: The n—1 fundamental cutsets with respect to a spanning tree of a connected 
n-vertex graph G are linearly independent in the cutset subspace A(G). 


F20: If a cut of a graph G contains the branches é,, e,,...e,%, then the cut can be 
expressed as the ring sum of the fundamental cutsets S_,5p,..., Sk. 


F21: The fundamental cutsets with respect to a spanning tree of a connected graph G 
constitute a basis for the cutset subspace \(G) of G, and hence the dimension of A(G) 
is equal to n — 1, the rank p(G). 


F22: The dimension of the cutset subspace A(G) of a graph having p components is 
equal to p(G) =n -— p. 


F23: The incidence sets of any n — 1 vertices of a connected n-vertex graph G form a 
basis of the cutset subspace A(G). 


EXAMPLE 


E9: For the graph shown in Figure 6.4.1, the fundamental cutsets with respect to the 
spanning tree T = {e1, €2,€4,e7} are 


Branch e1 S1 = {€1,€3, €5} 
Branch eg So = {€2, €3, €6, es} 
Branch e4 Si = {€4, €5, €6, €8) 
Branch e7 Sy = {€7, eg} 


It can be verified that the cut = {e1, e2, e4,e7,eg} containing the branches e), e2, e4, 
and e7 is the ring sum of the fundamental cutsets $ , S2, S4, and $7. This illustrates 
Fact F20. 


6.4.4 Relationship between Circuit and Cutset 
Subspaces 


By now it should be evident that circs and cuts are dual concepts in the sense that 
for each result that involves circuits or circs, there is a corresponding result involving 
cutsets or cuts. Facts F5 through F13 correspond to Facts F14 through F22. Spanning 
trees and cospanning trees provide the links between circs and cuts. This duality is 
further explored next. 
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Orthogonality of Circuit and Cutset Subspaces 


DEFINITIONS 


D25: The binary m-vector representing a circ is called a circuit vector; the binary 
m-vector representing a cut is called a cut vector; and the m-vector representing an 
incidence set is called an incidence vector. 


D26: Two subspaces W’ and W” of a vector space W are orthogonal to each other 
if the inner product (or dot product) of every vector in W’ with every vector in W” is 
zero. Note that the zero vector belongs to every subspace. 


FACTS 


F24: A circuit and a cutset of a connected graph have an even number of edges in 
common. Hence, a circ and a cut have an even number of edges in common. 


F25: The inner product of a circuit vector and a cut vector over GF(2) is zero under 
the ring sum operation. 


F26: A subgraph of a graph G belongs to the circuit subspace of the graph if and 
only if it has an even number of edges in common with every subgraph in the cutset 
subspace of G. Equivalently, a vector is a circuit vector if and only if it is orthogonal 
to every cut vector. 


F27: <A subgraph of a graph G belongs to the cutset subspace of the graph if and 
only if it has an even number of edges in common with every subgraph in the circuit 
subspace of G. Equivalently, a vector is a cut vector if and only if it is orthogonal to 
every circuit vector. 


F28: The circuit and cutset subspaces of a graph are orthogonal to each other. 


Circ/Cut-Based Decomposition of Graphs and Subgraphs 


DEFINITION 


D27: Two orthogonal subspaces W’ and W” of a vector space W are orthogonal 
complements if every vector in W can be expressed as the ring sum of a vector of 
W’ and a vector of W”. Note that the zero vector is the only vector that is in the 
intersection of the orthogonal complements W’ and W”. 


FACTS 


F29: If the orthogonal subspaces W’ and W” of a vector space W are not orthogonal 
complements, then the dimension of their union is less than the dimension of the vector 
space W. 


F30: [Ch7la] The circuit and the cutset subspaces of a graph are orthogonal comple- 
ments if and only if the graph has an odd number of spanning forests. 
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F31: Ifthe circuit and cutset subspaces of a graph are orthogonal complements, then 
every subgraph (including the graph itself) can be expressed as the ring sum of a circ 
and a cut. 


F32: [Ch71b, WiMa71] Every graph can be represented as the ring sum of a circ and a 
cut of the graph. If the dimension of the intersection of the circuit and cutset subspaces 
of a graph is equal to k, then there are 2" such representations. 


EXAMPLES 


E10: Consider the graph G, in Figure 6.4.7. It can be verified that no nonempty 
subgraph of this graph is both a cire and a cut. So the cutset and circuit subspaces of 
G, are orthogonal complements. Then the set of fundamental cutsets and fundamental 
circuits with respect to a spanning tree of G, constitutes a basis of the vector space 
W(G). One such set with respect to the spanning tree formed by the edges e1, e2, es, 
and eg is as follows: 


Si = 
So = 
S3 
S4 
S5 
S6 
S7 
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It is easy to verify that every subgraph can be expressed as the ring sum of a circ 
and a cut, which illustrates Fact F31. For instance, the vector (0 0 1 10 1 1), which 
represents the induced subgraph on the edge subset {e3, e4, €6, e7}, can be expressed as: 


0 0 1101 1 = S@S,8CE8C7 
= (1 10000 0e 1:1 10 1 1) 


where (1 10000 0) represents a cut in G,, and (1 1110 1 1) represents a circ. 
e4 C7 
e2 es 


e\ 


Figure 6.4.7: Graph G, for illustration of Fact F31. 
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E11: Consider the graph G in Figure 6.4.8. In this graph the edges e1, e2, e3, and 
€5 constitute a circuit as well as a cut. Hence the circuit and cutset subspaces are not 
orthogonal complements. This means that there is a subgraph of G, that cannot be 
expressed as the ring sum of a circ and a cut. However, according to Fact F32, such a 
decomposition is possible for Gy. This is verified as follows: 


G44 1 i) = 4 0 10 OGO-0 LO. 1.1) 


where (1 10100) represents the cut of edges e1, e2, and e4, and (0 0 101 1) represents 
the circuit of edges e3, e5, and eg in Gp. 


V1 


el e4 


V2 €3 V3 


Figure 6.4.8: Graph G, for illustration of Fact F32. 


6.4.5 Circuit and Cutset Spaces in a Directed Graph 


In most engineering applications of graph theory, directed graphs are encountered. 
But, as we shall see next, the effect of orientation is minimal in so far as the results 
concerning circuits and cuts are concerned. Almost all the results presented earlier in 
this section have their equivalents in the directed case. In fact, we can view all the 
results on undirected graphs presented thus far as special cases of the results to be 
presented next. 


TERMINOLOGY: A circuit, cut, or spanning tree in a directed graph G is a subset of 
edges that constitutes a circuit, cut, or spanning tree, respectively, in the underlying 
graph of G. 


Circuit and Cut Vectors and Matrices 
DEFINITIONS 
D28: A circuit in a directed graph can be traversed in one of two directions, clockwise 


or counter-clockwise (relative to a plane drawing of the circuit). The traversal direction 
we choose is called the circuit orientation. 
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D29: Let C be a circuit in a directed graph and e = (u;,v,;) an edge in C directed 
from vu; to v;. Given an orientation of C, edge e is said to agree with the circuit 
orientation if the traversal of e specified by that orientation is from its tail v; to its 
head Uj. 


D30: A cut (V., Vp) in a directed graph can be traversed in one of two directions, from 
V, to V, or from Vy to Va. The direction chosen is called the cut orientation. 


D31: Given an orientation of a cut in a directed graph, an edge e = (v;,v;) in the 
cut is said to agree with the cut orientation if the traversal of e specified by that 
orientation is from vu; to v;. 


D32: Let G be a directed graph with edge-set E = {e1,e€2,...,€m}, and let C bea 
circuit in G with a given orientation. The circuit vector representing C' is the m-vector 
(@1,22,..-,L%m), where 


: if edge e; agrees with the orientation of C 
x= <-1, if edge e; does not agree with the orientation of C 


0, if edge e; is not in C 


D33: Let G be a directed graph with edge-set E = {e1,€2,...,€m}, and let S be 
a cut in G with a given orientation. The cut vector representing S is the m-vector 
(@1,¥2,...,L%m), where 


1, if edge e; agrees with the orientation of S 
v= 44-1, if edge e; does not agree with the orientation of S 
0, if edge e; is not in S 


D34: Let G be a directed graph with edge-set FE = {e1, €2,...,@m}. Let C1, C2,...,C; 
and $1, 5S5,...,5, be the circuits and cuts of G, respectively, each with a given traversal 
orientation. The circuit matrix of G is the t x m matrix whose ith row is the circuit 
vector representing circuit C;. The cut matriz of G is the r x m matrix whose ith row 
is the cut vector representing cut S;. 


Fundamental Circuit, Fundamental Cutset, and Incidence 
Matrices 


Next, we define two special matrices corresponding to the fundamental circuits and 
cutsets relative to a given spanning tree in a directed graph and a third matrix corre- 
sponding to the incidence vectors of the vertices. 


REMARK 


R9: The definitions of these three matrices depend on how the associated circuits and 
cuts are oriented. The orientations of each fundamental circuit and each fundamental 
cut are usually chosen to agree with the defining chord and branch, respectively, and 
we adopt that convention here. Also, for the cut consisting of the set of edges incident 
on a vertex v (i.e., the incidence set of v), we assume that the orientation is away from 
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vertex vu. Accordingly, the incidence vector of vertex vu is given by (2#1,%2,...,%m), 
where 


1, if edge e; is directed from v_ (v is the tail of edge e;) 
x;= 4-1, if edge e; is directed tov (v is the head of edge e;) 


0, if edge e; is not incident on v 


DEFINITIONS 


D35: Let T be a spanning tree of a connected directed graph. The fundamental 
circuit matrix of the graph with respect to T, denoted by By, is the (m — n+ 1)- 
rowed submatrix of the circuit matrix whose rows are the fundamental circuit vectors. 
Similarly, the fundamental cutset matrix with respect to T’, denoted by Qy, is the 
(n—1)-rowed submatrix of the cut matrix whose rows are the fundamental cutset vectors. 


D36: The incidence matrix of a given directed graph, denoted A,, is the n-rowed 
submatrix of the cut matrix whose rows are the incidence vectors of the directed graph. 
The submatrix of the incidence matrix containing any n — 1 of the incidence vectors is 
called a reduced incidence matriz and is denoted by A. 


D37: <A matrix of real numbers is unimodular if the determinant of every square 
submatrix of the matrix is equal to 1, —1, or 0. 


EXAMPLES 


E12: Consider the directed graph of Figure 6.4.9(a) below. A circuit and a cut with 
orientations are shown in Figures 6.4.9(b) and (c), respectively. The corresponding 
circuit and cut vectors are (1 —1 -1 0 1 0 0) and (0 1 0 0 1 1 O), 
respectively. 


Vs 
&4 e, 
Vi é1 v2 
V3 V4 
2 es 
ée 
es 
V3 63 V4 
al Vy 
a1 
(a) A directed graph. (b) A circuit with orientation. (c) A cut with orientation. 


Figure 6.4.9: A directed graph, a circuit, and a cut with orientations. 
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E13: Consider the spanning tree T of the graph of Figure 6.4.9(a) consisting of the 
edges e€1, €2, €3, and e4. The fundamental circuit and the fundamental cutset matrices 
with respect to T, and the incidence matrix of this graph with the column 7 in each 
matrix corresponding to edge e; are: 


Fundamental Circuit Matrix: 


Chordes f/1 -—1 -1 0 1 0 0 
Chordeg | 1 —1 0 0 O01 0 
Chorde7 \0 0O 1 -1 001 
Fundamental Cutset Matrix: 
Branch e; /1 0 0 0 -1 -1 O 
Branch eg | 0 1 00 1 1 0 
Branchez |0 0 1 0 1 0 -l 
Branches \O0 0 0 1 O 0 1 


Incidence Matrix: 


Node vy 1 1 0 0 0 0 0 
Node vg | —1 O 0 0 1 1 0 
Node v3 | 0 —-1 1 1 0 -1 O 
Node v4 0 0 -1 0O -1 O 1 
Node vs 0) 0 0 -1 O 0 -l 


Orthogonality and the Matrix Tree Theorem 


TERMINOLOGY: A directed edge that is in both a circuit and a cut is said to have the 
same relative orientation with respect to the circuit and the cut if the edge either 
agrees or disagrees with the assigned orientations of both the circuit and the cut. 


FACTS 


F33: A circuit and a cut in a connected graph have an even number of common edges. 
If a circuit and a cut have 2k common edges, then these edges can be partitioned into 
two sets, each of size k, such that each edge in one set has the same relative orientation 
with respect to the circuit and the cut, and that each edge in the other set agrees 
with one of the two assigned orientations (circuit or cut) and disagrees with the other 
assigned orientation. 


NOTATION: Let G be a directed graph and suppose that each of the circuits and cuts 
has been given an orientation. 

(a) The collection of all circuit vectors of G and their linear combinations over the real 
field is denoted by C(G). 

(b) The collection of all cut vectors of G and their linear combinations over the real 


field is denoted by \(G). 


F34: In a directed graph every circuit vector is orthogonal to every cut vector over the 
real field. 


F35: In a connected directed graph, every circuit vector can be expressed as a linear 
combination of fundamental circuit vectors with respect to a spanning tree of the graph. 


662 Chapter 6. Algebraic Graph Theory 


The coefficients in the linear combination are 1 or —1. Similarly, every cut vector in 
a connected directed graph can be expressed as a linear combination of fundamental 
cutset vectors with respect to a spanning tree of the graph. The coefficients in the linear 
combination are 1 or —1. 


F36: In a directed graph G, C(G) and A(G) are vector spaces over the real field and 
are orthogonal to each other. C(G) and X(G) are called the circuit space and the 
cutset space, respectively. 


F37: The fundamental circuit vectors and the fundamental cutset vectors with respect 
to a spanning tree of a connected directed graph G form a basis of the circuit space and 
a basis of the cutset space, respectively. The dimension of the circuit space is equal to 
m—n-+p, the nullity of G, and the dimension of the cutset space is equal to n — p, the 
rank of G, where p is the number of components of G. 


F38: Any set of n — 1 incidence vectors of a connected directed graph forms a basis of 
the cutset space of the graph. 


F39: The fundamental cutset and the fundamental circuit matrices of a connected 
directed graph are unimodular. 


F40: Consider a spanning tree T’ of a connected directed graph G with branches 
by, bg,...... ,bn—1 and chords cj, ¢2,¢3,.--;Cm—n+1- Suppose that the edges of G are 
labeled so that e1,€2,..-,€m = 01, b2,...,0n—1, C1, €2,---;Cm—n+1, respectively. Then 
the fundamental circuit matrix By has the form By = [Byi|Um—n+41], where Um—n+1 is 
the identity matrix of size m—n-+1 and By; is the submatrix of By consisting of the 
columns corresponding to the branches bj, b2,...,bn—1 of T. Similarly, the fundamental 
cutset matrix Q; has the form Q+ = [Un—1|Q ye], where U,-1 is the identity matrix of 
size n — 1 and Qy- is the submatrix of @ consisting of the columns corresponding to 
the chords cy, c2,..-,€m—n+1 of T. Moreover, Q fe = — Be. 


F41: The columns of the cut matrix of a connected directed graph G are linearly 
independent if and only if they correspond to the branches of a spanning tree. Similarly, 
the columns of the circuit matrix are linearly independent if and only if they correspond 
to the chords of a cospanning tree. 


F42: (Matrix Tree Theorem) For a connected directed graph, each cofactor of the 
matrix A.A‘ equals the number of spanning trees of the graph. 


EXAMPLE 


E14: The matrices in Example E13 illustrate Facts F33 through F41. 


REMARKS 


R10: By simply replacing —1 by 1 in all the matrices defined for directed graphs, we 
get the corresponding matrices for undirected graphs. 


R11: The rank and the nullity of the cut matrix of a connected graph are (n — 1) and 
(m —n +1), respectively. This motivated the definitions of the rank and nullity of a 
graph (see Definition D13). 
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R12: The matrix A,A* is called the degree matrix of the graph. It can be verified that 
the diagonal entry (7,7) of the degree matrix is equal to the degree of vertex v; and the 
off-diagonal entry (7, 7) is equal to the negative of the number of edges connecting vertex 
v; and vertex v,; (regardless of the orientations of these edges). A proof of Fact F42 may 
be found in [ThSw92]. A weighted version of the degree matrix plays an important role 
in electrical circuit analysis [SwTh81]. 


Minty’s Painting Theorem 


TERMINOLOGY: Two directed edges in a circuit or cutset are said to have the same di- 
rection (relative to that circuit or cutset) if both edges agree with the same orientation 
of that circuit or cutset. 


DEFINITIONS 


D38: A directed circuit is a circuit whose edges all have the same direction relative 
to it. 


D39: A directed cutset is a cutset whose edges all have the same direction relative 
to it. 


D40: A painting of a directed graph G is a partitioning of the edges of the graph 
into three sets R, Y, and B and the distinguishing of one element of the set Y. We can 
visualize this as coloring of the edges of G with three colors, each edge being painted 
red, yellow, or blue, and exactly one yellow edge being colored dark yellow. 


FACTS 


F43: (Painting Theorem) [Mi66] Let G be a directed graph. For any painting of the 
edges of G, exactly one of the following holds: 


1. There exists a circuit containing the dark yellow edge but no blue edges, in which 
all the yellow edges have the same direction as the dark yellow edge. 


2. There exists a cutset containing the dark yellow edge but no red edges, in which 
all the yellow edges have the same direction as the dark yellow edge. 


F44: Each edge of a directed graph is in a directed circuit or in a directed cutset, but 
no edge belongs to both. 


REMARK 


R13: Minty’s painting theorem (also known as the arc coloring lemma) has profound 
applications in electrical circuit theory. This theorem is also true for orientable matroids 
(see [ThSw92]). Fact F44 is a corollary of Fact F2. Other related works by Minty of 
considerable significance in electrical circuit theory are [Mi60, Mi61]. Some applications 
of the arc coloring lemma to problems in electrical circuit theory may be found in 
[VaCh80, ChGr76, Wo70]. 
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6.4.6 Two Circ/Cut-Based Tripartitions of a Graph 


In §6.4.4 we presented a result on the decomposition of a graph into a circ and a cut. 
But such circs and cuts may not be disjoint and hence they may not form a partition 
of the edge set of the graph. We now present two ways to partition a graph. These 
partitions are both tripartitions and are again based on circs and cuts. 


Bicycle-Based Tripartition 


DEFINITION 

D41: A subgraph that is in the intersection of the circuit and cutset subspaces of an 
undirected graph is called a bicycle. That is, a bicycle is a circ as well as a cut. 
EXAMPLE 

E15: The edges e1, e2, e3, and es in the graph of Figure 6.4.8 form both a cut and a 
circuit. 

FACT 


F45: [RoRe78] Any edge e of a graph G is of one of the following types: 
1. e is in a circ that becomes a cut when e is removed from it. 
2. e is in a cut that becomes a circ when e is removed from it. 
3. e is in a bicycle. 


TERMINOLOGY: The partition of the edges defined by Fact F45 is called the bicycle- 
based tripartition. 


REMARK 


R14: Rosenstiehl and Read [RoRe78] have proved several interesting results relating 
to circuits and cuts and their relationship. A proof of Fact F45 may also be found in 
[Pa94]. 


A Tripartition Based on Maximally Distant Spanning Trees 


DEFINITIONS 


D42: The tree distance d(T,, T2) between any two spanning trees T; and T> is defined 
as d(T), T2) = |E(T1) — E(T2)| = |E(2) — E11). 


D43: Two spanning trees T; and Tz are maximally distant if d(T,,T2) > d(Z;,T;) 
for every pair of spanning trees TJ; and T;. 

NOTATION: The maximum distance between any two spanning trees of a connected 
graph is denoted by dy. 


D44: Given a pair of maximally distant spanning trees 7, and T) of a connected graph 
G. Suppose c is a common chord of T; and T). The k-subgraph G,. of G with respect 
to c is the edge-induced subgraph constructed as follows: 
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1. Let Ly be the set of all the edges in the fundamental circuit with respect to T) 
defined by c. 


2. Let Lg be the union of the sets of edges in all the fundamental circuits with respect 
to T, defined by every edge in Ly. 


3. Repeating the above, we can obtain a sequence of sets of edges L,, L2,... until 
we arrive at a set Ly41 = Ly. Then the induced subgraph on the edge set Ly is 
called the k-subgraph G. with respect to c. 


D45: The k-subgraph Gy with respect to a common branch Bb can be constructed in 
a dual manner as in Definition D44. 


D46: The principal subgraph G with respect to the common chords (of a pair of 
maximally distant spanning trees T; and T>) is the union of the k-subgraphs with respect 
to all the common chords. The principal subgraph G2 with respect to the common 
branches is the union of the k-subgraphs with respect to all the common branches. 


FACTS 


F46: [Kika69] Let T; and T2 form a pair of maximally distant spanning trees of a 
connected graph G. 


1. The fundamental circuit of G with respect to T; or T> defined by a common chord 
of T, and T2 contains no common branches of these spanning trees. 


2. The fundamental cutset of G with respect to T, or T> defined by a common branch 
of T, and T2 contains no common chords of these spanning trees. 


F47: [KikKa69] Consider a graph G = (V,E). Let E, and E2 denote the edge-sets of 
the principal subgraphs G'; and Go, respectively, and let Eo = E(G) — (£1 U Ez). Then 
Eo, Fy, and E> form a partition of the edge-set E(G). The partition (Eo, £1, E2) is 
called the principal partition of G and is independent of the maximally distant trees 
used to construct it. 


EXAMPLE 
E16: It can be verified that T; = {e2,€3,e4,e7} and T> = {e1,€3,€5,e6} are a pair of 
maximally distant spanning trees for the graph in Figure 6.4.10 and that the associated 
principal partition is: E, = {e¢,e7,es}, Eo = {e1,e2,e3}, and Eo = {e4,e5}. 
v2 C1 V3 
@5 
V1 e2 
C6) €7 
a4 
V4 Vs 
63 


Figure 6.4.10: A graph. 
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REMARKS 


R15: In electrical circuit analysis one is interested in solving for all the current and 
the voltage variables. The circuit method of analysis (also known as the loop analysis) 
requires solving for only m—n-+1 independent current variables. The remaining current 
variables and all the voltage variables can then be determined using these m — n+ 1 
independent current variables. The cutset method of analysis requires solving for only 
n — 1 independent voltage variables. A question that intrigued circuit theorists for a 
long time was whether one could use a hybrid method of analysis involving some current 
variables and some voltage variables and reduce the size of the system of equations to 
be solved to less than both n — 1 and m—n-+1, the rank and nullity of the graph 
of the circuit. Ohtsuki, Ishizaki, and Watanabe [OhIsWa70] studied this problem and 
showed that d,,,, the maximum distance between any two spanning trees of the graph of 
the circuit is, in fact, the minimum number of variables required in the hybrid method 
of analysis. They also showed that the variables can be determined using the principal 
partition of the graph. The works by Kishi and Kajitani [KiKa69] on principal partition 
and by Ohtsuki, Ishizaki, and Watanabe [OhIsWa70] on the hybrid method of analysis 
are considered landmark results in electrical circuit theory. Swamy and Thulasiraman 
[SwTh81] give a detailed exposition of the principal partition concept and the hybrid 
and other methods of circuit analysis. 


R16: Lin [Li76] presented an algorithm for computing the principal partition of a 
graph. Bruno and Weinberg [BrWe71] extended the concept of principal partition to 
matroids. 


6.4.7 Realization of Circuit and Cutset Spaces 


In the application of graph theory to the electrical circuit synthesis problem, one 
encounters a certain matrix of integers modulo 2 and seeks to determine if this matrix 
is the cutset or the circuit matrix of an undirected graph. The complete solution to this 
problem was given by Tutte [Tu59]. Cederbaum [Ce58] and Gould [Go58] considered 
this problem before Tutte provided the solution. We now present the main result on the 
necessary and sufficient conditions for the realizability of a matrix of integers modulo 2 
as the circuit or the cutset matrix of an undirected graph. Related results leading to 
this main result are also presented. Seshu and Reed [SeRe61] discuss these results in 
considerable detail, except for proof of the sufficiency of Tutte’s realizability condition. 


DEFINITIONS 


D47: The graphs in Figure 6.4.11 are called Kuratowski graphs. 


V3 Vo Vs 


a 


V1 v4 


v. 
Vy V3 é V3 


(a) (b) 
Figure 6.4.11: The two Kuratowski graphs. 
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D48: A matrix F of the form F' = [F” | U], where U is the identity matrix, is said to 
be in normal form. 


D49: A matrix F of real integers in normal form is a regular matriz if for every linear 
combination X of the rows of M with coefficients —1, 1, and 0 we have the following: 


1. The elements of X are 1, —1, and 0, or 


2. There exists another such linear combination Y (with coefficients 1, —1, and 0) that 
has 1 and —1 for nonzero elements and these are at a (not necessarily proper) subset of 
the positions in which X has nonzero elements. 


D50: A matrix of integers mod 2 is regular if the replacement of a suitable set of 1’s 
by —1’s makes it regular. 


FACTS 


F48: For aconnected directed graph G, the fundamental cutset and fundamental circuit 
matrices with respect to a spanning tree T of G and the reduced incidence matrix A of 
G are all regular matrices. 


F49: A regular matrix in normal form is unimodular. 


F50: Given a regular matrix F of integers 1, —1, and 0 in normal form, the replacement 
of —1’s by 1’s will leave the ranks of the submatrices unaltered (where the rank of the 
derived matrix is with respect to modulo 2 arithmetic). 


F51: A matrix F of integers mod 2 is regular if and only if no normal form of F’ 
contains either the matrix No or its transpose, where 


1 11 £0 
No=|]1 1 0 1 
1011 
F52: A matrix F of integers mod 2 is realizable as the cutset matrix of an undirected 
graph if and only if it is regular and no normal form of F' contains the circuit matrix of 


either of the two Kuratowski graphs shown in Figure 6.4.11 


F53: A matrix F of integers mod 2 in normal form is realizable as the circuit matrix 
of an undirected graph if and only if it is regular and no normal form of F' contains the 
cutset matrix of either of the two Kuratowski graphs shown in Figure 6.4.11 


REMARKS 


R17: Mayeda [Ma70] gave an alternate proof of Tutte’s realizability condition, shorter 
than Tutte’s original proof, which is 27 pages long. 


R18: Early works on algorithms for constructing graphs having specified circuit or 
cutset matrices are in [Tu60, Tu64]. Bapeswara Rao [Ba70] defined the tree-path matrix 
of an undirected graph which is essentially the non-unit submatrix of the fundamental 
circuit matrix and presented an algorithm for constructing a graph with a prescribed 
tree-path matrix. This is also an algorithmic solution to the cutset and the circuit 
matrix realization problems. A detailed presentation of Bapeswara Rao’s algorithm is 
given in [SwTh81]. 
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R19: The circuit and the cutset matrix realization problems arise in the design of 
multi-port resistance networks. It was in the context of this application that Cederbaum 
[Ce58, Ced59] encountered the realization problem. Interestingly, Bapeswara Rao [Ba70] 
and Boesch and Youla [BoYo65] presented circuit-theoretic approaches to the realization 
of a matrix as the cutset or circuit matrix of a directed graph. Details of Bapeswara 
Rao’s algorithm based on this approach may also be found in [SwTh81]. 


Whitney and Kuratowski 


We believe that it is appropriate to conclude this section with a reference to two classic 
works by Whitney [Wh33] and Kuratowski [Ku30] relating to duality. While the results 
of this section bring out the duality between circuits and cutsets, Whitney introduced 
the concept of duality between graphs. His original definition was an algebraic one (see 
also [ThSw92]) relating the nullity and rank of certain corresponding subgraphs of dual 
graphs. Definition D51 is an equivalent one. 


DEFINITION 


D51: A graph G2 is a dual of a graph G, if there is a one-to-one correspondence 
between their edge-sets such that a set of edges in G2 is a circuit vector of G2 if and 
only if the corresponding set of edges in G, is a cutset vector of G1. 


FACTS 


F54: It follows from the duality between circuits and cutsets that if Gz is a dual of Gj, 
then G is a dual of Go. 


F55: [Wh33] A graph has a dual if and only if it is planar. 


F56: In another classic work, Kuratowski [Ku30] proved that a graph is planar if and 
only if it does not contain a subdivision of a Kuratowski graph. 


REMARK 


R20: See [We01] for a proof of Kuratowski’s theorem. It is quite interesting to see the 
role of the Kuratowski graphs in Tutte’s realizability conditions for the cutset and the 
circuit matrix realization problems. 


6.4.8 An Application: Cross-Layer Survivability in a 
Layered Network 


DEFINITIONS 
D52: A layered network is a pair (Gp,Gz,), where Gp = (Vp, Ep), the physical 


graph, and G, = (Vz, E_z), the logical graph, have the same vertex set. That is, 
Vp = V,. It is assumed that both Gp and Gy are two-edge connected. 
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D53: A path in Gp is called a lightpath. 


D54: A layered network (Gp,G_z) is survivable if there exists a mapping of each edge 
(i,7) € Ey into an i — 7 lightpath in Gp such that deletion of an edge in Gp does not 
cause Gy, to become disconnected. Such a mapping is called a survivable logical topology 
mapping. 


The following is a generalization of the concept of ear decomposition defined in §7.2. 


D55: A generalized partial circuit ear decomposition of a connected graph G is 
the collection of circuit ears obtained by the following algorithm: 


1. Let Go =G. 

2. Pick a circuit C of size two (called circuit ear) in Gc. 
3. Contract the edges in C, resulting in a new Go. 

4. Repeat 1 and 2 until Go is a single node. 


D56: Some of the edges in G may not be in any circuit ear in a partial circuit ear 
decomposition, and such edges are called straddling edges. 


D57: A generalized partial cutset ear decomposition of a connected graph G is 
the collection of cutset ears obtained by the following algorithm: 


. Let Gg =G. 
. Pick a cutset S of size two (called cutset ear) in Gg. 
. Delete the edges in S, resulting in a new Gg. 


rwn 


. Repeat 1 and 2 until every cutset in Gg is a single edge. 


D58: Some of the edges in G may not be in any cutset ear in a partial cutset ear 
decomposition, and such edges are called ztsolated edges. 


FACTS 


F57: [KuTh05] A layered network (Gp,G) is survivable if and only if there is a 
generalized partial circuit ear decomposition of Gz such that all the edges in a circuit 
ear can be mapped into edge disjoint lightpaths in Gp. 


F58: [ThJaXu09] Given a generalized partial cutset ear decomposition of Gz in a 
layered network, let G7, be graph obtained by contracting the isolated edges in the 
decomposition. Then, the layered network (Gp, G7.) is survivable if and only if there is 
a generalized partial cutset ear decomposition of G7 such that all the edges in a cutset 
ear can be mapped into edge disjoint lightpaths in Gp. 


REMARKS 


R21: An IP (Internet Protocol) over a WDM (Wavelength Division Multiplexed) 
network is an example of a layered network. Here Gp represents the optical network 
and the logical graph Gy represents the network at the IP layer. 


R22: The concepts of generalized partial circuit and cutset ear decompositions were 
introduced in [ThJaXu09] and studied further in [Th09], [ThJaXu10], [ThLiJaXu10], 
and [ThLiZhXul2]. 
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INTRODUCTION 


Spectral graph theory involves the investigation of the relationship of the usual 
(topological) properties of a graph with the (algebraic) spectral properties of various 
matrices associated with it. By far the most common matrix investigated has been the 
0-1 adjacency matrix. The subject had its genesis with the paper by L. Collatz and 
U. Sinogowitz [CoSi57] in 1957. Since that time the subject has steadily grown and has 
shown surprising interrelationships with other mathematical areas. 

Throughout this section, graphs are assumed to be simple. 


6.5.1 Basic Matrix Properties 


Many spectral properties of graphs follow from direct application or easy exten- 
sions of known results in matrix theory. An older but compact and useful reference is 
[MiMa64]. A more encyclopedic one is [Ga60]. 


DEFINITIONS 


D1: The adjacency matrix of a (simple) graph G is a square matrix A (or Ag) with 
rows and columns corresponding to the vertices. 


1 if uv; and v; are adjacent 
Aij= 


0 otherwise 
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D2: The characteristic polynomial of a graph is the determinant det(aI — A) of 
its adjacency matrix. 


D3: The eigenvalues of a graph are the roots to the characteristic polynomial. 


D4: The algebraic multiplicity of an eigenvalue is the number of times it occurs 
as a root of the characteristic polynomial. 


D5: The spectrum of a graph is the multiset of eigenvalues. For a graph with n 
vertices, there are n eigenvalues. 


D6: The geometric multiplicity of an eigenvalue is the dimension of the eigenspace 
{x | Av = Ax}. 


FACTS 


F1: The eigenvalues of a graph are real. This follows because the adjacency matrix is 
real and symmetric, which implies that it is Hermitian, i.e., each entry a;,; equals the 
complex conjugate of a;;. (The eigenvalues of any Hermitian matrix are real.) 


F2: The geometric and algebraic multiplicities of each eigenvalue are equal. 
F3: The eigenvalues have a corresponding set of eigenvectors that is orthonormal. 


F4: The adjacency matrix A of a graph can be diagonalized, that is, there is a square 
matrix U (of eigenvectors) such that UAU? = UAU~! is a diagonal matrix with the 
eigenvalues as diagonal entries. 


F5: The trace of the adjacency matrix of a graph, that is, the sum of the eigenvalues 
of a graph, is 0, since the adjacency matrix is 0 on the diagonal. 


F6: The spectrum of a graph is the union of the spectra of its connected compo- 
nents, since connected components of a graph are just blocks down the diagonal of the 
adjacency matrix. 


F7: If r is the largest eigenvalue of a graph, then |\| < r for any eigenvalue of 
that graph. (Since the adjacency matrix has nonnegative entries and connectivity im- 
plies irreducibility, this follows from the well-known Perron—Frobenius theorem — see 
[MiMa64].) 


F8: Ifa graph is connected, then the largest eigenvalue has multiplicity 1. It has an 
eigenvector with all entries positive. (This fact is another consequence of the Perron— 
Frobenius theorem.) Since the adjacency matrix is symmetric, being imprimitive is 
equivalent to the graph being bipartite. 


F9: Let r be the largest eigenvalue of a graph. Then a graph is bipartite if and only 
if the number —r is also an eigenvalue. 


F10: Whether or not a graph is bipartite can be determined by its spectrum. (This 
follows immediately from Fact F9.) 


F11: A graph is bipartite if and only if the spectrum is symmetric around 0, that is, 
A is an eigenvalue if and only if —A is an eigenvalue. 


Section 6.5. Spectral Graph Theory 675 


REMARK 


R1: Because of Fact F6, for most results in this chapter the graphs under consideration 
may be assumed to be connected, with no loss of generality. 


EXAMPLES 


The spectrum of a graph is given as a set of eigenvalues with the multiplicities as 
exponents (and thus, the determinant is taken as the product of the set entries). 


E1: The complete graph K,: {(n—1)1,-1" +}. 

E2: The complete bipartite graph Ky n: {./mn, min 2. —./mn'}. 

E3: The path with n vertices P,: {2cos(kr/n+1)!,k =1,...,n}. 

E4: The circuit with n vertices Cp: {2cos(2km/n)!,k =1,...,n}. 

Notice that these eigenvalues are not distinct. The eigenvalue 2 and, when n is even, 


the eigenvalue —2 are simple and all others have multiplicity 2. 


E5: A cocktail party graph CP(n) is a complete graph on 2n vertices with a 1-factor 
deleted. Spectrum: {(2n — 2)1,0",-2"~1}. 


E6: The d-dimensional hypercube Qy: {(d—2k)(*),k =0,...,d}. 


E7: A wheel W,, (with n+ 1 vertices) is the join of an n-cycle C,, and an additional 
vertex. Spectrum: {2cos(2km/n)!,k=1,....n-1}U{1+V14n}. 


E8: The platonic graphs 

tetrahedral graph Ky: {3,—1°}. 

cube graph Q3: {31, 13, -18, —31}. 

octahedral graph O3(& CP(3)): {41,0%, -27}. 
dodecahedral graph: {31, A ae 1°, 0*,—2*; =a Yi 
icosahedral graph:  {5?, VB, 15, 4/5 1 


6.5.2 Walks and the Spectrum 
Walks and the Coefficients of the Characteristic Polynomial 


DEFINITIONS 


D7: An elementary figure in a graph is a subgraph that is isomorphic to K2 or to 
a cycle graph C,. 


Ds: A basic figure is a vertex-disjoint union of elementary figures. 
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FACTS 


F12: If A is the adjacency matrix of a graph, then A*;; is the number of walks of 
length k from the vertex v; to v;. 


F13: If A is the adjacency matrix of a graph, then AP is the number of closed walks 
of length & from the vertex v; to vj. 


F14: The trace of A’ is the total number of closed walks of length k in the graph. 
F15: The trace of A? is twice the number of edges in the graph. 
F16: The trace of A® is six times the number of triangles in the graph. 


F17: [Sa64] If a,2”" + a,_12"-! +--+ + a,2 + dp is the characteristic polynomial of a 
graph, then a, = 1 and a,_; = 0. Also, —a,;,_2 is the number of edges and —a,_3 is 
twice the number of triangles in the graph. (This fact follows readily from an expansion 
of the determinant det(aI — A).) 


F18: [Sa64] If anv” + an_j2"~ 1 +--+ +12 + ao is the characteristic polynomial of 
a graph, then a, = )>(—1)°°™P) 2%), where the sum is over all basic figures with 
n —k vertices, comp(B) is the number of connected components of B, and circ(B) is 
the number of circuits of B. 


REMARK 


R2: Fact F18 is sometimes called the coefficients theorem. Its interesting history 
is given in [CvDoSa95]. It extends the idea of Fact F17 to other coefficients of the 
characteristic polynomial. The coefficient of x* will be determined by permutations 
with exactly k fixed points (vertices). For the other vertices, the permutations will have 
cycles of length 2 (corresponding to an edge in the graph) or cycles of length greater than 
two (corresponding to a circuit in the graph). Thus, in order to determine the coefficient 
of x* we need to count all of the basic figures that have n — k vertices. Furthermore, 
within each basic figure, a permutation corresponding to a circuit contributes 2 to the 
determinant (once clockwise, once counterclockwise) and an edge contributes 1 to the 
determinant. 


Walks and the Minimal Polynomial 


DEFINITIONS 


D9: The minimal polynomial of a graph G is the monic polynomial q(x) of smallest 
degree, such that q(Ag) = 0. 


D10: The eigenvalues-diameter (lower) bound for the number of eigenvaues of a 
graph G is diam(G) + 1. 
FACTS 


F19: The minimal polynomial of a graph is m(x) = [](a# — ;) where the product is 
taken over all distinct eigenvalues. 
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F20: Given two vertices vj and v; at distance t in a graph with adjacency matrix A, 
we have AF, =O<k <t, and At, £0. 


F21: Ifa graph has diameter d and has m distinct eigenvalues, then m > d+ 1. This 
substantiates the eigenvalues-diameter bound. It follows from Fact F20. 


F22: The degree of the minimal polynomial is larger than the diameter of a graph. 


F23: The complete graph is the only (connected) graph with exactly two distinct 
eigenvalues. 


F24: The complete graph K, is determined by its spectrum. (This follows from 
Fact F23, since the total number of eigenvalues — taking multiplicities into account — 
equals the number of vertices.) 


EXAMPLES 


E9: Note that in our previous examples, the graphs Ky, Km, CP(n), Pa, Cn, and 
Qn all attain the eigenvalues-diameter bound. 


E10: The wheel W,, has approximately n/2 distinct eigenvalues and diameter 2. 


RESEARCH PROBLEM 


RP1: Characterizing those graphs meeting the eigenvalues-diameter bound remains 
an open question. Of the 31 connected graphs with 5 or fewer vertices, there are 12 
that meet the bound. 


Regular Graphs 


DEFINITION 


D11: The Hoffman polynomial for a regular, connected graph of degree r is the 
polynomial 
(x — ri) 
h(a) = ——____ 
() =n] 5 
the product being taken over all distinct eigenvalues not equal to r. 
FACTS 


NOTATION: Let J denote a square matrix with every entry equal to 1. 


F25: The largest eigenvalue of a regular graph of degree r is r itself. A corresponding 
eigenvector is (1,1,...,1)7. (This follows from the Perron—Frobenius theorem.) 


F26: Any eigenvector corresponding to an eigenvalue other than r has coordinates 
that sum to 0. (This is because eigenvectors from different eigenvalues are orthogonal.) 


F27: The multiplicity of the eigenvalue r is the number of connected components. 
(Each connected component contributes 1 to the multiplicity of r.) 
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F28: A graph is regular if and only if A and J commute. 


F29: The complement of a regular graph with n vertices has an adjacency matrix equal 
to J — A—TI. Hence the eigenvalues for the complement of a regular connected graph 
are n —r—1 and —A; — 1, where ; runs over the eigenvalues of A not equal to r. 


F30: [Ho63] If a regular connected graph has adjacency matrix A and Hoffman poly- 
nomial h(x), then h(A) = J. 


6.5.3. Line Graph, Root System, Eigenvalue Bounds 


An early problem in spectral graph theory was bounding the eigenvalues of a graph 
from below. One of the basic tools for bounding eigenvalues comes from matrix theory 
and is called the interlacing theorem. Other uses for the interlacing theorem are given 
in [Ha95). 


DEFINITIONS 


D12: A principal submatriz of an n x n square matrix is obtained by deleting the 
i*” row and the i*® column for some 1 < i <n. 


D13: The line graph of a graph G, denoted L(G), has the edges of G as vertices with 
two vertices in L(G) adjacent if, as edges of G, they have an endpoint in common. 


D14: The vertex-edge incidence matrix of a graph G has rows corresponding 
to its vertices and columns corresponding to its edges. An entry is 1 if the vertex 
corresponding to the row is incident to the edge corresponding to the column, and is 0 
otherwise. It is denoted by K(G) or, simply, by K. 


D15: Given a graph G with n vertices and nonnegative integers a,,...,@,, the gen- 
eralized line graph L(G;a1,...,@,) is formed as follows: first, take disjoint copies of 
the line graph L(G) and cocktail party graphs CP(a1),...,CP(a,). In addition, if a 
vertex in L(G) corresponds to the edge joining v; to v; in G, then join it to all vertices 
in CP(a,;) and CP(a,). 


NOTATION: A(G) denotes the smallest eigenvalue of a graph G. 
NOTATION: A(G) denotes the largest eigenvalue of a graph G. 


FACTS ABOUT INTERLACING 


F31: Let A be a real symmetric matrix with eigenvalues \; > Ag > ++: > An, having 
a principal submatrix with eigenvalues 4 > 2 >--: > Un—1. Then 


Ay > fr > Ag > pea 23+ & An=1 = Un-1 = An 


F32: If H is an induced subgraph of G, if uy > po > +--+: > Mm are the eigenvalues of 
A, and if Ay > Ag > --- > Ay are the eigenvalues of G, then 


Ni > bi = Ait¢n—m fori=1,...,m 
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FACTS ABOUT THE SMALLEST EIGENVALUE X(G) 
F33: If H is an induced subgraph of G, then A(H) < A(G). 


F34: The least eigenvalue of a connected graph is always nonpositive. It equals zero if 
and only if the graph is ky. 


F35: No graph has a least eigenvalue between 0 and —1. 


F36: The only connected graphs with least eigenvalue —1 are the complete graphs 
with two or more vertices. 


F37: There are no graphs with least eigenvalue between —1 and —V2. 
F38: The graph Kj,2 is the only connected graph whose least eigenvalue equals 4/2, 


F39: There are infinitely many connected graphs with their least eigenvalues between 
—v2 and —2. (This follows from Example E3.) 


Line Graphs and Generalized Line Graphs 


FACTS ABOUT THE LINE GRAPH L(G) 
F40: If K is the vertex-edge incidence matrix of a graph G, then KK? = 21+A(L(G)). 


F41: If K is the vertex-edge incidence matrix of a graph G, then the matrix KK” is 
positive semidefinite, and hence has nonnegative eigenvalues. 


F42: [Ho75] For any graph G, A(L(G)) > —2. 


F43: [Do70] A graph G satisfies A(L(G)) > —2 if and only if G is a tree or G has 
exactly one circuit, that circuit being odd. 


F44: There are infinitely many graphs G with —2 < \(G) < —v2. 


FACTS ABOUT THE GENERALIZED LINE GRAPH X(L(G;a1,...,@n) 


The results for generalized line graphs are similar to those for line graphs. Form the 
matrix K’ by appending columns and rows to the vertex-edge incidence matrix. For 
each a;, append a; pairs of new columns, each of which has two nonzero entries. For 
each pair there is a 1 in the row corresponding to v; and a new row is added with one 
1 and one —1 in the new columns. All other entries are 0. Use K’K’" as before. 


F45: [CDS81] For any graph G with n vertices \(L(G;a1,...,@n) > —2. 


F46: [CDS81] A graph G satisfies the lower bound A(L(G3a1,...,an) > —2 if and only 
if G = L(T;1,0,...,0) where T is a tree or G = L(H) where H is a tree and H has 
exactly one cycle, that cycle being odd. 


F47: [Ho75] If G is a regular connected graph of degree r with n vertices and eigenvalues 
r=, > Aq >A3 >--+ > An then the eigenvalues of L(G) are A; +r — 2,7 =1,...,n 
plus —2 of (additional) multiplicity n(r — 2)/2. 
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EXAMPLES 


E11: The line graph of a complete graph is called a triangular graph. From the 
spectrum of K,, (Example E1) we see that the spectrum of the triangular graph L(K,,) 
is (2n— 4) ,n— 47-1, 27-3) /21, 


E12: The line graph of a regular complete bipartite graph is called a lattice graph. 
From the spectrum of K,,,, (Example E2), it follows that the spectrum of the lattice 


graph L(Ky,,) is {2n — 21,n —22"-2,-20-))}, 


Root Systems 


The converse of Fact F46 is almost true. The exact statement involves the root systems 
used in the classification of semisimple Lie algebras (found in [Ca89], for example). Root 
systems are sets of vectors used to form the columns of the matrices K’ as above. 


FACTS ABOUT ROOT SYSTEMS 

The possible real root systems are denoted A,, Dn, Eg, E7, and Eg. These root systems 
have n(n + 1)/2, n(n — 1), 36, 63, and 120 vectors, respectively. They also satisfy 
An—-1 © Dy, and Eg C Ev C Eg. One of the most beautiful results in spectral graph 
theory relates root systems with eigenvalues of graphs. 


F48: [GCSS76] \(H) > —2 if and only if there is a matrix K’ whose columns are taken 
from the root systems A, D,, Eg, E7, or Eg such that A= 274+ K'K'. 


F49: [GCSS76] H = L(G) where G is bipartite if and only if K’ can be formed from 
vectors in the root system Ap. 


F50: [GCSS76] H = L(G;a,,...,a,) if and only if K’ can be formed from vectors in 
the root system D,,. 


F51: [GCSS76] With only a finite number of exceptions, A(H) > —2 implies that H is 
a generalized line graph. 


The exceptional graphs from the last fact are those constructed from (the finite number 
of) vectors in Fg, E7, and Eg by using them as columns of a matrix K’ as described in 
Fact F48. Using a variety of techniques (both computer-assisted and otherwise) much 
is known about these exceptional graphs. 


F52: If \(H) > —2, then either H can be formed by vectors in the root system D,, or 
HT is one of 20 graphs with six vertices, 110 graphs with seven vertices, or 443 graphs 
with eight vertices. 


A type of characterization of graphs with \(G) > —1— V2 by a different generalization 
of line graphs has been given in [WoNu95]. 


EXAMPLE 


E13: Let {e,,...,e,} be the canonical basis for R”. Consider the following set of 
vectors: {e; — ej41,2 = 1,...,n — 1} U {e, + €2,en_1 + en}. In fact, these vectors are 
in the root system D,. Use these vectors as columns for K’ and define a graph, also 
denoted by D,,, from the matrix equation A = 2J + KK'T. The graph then appears as 
in Figure 6.5.1. 
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Figure 6.5.1: The graph D,. 


Eigenvalue Bounds 


FACTS ABOUT THE LARGEST EIGENVALUE A(G‘) 
Many of the above ideas can also be used to obtain upper bounds on the eigenvalues of 
graphs. 


F53: If H is an induced subgraph of G, then A(H) < A(G). If H is a proper subgraph 
of G, then A(H) < A(G). 


F54: If 7 is the average degree and rmax is the maximum degree of a graph G, then 
F< A(G) < rmax. Equality is attained if and only if the graph is regular. 


F55: [CvDoGu82] If A(G) < 2, then G is P,, T(1,1,n), T(1,2,4), T(1,2,3), or 
T(1,2,2). The graph T(i, j,k) is formed by taking three paths uo,..., ui, Vo,---,U,; 
and wo,.-., Wx and identifying the vertices uo, vg, and wo. It clearly has one vertex of 
degree three, three vertices of degree one, and all other vertices of degree two. 


Figure 6.5.2: The graph T(i, j,k). 
F56: [CvDoGu82] If A(G) = 2 then G is Cy, Kia, T(2,2,2), T(3,3,1), T(5, 2,1), 
or Dn. 
Similar results are available for graphs with A(G) < (2+ V5)!/? (see [CvDoGu82)). 


Bounds on the second largest eigenvalue of a graph, A2(G), have also been of considerable 
interest. 


DEFINITIONS 


D16: A Ramanujan graph is regular of degree k& for which any eigenvalue \ 4 +k 
satisfies |A| < 2k — 1. In particular \_ < 2Vk — 1. 


D17: The spectal gap of a graph is A; — Az. 


D18: If the vertices of the graph G are partitioned into {V,W}, then the boundary 
of V, denoted 0(V), is the number of edges with one end point in V and one in W. 
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D19: A graph is an c-expander if, for every partition of the vertices {V,W} with 
|V| < |W], it follows that 0(V) > e|V|. In other words, the average outdegree from V 
to W is bounded from below by e. 


FACTS 


F57: [LuPhSa88] Families of Ramanujan graphs with fixed degree & and a large number 
of vertices are difficult to construct. One such family uses the Caley graphs of groups 
derived from PSL2(q). 


F58: [LuPhSa8gg], [Fr91] Consider graphs G;,, with n vertices and regular of degree 
k. Then liminfp.o.0 Grn > 2Vk—1, and for given €, the probability that Gyn < 
2/k—1+€ goes to lasn—- ow. 


F59: [Mu03] Ramanujan graphs make good expanders. 
F60: [A186] Good expanders have a large spectral gap. 
F61: [Sm70] A2(G) < 0 if and only if G is a complete multipartite graph. 


F62: [Li94] A2(G) is dense in the interval ((2+ V5)!/?,00) but nowhere dense in 
(—00, V2 — 1). 


Graphs with A9(G) < /2—1 and \2(G) < (V5 —1)/2 are described in [CvSi95], [Li94], 
and [Si95]. 


FURTHER READING 


A complete discussion of the constructions of Ramanujan graphs using PSL2(q) can be 
found in [DaSaVa03]. 


An excellent survey of the relationships between Ramanujan graphs and analogues of 
the Riemann zeta function is given in [Mu03]. 


A complete reference of the spectral properties of line graphs and their generalizations 
is [CvRoSi04]. 

A more general discussion on different bounds on the eigenvalues of graphs can be found 
in [PeRa0l]. 


6.5.4 Distance-Regular Graphs 


Eigenvalues are crucial for understanding the properties of distance-regular graphs. 
DEFINITIONS 
D20: A graph of diameter d is distance-regular with parameters 
{pk | 0<3,9,h < ah 


if for each triple (i, 7,&) and for any pair of vertices such that the distance between them 
is k, the number of vertices at distance i from the first and distance 7 from the second 
is pe j;- Each of these numbers pt ; 1s independent of the particular choice of vertices. 
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D21: In the Hamming graph H(d,n), the vertices are all d-tuples (71,...,vq) with 
1 <a; <n. Two vertices are joined if, as d-tuples, they agree in all but one coordinate. 
The distance between two vertices is then the number of coordinates in which, as d- 
tuples, they differ. 


D22: In the Johnson graph J(d,n), the vertices are the @) subsets of {1,...,n}. 
Two vertices are joined if, as subsets, their intersection has cardinality d—1. The 
distance bewteen two vertices is k if, as subsets, they have an intersection of cardinality 
d—k. 


EXAMPLES 


E14: H(d,2) is isomorphic to the hypercube graph Qg. It is distance regular with 
diameter d. 


E15: J(2,n) is isomorphic to the triangular graph L(K,,). It is distance regular with 
diameter 2. 


Distance-Regular Graphs and the Hoffman Polynomial 


A distance-regular graph is regular (from P) 1) and connected, and so it has a Hoffman 
polynomial. 


DEFINITIONS 


D23: The I**-order adjacency matriz is defined with Ag = I as the identity matrix, 
A, = Aas the usual adjacency matrix, and A; as the matrix with 1 in the (i, 7) position 
if the corresponding vertices are at distance | from each other and 0 otherwise. 


D24: The I'*-order parameter matriz is the matrix P, with the distance-regularity 
parameter p’,,, in the (i, 7) entry. 
FACTS ABOUT THE MATRIX A, 


F63: For a distance-regular graph with diameter d, we have A;A; = pee pi Ar. 
Also, ee Ax = J, the all-one matrix. 


F64: The algebra generated by {Ao,..., Aa} is of dimension d+ 1 (since matrices A; 
and A; commute and the A; are linearly independent). All the matrices A* are in this 
algebra for k = 0,1,...,d. 


F65: The number of distinct eigenvalues of the adjacency matrix A of a distance-regular 
graph of diameter d is d+ 1. 


F66: The Hoffman polynomial of any distance-regular graph of diameter d is a poly- 
nomial of degree d. 


F67: Any distance-regular graph meets the eigenvalues-diameter bound. 
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FACTS ABOUT THE PARAMETERS Pe 


F68: P,P; = es Pi Pe, and so the commutative algebra generated by {Po,..., Pa} 
is isomorphic to the one generated by {Ao,..., Aa}. 


F69: The minimal polynomial for A; and P, is the same, and so A; and P, have the 
same distinct eigenvalues. 


F70: The eigenvalues of P; are simple. That is, they occur with multiplicity one. 


F71: The parameters of a distance-regular graph determine the spectrum. 


Strongly Regular Graphs 


DEFINITION 


D25: <A strongly regular graph is a distance-regular graph of diameter 2. The 
parameters are (n,r,, 4) where n is the number of vertices, r is the degree, A = Pin 
and w= Pi: To avoid trivialities, K, and its complement are not strongly regular. 


EXAMPLES 
E16: Triangular graphs: L(/,,) has parameters (n(n — 1)/2,2n — 4,n — 2,4). 
E17: Lattice graphs: L(K,,,) has parameters (n?,2n — 2,n — 2,2). 


E18: Paley graphs: P(p") has as vertices the elements of the finite field GF(p”) with 
two vertices adjacent if, as field elements, their difference is a quadratic residue (for 
this relation to be symmetric p” must be 1 mod 4). The Paley graph has parameters 


(p”, (p” — 1)/2, (p” — 5)/4, (p™ — 1)/4). 


FACTS 

The parameters of a strongly regular graph are not independent. Pick a vertex and 
count the number of paths of length two starting at that vertex and ending at a different 
one. There are n — r — 1 vertices at distance two from our given vertex, and each one 
contributes jz such paths. Also, for each of the r vertices adjacent to the given vertex, 
there are r — 1 — X choices for a second edge to get a desired path. 


F72: For a strongly regular graph with parameters (n,r,A, 4), we have u(n—1r —1) = 
r(r—1—A). 

F73: Since there are only three types of entries in A, A”, and J, for a strongly regular 
graph (corresponding to equal, adjacent, and nonadjacent vertices) it’s easy to recognize 


the Hoffman polynomial and hence the eigenvalues for a strongly regular graph. In 
particular, for a strongly regular graph with parameters (n,7r, A, {1), we have 


A? +(u-AA+(u—r)I = pw. 


F74: The eigenvalues of a strongly regular graph with parameters (n,r, A, js) are r and 
the two roots of the polynomial x? + (uw —A)a+(u—7r). 


F75: A regular connected graph is strongly regular if and only if it has three distinct 
eigenvalues. 


It is also easy to compute the multiplicities of the eigenvalues, since r is a simple 
eigenvalue, the sum of the multiplicities is n, and the trace of A is 0. 
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F76: For a strongly regular graph with parameters (n,7r,.,), the eigenvalues are 
A= 1, Ao = (X= p)/2+ AV? and Az = (A — p)/2— A’/?, where A = po? — 2ur44+d? — 
4p — 4X. The respective multiplicities are 1, m2, and m3 where mz + m3 = n— 1 and 
MzrA2 +mM3A3 = —7r. Since Ay # A3, the solution for mz and m3 is unique. 


F77: If A in Fact F76 is not a square, then m2 = m3. Such a graph is called a 
conference graph. 


F78: One test to see if a potential set of parameters (n,r, A, 4) is actually attained by 
a graph is to see if the multiplicities mz and m3 are integers. 


EXAMPLE 


E19: For the Paley graph, 2? + (u—A)a+(u—r) =2? +a —(p" +.1)/4, {r2, A3} = 
{$(-1 + p”/?)\1, and mg = m3 = (p” — 1)/4. 


FURTHER READING 

Further details and results concerning strongly regular graphs can be found in the 
excellent reference [GoRo01]. The encyclopedic reference for distance-regular graphs is 
[BrCoNe89]. More recent results can be found in [BrHa12]. 


6.5.5 Spectral Characterization 


One of the earliest and continuing questions in spectral graph theory asks the fol- 
lowing: when is a graph characterized by its spectrum? Finding pairs of nonisomorphic 
graphs with the same spectrum can pinpoint properties of a graph that cannot be de- 
termined spectrally. 


EXAMPLES 


E20: Figure 6.5.3 shows the two smallest graphs with the same spectrum, which 
is {2',0,-2'}. This example implies that the number of quadrilaterals (unlike the 
number of triangles) cannot be determined from the spectrum. Similarly, neither the 
degree sequence nor the connectivity can be determined by the spectrum. 


AN L] 


Figure 6.5.3: The smallest pair of cospectral graphs. 


E21: Figure 6.5.4 shows the two smallest aes iia with the same spectrum. 
The characteristic polynomial of both graphs is ( x? — a? — 54 +1)(x+4+1)?. 


XN 


Figure 6.5.4: The smallest pair of connected cospectral graphs. 
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E22: Figure 6.5.5 shows two cospectral trees with the smallest possible number of 


vertices. The spectrum is +3 val (all four simple) and 0 with multiplicity 4. The 


. 4(x? + 2 —3)(x? —a@ — 3). 


characteristic polynomial in this case is & 


Figure 6.5.5: Cospectral trees with the minimum number of vertices. 


E23: Figure 6.5.6 shows a pair of strongly regular cospectral graphs with 16 vertices 
and spectrum {61, 2°,—2°}. Some interpretation is necessary. The graph on the left is 
actually drawn on the torus, that is, the vertices on the outside edges in the same row 
or column are identified. In the graph on the right, any pair of vertices in the same row 
or column are joined. The graph on the right is actually L(/¢44) and hence is strongly 
regular with parameters (16,6, 2,2). Being cospectral, the one on the left (called the 
Shrikhande graph) must be strongly regular with the same parameters. 


a bedia 


e e 
a if 
g g 
ab edo 


Figure 6.5.6: A pair of strongly regular cospectral graphs. 


Eigenvalues and Graph Operations 


One method for constructing cospectral graphs is by using various graph operations. 


DEFINITIONS 


D26: The cartesian product of two graphs G and G2, denoted G; ® G2, has as 
vertex set all possible pairs (v1, v2) where v1 is a vertex of Gi and v2 is a vertex of Go. 
Two vertices are joined if, as ordered pairs, they are identical in one coordinate and 
adjacent in the other. 


D27: The coalescence of two (disjoint) graphs G; and G2 with distinguished vertices 
vy and v2, denoted G, - Go, is formed by identifying v; and v2, that is, the vertices vj; 
and v2 are replaced by a single vertex v adjacent to the same vertices in G, as v; and 
the same vertices in G2 as vo. 


FACTS 


F79: The straightforward extension of the cartesian product to an iterated product of 
more than two graphs is associative. 


F80: The eigenvalues of the product graph G, ® G2 are precisely all possible sums 
Ai + Ag, where A; is an eigenvalue of G and 2 is an eigenvalue of G2. 
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F81: Let G, and G2 be a pair of cospectral nonisomorphic graphs. For t = 0,1,...,m, 
we define H, by taking the cartesian product G; ®--- ® G, ® Gp ®--: ® Gg using ¢ 
copies of G; and m —t copies of Gj. Then the graphs H; are pairwise cospectral and 
nonisomorphic. 


F82: [Sc73] Let P(x) be the characteristic polynomial of the graph H. Then 
Pey-Gs (x) = Pe, (2) Pe, —ve (x) + Pey-v (x) Pa, (x) — £PG,-v, Pez-v2 


F83: [Sc73] As the number of vertices gets large, the probability that a tree has a 
cospectral mate goes to 1. 


EXAMPLES 


The first example of this type of construction was given by A. J. Hoffman. 


£24: (A. J. Hoffman in [Mo72]) Let G; and G2 be a pair of regular cospectral noniso- 
morphic graphs. Define H; by taking t copies of G and m—t copies of Gp and taking 
the complement, for t = 0,1,...,m. Then the H; are all cospectral, connected, regular, 
and nonisomorphic. 


E25: The Hamming graph H(d,n) is simply K,, ®--- @ Ky, where the number of 
factors is d. 


REMARK 


R3: The cartesian product has been generalized to the NEPS graph by D. Cvetkovié 
[CvDoSa95]. 


6.5.6 The Laplacian 


The Laplacian is an alternative to the adjacency matrix for describing the adjacent 
vertices of a graph. It has many interesting properties, and, although not as much 
information is known about the spectral properties of the Laplacian of a graph, more 
recent work has indicated that there is much more to be found. 


DEFINITIONS 


D28: The Laplacian of a graph is a square matrix whose rows and columns correspond 
to the vertices of a graph. A diagonal entry is the degree of the corresponding vertex; 
an off-diagonal entry is —1 if the corresponding vertices are adjacent and 0 otherwise. 
In other words, L = D — A, where D is the diagonal matrix of degrees of the vertices 
and A is the usual adjacency matrix. 


If the graph is regular, then D = rJ, and the eigenvalues of A and L are obtainable 
from each other. Thus for regular graphs the study of the adjacency matrix and the 
Laplacian are identical. 


D29: [Fi73] The algebraic connectivity of a connected graph whose Laplacian L 
has eigenvalues A, < Ag <--- < Ay, is defined to be Xo. 
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FACTS 


The oldest result about the Laplacian concerns the number of spanning trees of a graph. 
Let + = 7(G) be the number of spanning trees of a graph. Let L;,; be the matrix 
obtained by deleting the i-th row and j-th column from L. Also, let adj L be the 
adjoint of L. 


F84: 7(G) = (—1)'* det(L;,;). 
F85: adj(L) =7J. 


F86: The multiplicity of 0 as an eigenvalue of L is the number of connected components 
in the graph. 
F87: [];_.A; = n7(G). 


1=2 


F88: Since \2 = A3 =--:=An =n for Ky, T(Kn) = 1r"?. 


F89: The algebraic connectivity is positive if and only if the graph is connected. 
F90: A2(Gi & G2) = min{A2(G1), A2(G2)}. 


F91: [Ne00] If a graph G has diameter d, then A2(G) > 4. 


EXAMPLES 
E26: A2(P,) = 2(1 — cos(7/n)). 

E27: A2(Cp,) = 2(1 — cos(27/n)). 
E28: A2(Qn 


E29: do(K,) =n. 


E30: A2(Km,n) = min{m,n}. 


FURTHER READING 


A good introduction to further properties of the Laplacian is given by B. Mohar [Mo92]. 
Another excellent synopsis is by M. Newman [Ne00]. 
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INTRODUCTION 


Every graph gives rise to a matroid so every theorem for matroids has an immediate 
consequence for graphs, although many of these are easy to derive directly. On the other 
hand, numerous results for graphs have analogs or generalizations to matroids. This 
link between graph theory and matroid theory is so close that the famous graph theorist 
W. T. Tutte (1917-2002) wrote [Tu79]: “If a theorem about graphs can be expressed in 
terms of edges and circuits only it probably exemplifies a more general theorem about 
matroids.” This section provides an overview of the rich interaction between graph 
theory and matroid theory. 


6.6.1 Matroids: Basic Definitions and Examples 


The edge-sets of cycles in a graph and the minimal linearly dependent sets of columns 
in a matrix share many similar properties. Hassler Whitney (1907-1989) aimed to 
capture these similarities when he defined matroids in 1935 [Wh35]. 
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DEFINITIONS 
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D1: A matroid M isa pair consisting of finite set E(M/) (the ground set of M) anda 
collection C(M) of nonempty incomparable subsets of E(M), called circuits, such that 
if C, and C3 are distinct members of C(M) and e € Cy M Co, then there is a member 
C3 of C(M) such that C3 ie (Ci U C2) a {e}. 


NOTATION: Frequently, E(M/) and C(M) are abbreviated to E and C. 


D2: A subset of E is dependent if it contains a member of C and is independent 


otherwise. 


D3: A basis (or base) is a maximal independent set. 


D4: The matroid M, is isomorphic to the matroid M2, written M, = Mo, if there is 
a 1-1 function ¢ from E(M,) onto E(M2) such that C is a circuit of M, if and only if 
@(C) is a circuit of Mo. 


NOTATION: The collections of independent sets and bases of M are denoted by Z(M) 
and B(M), respectively. 


REMARK 


R1: It follows easily from the definition of a matroid that all bases are of the same 


cardinality. 


EXAMPLES 


E1: Three different classes of examples of matroids are given in Table 6.6.1. 


Table 6.6.1: Examples of matroids 


(0<m<n) 


MATROID GROUND SET CIRCUITS INDEPENDENT BASES 
M E(M) C(M) SETS, Z(M) B(M) 
M(G), cycle E(G), edge- edge-sets {IC E(G): For connected 
matroid of set of G of cycles I contains G: edge-sets 
graph G no cycle} of spanning 
trees 
M|A], vector column minimal {ICE:I maximal 
matroid of labels linearly labels a linearly linearly 
matrix A of A dependent independent independent 
over field F multisets multiset of sets of 
of columns columns} columns 
Uniform {1,2,...,n} {CCE: {I CE: {BCE: 
matroid, Umn |C| =m-+1} |I| < m} |B| = m} 
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E2: Let M be the matroid with E(M) = {1,2,...,6} and C(M) = {{1}, {5,6}, 
{3, 4, 5}, {3,4,6}}. Then M = M(G,) = M(G2) where G, and G2 are the graphs 
shown in Figure 6.6.1. Also B(M) = {{2,3, 4}, {2, 3, 5}, {2, 3, 6}, {2, 4,5}, {2,4, 6}}, and 
M = M{A] where A is the following matrix over R. 


12 3 4 5 
0 
1 
1 


oC oO 
ePreoo 


1 0 
0 1 
0 0 


Figure 6.6.1: Graphs G', and G2 yield the same matroid M. 


6 


E3: In Table 6.6.2, several classes of matroids are defined. A matroid M is in the 
specified class if it satisfies the indicated condition. 


Table 6.6.2: Some classes of matroids 


Class Condition 
graphic M = M(G) for some graph G 
representable over F | M = MA] for some matrix A over the field F 
binary representable over GF'(2), the 2-element field 
ternary representable over GF'(3) 
regular representable over all fields 


FACTS 
In each of the following, M is a matroid. 


F1: The unique smallest non-graphic matroid is U2,4. 
F2: If M is graphic, then M ~ M(G) for some connected graph G. 


F3: (Whitney’s 2-Isomorphism Theorem [Wh33]) Two graphs have isomorphic cycle 
matroids if and only if one can be obtained from the other by a sequence of the following 
operations: (i) choose one vertex from each of two components and identify the chosen 
vertices; (ii) the reverse of (i); (iii) in a graph that can be obtained from the disjoint 
union of two graphs G'; and G2 by identifying vertices u; and v; of G; with vertices u2 
and v2 of G2, twist the graph by identifying, instead, u; with v2 and ug with v1. 
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The following is an immediate consequence of the last fact. 
F4: A 3-connected loopless graph is uniquely determined by its cycle matroid. 
F5: If M is graphic, then M is regular. 


F6: JM is regular if and only if WM can be represented over the real numbers by a totally 
unimodular matrix, a matrix for which all subdeterminants are in {0,1,—1}. 


6.6.2 Alternative Axiom Systems 


Matroids can be characterized by numerous different axiom systems. Two examples 
of these systems follow. Others may be found, for example, in [Ox11]. Throughout, EF 
is a finite set and 2” is its set of subsets. 


Independent set axioms. A subset Z of 2” is the set of independent sets of a matroid 
on F if and only if 


(11) the empty set is in Z; 
(12) every subset of a member of TZ is in Z (Z is hereditary); and 


(13) if X and Y are in Z and Y has more elements than X, then there is an element e 
of Y — X such that X U {e} is in TZ. 


Basis axioms. A subset B of 2” is the set of bases of a matroid on E if and only if 
(B1) 6 is nonempty; and 


(B2) if By and Bz are in B and x € B, — Bo, then there is an element y of By — By 
such that (B, — {a}) U {y} € B. 


DEFINITIONS 
In all of the following, M is a matroid with ground set E. 


D5: If AC E, all maximal independent subsets of A have the same cardinality, the 
rank, r(A) (or ry(A)), of A. 


D6: The rank r(M) of the matroid M is the rank r(£) of its ground set. 
D7: A spanning set of M is a subset of F of rank r(M). 

D8: A hyperplane of M is a maximal nonspanning set. 

D9: The closure cl(X) of X is {x € E: r(X U {x}) = r(X)}. 

D10: A set Y is a flat (or closed set) if cl(Y) =Y. 

D11: A loop of M is an element e such that {e} is a circuit. 

D12: If {f,g} is a circuit, then f and g are parallel elements. 


D13: A simple matroid (or combinatorial geometry) is a matroid that has no 
loops and no parallel elements. 
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FACTS 


F7: If X is a set of edges of a graph G and G[X] is the subgraph of G induced by X, 
then ry(q)(X) = |V(G[X])| — k(GLX]) where k(G[X]) is the number of components of 
G|X]. 


F8: If X is a set of elements in a matroid M, then cl(X) = X U{e: M has a circuit C 
such thate € CC XU {e}}. 


F9: For a matroid M, let B*(M) = {E(M)-B: Be B(M)}. Then B*(M) is the set 
of bases of a matroid on E(M). 


EXAMPLE 


E4: For the graphs G; and G2 shown in Figure 6.6.1, r(M(G1)) = |V(Gi)| —1 = 3 
and r(M(G2)) = |V(G1)| — 3 = 3. In each matroid, cl({2,5}) = {1, 2, 5,6} and the last 
set is a flat of rank 2. 


6.6.3 The Greedy Algorithm 


Matroids have an important relationship to the greedy algorithm that makes them 
important in optimization problems. Kruskal’s algorithm for finding a minimum-cost 
spanning tree in a connected graph G is one of the best-known efficient algorithms in 
graph theory. This algorithm works precisely because the spanning trees of G form the 
bases of a matroid. 


Algorithm 6.6.1: The Greedy Algorithm for (Z,w) 


Let E be a finite set and Z be a nonempty hereditary subset of 2”. Let w 
be a real-valued function on EF. For X C F, let w(X), the weight of X, be 
dive x w(x), and let w(0) = 0. 


(i) Set Xo =@ and j = 0. 


(ii) If H — X; contains an element e such that X, U {e} © Z, choose such 
an element e;+1 of maximum weight, let X41 = X,; U{e;41}, and go to 
(iii); otherwise let X; = Bg and go to (iv). 


(iii) Add 1 to 7 and go to (ii). 


(iv) Stop. 


EXAMPLE 


E5: Let G be a connected graph with each edge e having a cost c(e). Define w(e) = 
—c(e). Then the greedy algorithm is just Kruskal’s algorithm and the result, Ba, is the 
edge-set of a spanning tree of minimum cost. 
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FACT 


F10: A nonempty hereditary set Z of subsets of a finite set F is the set of independent 
sets of a matroid on F if and only if, for all real-valued weight functions w on E, the set 
Be produced by the greedy algorithm is a maximal member of Z of maximum weight. 


6.6.4 Duality 


Matroid theory has an attractive theory of duality that extends both the concept of 
a planar dual of a plane graph and the notion of orthogonality in vector spaces. This 
duality means that every graph gives rise to another matroid in addition to its cycle 
matroid. 


DEFINITIONS 


D14: For a matroid M, the dual M* of M is the matroid on E(M) having 6*(M) as 
its set of bases (see Fact F9). 


D15: Circuits, bases, loops, and independent sets of M* are called cocircuits, cobases, 
coloops, and coindependent sets of M. 


D16: For a graph G, the matroid (M(G))* is the called the bond matroid of G and 
is denoted by M*(G). 


D17: A matroid M is cographic if M ~ M*(G) for some graph G. 


D18: A class of matroids is closed under duality if the dual of every member of the 
class is also in the class. 


EXAMPLES 


E6: ‘Table 6.6.3 specifies the duals of certain types of matroids. 


Table 6.6.3: Duals of some basic examples 


Matroid | M(G) for G plane Umn M{I,|D] for 
r xX n matrix [J,-|D] 


Dual M(G*) where G* Op nin M[-D? |In_+], 
is the dual of G* same order of column 
labels as [I,|D] 


E7: The graph G? in Figure 6.6.2 is the planar dual of the graph G in Figure 6.6.1. 
Observe that M(G7) is isomorphic to M(G,) under the permutation of E(G,) that 
interchanges 1, 3, and 4 with 2, 5, and 6, respectively. 
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Figure 6.6.2: Graph Gj is the planar dual of G1. 


FACTS 
For all matroids /: 


Fll: (M*)*=M. 
F12: The rank function of M* is r*(X) = |X|—r(M)+r(£- X). 


F13: The cocircuits of M are the minimal sets having nonempty intersection with 
every basis of M. 


F14: The cocircuits of M are the minimal nonempty sets C* such that |C*NC| #1 
for every circuit C of M. 


F15: For a graph G, the circuits of M*(G) are the bonds or minimal edge-cuts of G. 
In particular, the loops of M/*(G) are the isthmuses of G. 


F16: A graphic matroid is cographic if and only if it is planar (isomorphic to the 
cycle matroid of a planar graph). 


F17: Every row of the matrix [I,|D] is orthogonal to every row of [—D?|I,,_,]. 


F18: The following classes of matroids are closed under duality: uniform matroids, 
matroids representable over a fixed field F, planar matroids, and regular matroids. The 
classes of graphic and cographic matroids are not closed under duality. 


Table 6.6.4: Special sets and their complements in M and M* 


xX basis of M independent set of circuit of M 
E-—X | basis of M* spanning set of M* hyperplane of M* 


REMARKS 


R2: The fact that both the cycles and bonds of a graph are the circuits of a matroid 
means that cycles and bonds share many common properties. 


R3: Matroids in general do not have vertices. In a 2-connected loopless graph G, the 
set of edges meeting a vertex is a bond of G and hence is a cocircuit of M(G). Although 
M(G) will usually have many cocircuits that do not arise in this way, in many contexts, 
an appropriate matroid analog of a vertex is a cocircuit. 
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6.6.5 Matroid Union and Its Consequences 


The operation of matroid union, which was introduced by Nash-Williams [Na66], 
led to very straightforward proofs of two graph results whose original proofs were quite 
intricate. 


FACTS 


F19: Let My, M2,...,M, be matroids on a common ground set &. Then there is a 
matroid M, V Mj2V...V M, (the union of M1, Mz,...,M,) on E whose independent 
sets are all subsets of E of the form 1, UIgU...U TJ, such that I; € Z(M;) for all 7. 


F20: If M; has rank r;, then the rank of X in M, V M2 V...V My is 
min{)> ri(Y)+|xX -—Y|:Y CX}. 
i=1 


The following covering and packing results for matroids are easily proved by taking the 
union of a matroid with itself multiple times, although the original proofs preceded the 
introduction of the operation of matroid union. The second result is the dual of the 
first. 


F21: [Ed65] A matroid M has & disjoint bases if and only if, for every subset X of 
E(M), 
kr(X) + |E(M) — X| > kr(M). 


F22: [Ed65] A matroid M has k independent sets whose union is E(M) if and only if, 
for every subset X of E(M), 
kr(X) > |X|. 


The last two results have the following immediate consequences for graphs. 


F23: [Tu61, Na61] A connected graph G has k edge-disjoint spanning trees if and only 
if, for every partition 7 of V(G), the number of edges joining vertices in different classes 
of the partition is at least k(|z| — 1) where || is the number of classes in 7. 


F24: [Tu61] The edge-set of a graph G can be partitioned into k disjoint forests if and 
only if, for all subsets X of V(G), 


E(G[X])| 2 k(|X| — 1). 


F25: [Ed65] Let G be a connected graph. Players B and C alternately tag edges of 
G where an edge is destroyed if it is tagged by C' and made invulnerable to destruction 
if it is tagged by B. The goal for B is to tag all the edges of some spanning tree of G 
and the goal for C' is to prevent B from achieving this goal (by tagging all the edges of 
some bond). The following are equivalent: 


(i) Player C plays first and B can win against all possible strategies of C. 
(ii) G has two edge-disjoint spanning trees. 


(iii) For all partitions 7 of V(G), the number of edges joining vertices in different 
classes of the partition is at least 2(|| — 1). 
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6.6.6 Fundamental Operations 


Duality is one of the three basic operations for matroids. Two other basic operations, 
deletion and contraction, are defined in Table 6.6.5 below along with the operation of 
direct sum, a special case of matroid union which generalizes the operation of direct 
sum of vector spaces. Each of these operations generalizes an operation for graphs. 


Table 6.6.5: Three basic matroid constructions 


Matroid Ground Set c A. 
M\T, E(M)—T {CCE(M)—-T: | (IC E(M)-T: 
the deletion Cec(M)} Ie TZ(M)} 
of T from M 
M/T, E(M)-—T minimal non- {IC E(M)-T: 
the contraction empty members IUBreT(M)} 
of T from M of {C-T: for some Br 
CeC(M)} in B(M|T)} 
M, 6 Mo, E(M,)U E(M2) C(M,) UC(M2) {I, Ul, : 
direct sum I, € I(M;)} 
of M, and Mo, 
E(M,) N E(M2) = 0 


DEFINITIONS 


D19: The matroids M\T and M/T are also written as M|(E—T) and M.(£—T) and 
are called the restriction and contraction of M to E-T. 


D20: A matroid N is a minor of a matroid M if N can be obtained from M by a 
sequence of deletions and contractions. The minor N is proper if N 4 M. 


D21: A graph H is a minor of a graph G if H can be obtained from G by a sequence 
of edge deletions, edge contractions, and deletions of isolated vertices. 


NOTATION: For an element e of a matroid M, the matroids M\{e} and M/{e} are 
frequently written as M\e and M/e. 


EXAMPLES 


E8: M(G)\e = M(G\e), where G\e is the graph that is obtained from the graph G 
by deleting the edge e. 


E9: M(G)/e = M(G/e) where G/e is the graph that is obtained from the graph G by 
contracting the edge e, that is, by identifying the ends of e and then removing e. 


E10: Ifa graph H is a minor of a graph G, then the cycle matroid M(#) is a minor of 
the cycle matroid M(G). To see that the converse of this fails, note that, for the graphs 
G, and G2 in Figure 6.6.1, M(G) is a minor of M(G2) as the two matroids are equal. 
But G; is clearly not a minor of G9. 


E11: Unn\e = Umn-1 form #n, and Un n\e = Un-1.n-1- 
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E12: Unn/e = Um-in-1 for m #0, and Up n/e = Uon-1- 


E13: M|A)\e is the vector matroid of the matrix that is obtained by deleting column 
e from the matrix A. 


E14: Ife corresponds to a unit vector in A, then M[A]/e is the vector matroid of the 
matrix obtained by deleting both the column e and the row containing the one of e. 


E15: IfG and G2 are vertex-disjoint graphs, then 1/(G)®M(Gz2) is the cycle matroid 
of the graph that is obtained by taking the disjoint union of G; and Gg. Moreover, if v1 
is a vertex of G; and vy is a vertex of G2, then M(G,) @ M(G2) is also the cycle matroid 
of the graph that is obtained by identifying v; and v2, this graph being a l-sum of G1 
and Go. 


FACTS 
In the following, M, M,, and M2 are matroids and E(M,) 9 E(M2) = 0. 


F26: (M/T)* = M\T, and (M\T)* = M/T. (Deletion and contraction are dual 
operations.) 


F27: If X and ¥ are disjoint subsets of E(M), then M\X\Y = M\(XUY) = M\Y\X, 
M/X/Y = M/(X UY) =M/Y/X, and M\X/Y = M/Y\X. 


F28: If X C E(M) —T, then ryyr(X) = ru(X UT) — ru (7). 
F29: M, @ Mo = Mo @ M,. 


F30: Let N; and No be the rank-zero matroids on E(M)) and E(M2). Then M1 @Mp = 
(Mi @ No) V (Ni ® Ma). 


6.6.7 Connectedness, 2- and 3-Connectedness 
for Graphs and Matroids 


Although connectedness for graphs does not carry over to matroids, 2-connectedness 
and 3-connectedness do. 


DEFINITION 


D22: A matroid M is 2-connected if, for every two distinct elements e and f of M, 
there is a circuit containing {e, f}. 


TERMINOLOGY: 
For matroids, the terms “2-connected” and “connected” are used interchangeably. 
Another synonym that is also used is “nonseparable”. 


EXAMPLE 


E16: For the graphs G; and G» in Figure 6.6.1, M(G,) = M(G2) but G, is a connected 
graph and G2 is not. Thus, in general, for a graph G, one cannot tell from M(G) whether 
or not G is connected. 
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FACTS 


F31: Let G be a graph without loops or isolated vertices and assume that |V(G)| > 3. 
Then G is 2-connected if and only if, for every two distinct edges e and f of G, there is 
a cycle of G containing {e, f}. 


F32: A matroid M is 2-connected if and only if MW cannot be written as the direct 
sum of two matroids with nonempty ground sets. 


F33: A matroid is 2-connected if and only if its dual is 2-connected. 
F34: [Tu65] If M is 2-connected and e € E(M), then M\e or M/e is 2-connected. 


F35: [Le64] If M is 2-connected, then M is uniquely determined by the set of circuits 
containing some fixed element of E(M). 


By combining Facts F31 and F33, one obtains the following: 


F36: Let G be a graph without loops or isolated vertices and assume that |V(G)| > 3. 
Then G is 2-connected if and only if, for every two distinct edges e and f of G, there is 
a bond of G containing {e, f}. 


Bounds on the number of elements 


NOTATION: For a matroid M having a circuit and a cocircuit, let c(MZ) and c*(M) be 
the sizes of, respectively, a largest circuit and a largest cocircuit of M. Ife € E(M) and 
e is not a loop or a coloop, let ce(MZ) and c*(M) be the sizes of, respectively, a largest 
circuit of M containing e and a largest cocircuit of M containing e; and let d.(M) and 
dé(M) be the sizes of a smallest circuit of M containing e and a smallest cocircuit of 
M containing e. 


F37: [LeOx01] Let M be a 2-connected matroid with at least two elements. 
(i) If e is an element of M, then |E(M)| < (ce(M) — 1)(c2(M) — 1) +1. 
(ii) |E(M)| < ge(M)c*(M). 


F38: (Length-width inequality [Le79]) Let M be a regular matroid with at least two 
elements and suppose e € E(M). Then 


|E(M)| > (de(M) — 1)(d2(M) — 1) +1. 
The next two facts for graphs are immediate consequences of Fact F37. 


F39: Let u and v be distinct vertices in a 2-connected loopless graph G. Then |E(G)| 
cannot exceed the product of the length of a longest u—v path and the size of a largest 
bond separating u from v. 


F40: [Wu97] Let G be a 2-connected loopless graph with circumference c and let c* be 
the size of a largest bond. Then |E(G)| < $cc*. 


Wu [Wu00] showed that the graphs attaining equality in the last bound are certain 
series-parallel graphs, including cycles. Wu’s bound is sometimes better and sometimes 
worse than the following bound of Erdés and Gallai, whose hypotheses are slightly 
different. 


702 Chapter 6. Algebraic Graph Theory 


F41: [ErGa59] Let G be a simple graph with circumference c. Then 
|E(G)| < 5e(|V(@)| - 1). 


The last bound motivated the question whether Fact F37(ii) is true for matroids. This 
question was answered for graphs before it was answered for all matroids. The following 
is a generalization of Fact F40. 


F42: [NeRiUr99] Every 2-connected loopless graph with circumference c has a collection 
of c bonds such that every edge lies in at least two of them. 


The (matroid) dual of the last result is also true. 


F43: [Mc05] Every 2-connected loopless graph whose largest bond has size c* has a 
family of c* cycles so that every edge lies in at least two of them. 


The last result was proved as a partial answer to the following problem of Vertigan (in 
[Ox01]), which remains open in general. 


PROBLEM 


P1: Let M be a 2-connected matroid with at least two elements. Does M have a family 
of c(M) cocircuits such that every element is in at least two of them? 


2-sums and 3-sums 


As noted in Example E15, matroid direct sum generalizes the operation of 1-sum 
for graphs. The graph operation of 2-sum generalizes to all matroids, while 3-sum 
generalizes to binary matroids. 


DEFINITIONS 


D23: Let M, and M2 be 2-connected matroids on disjoint sets, each with at least 
three elements. Let p; and pg be elements of M4, and Mz, respectively. The 2-sum of 
M, and Mz with respect to p, and po is the matroid M, @2 M2 with ground set 
(E(M,) — {pi}) U (E(M2) — {p2}) for which the circuits are all circuits of M, avoiding 
pi, all circuits of M2 avoiding pz, and all sets of the form (Cy — {pi}) U (C2 — {pe}) 
where C; is a circuit of M; containing p;. 


D24: A matroid is 3-connected if it is 2-connected and cannot be written as a 2-sum. 


D25: Let M, and M2 be binary matroids each having at least seven elements. Suppose 
that E(M,)N E(M2) = T where T is a 3-element circuit in each of M, and M2, and 
that T does not contain a cocircuit of M, or Mj. The 3-sum of M, and Mg is the 
matroid on (E(M,)U E(M2)) —T whose flats are those sets F —T such that F'7 E(M;) 
is a flat of M; for each 7. 


EXAMPLES 


E17: Let G, and G2 be 2-connected loopless graphs and p; be an edge of G; for each 7. 
Let G be one of the two graphs that can be obtained from G and G2 by identifying 
pi with po and then deleting the identified edge, that is, G is a 2-sum of the graphs G 
and Go. Then M(G,) a2 M(G2) a M(G). 
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E18: Let G; and G2 be the graphs in Figure 6.6.3, where E(G1) M E(G2) = {1, 2, 3}. 
Then the graph G obtained by sticking G; and G2 together across the 3-cycle {1, 2,3} 
and then deleting {1, 2,3} is the 3-sum of the graphs G,; and Gz. The matroid M(G) 
is the 3-sum of the matroids M(G) and M(G%2). 


e 
1 
3 
2 
@ 
G, G, G 


Figure 6.6.3: M(G) is the 3-sum of M(G) and M(G2). 


FACTS 
F44: (M, ®2 M2)* = My 6 M5. 
F45: A matroid M is 3-connected if and only if M* is 3-connected. 


F46: Let G be a graph without isolated vertices and suppose that |V(G)| > 4. Then 
M(G) is 3-connected if and only if G is 3-connected and simple. 


REMARK 


R4: Tutte [Tu66] defined a notion of k-connectedness for matroids for all k > 2 that 
includes the cases considered above. It has the advantage of being preserved under 
matroid duality but the disadvantage that it departs from graph k-connectedness when 
k > 4. Several authors [Cu81, InWe81, Ox81] introduced the notion of vertical k- 
connectedness for matroids, which generalizes k-connectedness for graphs but need not 
be preserved under duality. 


6.6.8 Graphs and Totally Unimodular Matrices 


One of the most significant achievements of matroid theory is Seymour’s result show- 
ing that all totally unimodular matrices are obtainable from graphs and one additional 
special matroid. This result leads to a polynomial-time algorithm to test whether a 
given matroid is totally unimodular, which is particularly useful in combinatorial opti- 
mization (see, for example, [Sc86]). Recall that a matroid is regular if and only if it can 
be represented by a totally unimodular matrix. 


EXAMPLES 


E19: Let G bea graph. Arbitrarily orient the edges of G and let D be the vertex-edge 
incidence matrix of the resulting directed graph. Then D is a totally unimodular matrix 
that represents M(G) over all fields. 
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E20: [Bi77] Consider the matrix A over GF'(2) whose columns consist of the ten 5- 
tuples with exactly three ones. Let Rio be the matroid represented by A. Then Rip = 
Rio. Moreover, if e is an element of Rig, then Rio\e = M(K3.3) and Rio/e = M*(K3.3). 


FACT 


F47: [Se80] The class of regular matroids is the class of matroids that can be con- 
structed by direct sums, 2-sums, and 3-sums from graphic matroids, cographic matroids, 
and copies of Rio. 


6.6.9 Excluded-Minor Characterizations 


The Kuratowski-Wagner Theorem [Ku30, Wa37] that a graph is planar if and only 
if it has no minor isomorphic to Ks or K3,3 has a number of extensions for graphs and 
matroids. The search for such results is currently the most active area of research in 
matroid theory. 


DEFINITIONS 


D26: A class of matroids is minor-closed if every minor of a member of the class is 
also in the class. 


D27: An excluded minor of a minor-closed class is a matroid for which every proper 
minor is in the class yet the matroid itself is not. 


EXAMPLES 


E21: The class of simple matroids is not minor-closed since it contains the cycle 
matroid of a 3-edge cycle but not the cycle matroid of a 2-edge cycle. 


E22: The following classes of matroids are minor-closed: graphic matroids, cographic 
matroids, uniform matroids, matroids representable over a fixed field, regular matroids, 
and planar matroids. 


E23: Given a finite set E of points in the plane and a collection of lines (subsets of E 
with at least three elements), no two of which share more than one common point, there 
is a matroid with ground set E whose circuits are all sets of three collinear points and 
all sets of four points no three of which are collinear. Geometric representations of 
two such matroids are shown in Figure 6.6.4, where the reader is cautioned that these 
diagrams are not to be interpreted as graphs. Each matroid depicted has ground set 
{1,2,...,7}. On the left is the non-Fano matroid, F7. It differs from the Fano 
matroid, F7, on the right by the collinearity through 4, 5, and 6 in the latter. Neither 
of these two matroids is graphic. 


E24: ‘Table 6.6.6 specifies the collections of excluded minors for certain classes of 
matroids. The results in the last two rows of the table were proved in three landmark 
papers of Tutte [Tu58, Tu58a, Tu59]. The characterization of ternary matroids was 
proved independently by Bixby [Bi79] and Seymour [Se79]. 
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Figure 6.6.4: (a) The non-Fano matroid, F;. (b) The Fano matroid, F7. 


Table 6.6.6: Excluded-minor characterizations of certain classes of matroids 


Class Excluded minors Class Excluded minors 
uniform Uo, @ Oia ternary Ue5,U3,5, Fy, FF 
binary Usa graphic Uo.4, F7, F7, M* (5), M*(K3,3) 
regular Uz4,F7, FF cographic Uo4, Fy, FF, M (Ks), M(K3,3) 
FACTS 


F48: [RoSe04] For every minor-closed class of graphs, the set of excluded minors is 
finite. 


F49: [La58] For every field F of characteristic 0 so, in particular, for Q, R, and C, the 
class of matroids representable over F has an infinite set of excluded minors. 


CONJECTURES 
The following two conjectures are the main unsolved problems in matroid theory. 


C1: (Rota’s Conjecture [Ro71]) For all finite fields F, there is a finite set of excluded 
minors for the class of F-representable matroids. 


C2: For all finite fields F, if MM is some minor-closed class of matroids all of which are 
F-representable, then there is a finite set of excluded minors for M. 


REMARKS 


R5: Fact F48 is probably the deepest result ever proved in graph theory, appearing in 
the twentieth paper of a very difficult series. The sixteenth paper of that series [RoSe03] 
proves a very powerful structure theorem for graphs that is the main tool in the proof 
of Fact F48. Fact F49 shows that Fact F48 does not extend to matroids. The two 
conjectures above propose two natural classes of matroids to which Fact F48 may be 
extendable. 


R6: From Table 6.6.6, if g € {2,3}, then the set of excluded minors for the class 
of GF(q)-representable matroids is finite. Geelen, Gerards, and Kapoor [GeGeKa00] 
proved that the same is true for gq = 4, there being exactly seven excluded minors in 
this case. 
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R7: Rota’s Conjecture is open for all prime powers q exceeding 4. Some recent 
progress on this and on Conjecture C2 has been made by Geelen and Whittle [GeWh02] 
and by Geelen, Gerards, and Whittle [GeGeWh02, GeGeWh06, GeGeWh07]. These 
three authors have been working together on Conjectures Cl and C2 for more than a 
decade. A survey of their work up through 2006 appears in [GeGeWh07a] and more 
recent progress is discussed in Chapter 14 of [Ox11]. In 2008, Geelen [Ge08] announced 
that they had proved the structure theorem for binary matroids that is the analog of 
Robertson and Seymour’s main structure theorem for graphs. In 2009, Geelen [Ge09] 
announced that they had succeeded in using the structure theorem to prove Conjecture 
C2 for binary matroids, that is, for F = GF(2). The papers containing these results 
are currently in preparation. 


6.6.10 Wheels, Whirls, and the Splitter Theorem 


Tutte [Tu61] identified wheels as the basic building blocks of 3-connected simple 
graphs. Subsequently, he generalized that result to matroids [Tu66]. The Splitter The- 
orem, a powerful generalization of the last result, was proved for matroids by Seymour 
[Se80] and, independently, for graphs by Negami [Ne82]. 


DEFINITIONS 


D28: For n > 2, the wheel W,, is the graph that is formed from an n-cycle C;, by 
adding a new vertex and joining this by a single edge (a spoke) to every vertex of the 
rim Ch. 


D29: For r > 2, the rank-r whirl W" is the matroid on the edge set of W, whose set 
of circuits consists of all the cycles of W, except the rim, together with all sets of edges 
consisting of the rim plus a single spoke. 


D30: If M and N are matroids, then M has an N-minor if M has a minor isomorphic 
to N. 


D31: An n-spike with tip p is a rank-n matroid whose ground set is the union of n 
three-element circuits C1, C2,...,C, all containing a common point p such that, for all 
k <n-—1, the union of any & of Cy, Co,...,C, has rank k +1. 


EXAMPLES 


E25: Figure 6.6.5 shows the graph W3, which is clearly isomorphic to K4, together 
with geometric representations of the matroids M(W3) and W3. The line {4,5,6} in 
M(Ws3) corresponds to the rim of W3. 


E26: Both the Fano and non-Fano matroids are examples of 3-spikes. 
E27: The unique rank-r binary spike is the vector matroid of the matrix [J,-|I¢|1] over 


GF (2) where I¢ is the matrix obtained by interchanging the zeros and ones in the r x r 
identity matrix [,, and 1 is the column of all ones. 
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4 4 


Figure 6.6.5: (a) The graph W3. The matroids (b) M(W3) and (c) W?. 


FACTS 


F50: [Tu61] Let G be a simple 3-connected graph. Then, for every edge e of G, neither 
G\e nor G/e is both simple and 3-connected if and only if G is isomorphic to a wheel 
with at least three spokes. 


F51: [Tu66] Let M be a 3-connected matroid. Then, for every element e of M, neither 
M\e nor M/e is 3-connected if and only if M is isomorphic to M(W,.) or W" for some 
T 23% 


F52: (The Splitter Theorem [Se80]) Let M and N be 3-connected matroids such that 
N is a minor of M with |E(N)| > 4, and M is neither a whirl nor the cycle matroid 
of a wheel. Suppose that if N = W?, then M has no W?-minor while if N = M(W3), 
then M has no M(W4)-minor. Then there is a sequence Mp, My,..., M;, of 3-connected 
matroids such that Mp = M; M, = N; and, for all 7 in {1,2,...,n}, M; is a single- 
element deletion or a single-element contraction of Mj_1. 


The statement above of the Splitter Theorem is a slight strengthening — due to Coullard 
[Co85] (see also [CoOx92]) — of Seymour’s original result. The Splitter Theorem has 
numerous applications for both graphs and matroids. It played a key role in the proof 
of Fact F47 and can also be used to derive the following results, the first two of which 
preceded the Splitter Theorem. 


F53: [Wa60] Let G be a simple 3-connected graph having no K5-minor. Then either G 
has no Hg-minor or G © Hg where Hg is the 4-rung Mobius ladder shown in Figure 6.6.6. 


Figure 6.6.6: The 4-rung Mobius ladder, Hg. 


F54: [Ha43] Let G be a simple 3-connected graph. Then G has no K3,3-minor if and 
only if either G is planar or G = K;. 


F55: [Se80] Let M be a 3-connected binary matroid. Then M has no Fy-minor if and 
only if M is regular or M = Fe. 
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F56: Let G be a simple 3-connected graph. Then G has no W4-minor if and only if 
G = Ws. 


F57: [Ox89] Let G be a simple 3-connected graph. Then G has no Ws-minor if and 
only if 
(i) G is isomorphic to a simple 3-connected minor of one of the four graphs in Fig- 
ure 6.6.7; or 


(ii) for some k > 3, the graph G is obtained from K3,, by adding up to three edges 
joining distinct pairs of vertices in the 3-vertex class of the bipartition. 


4H AN AX 


Figure 6.6.7: Four graphs with no 5-wheel minor. 


The last fact and a result of Gubser [Gu93] motivated the following result on unavoid- 
able minors for graphs. Fact F59 generalizes Fact F58 to matroids. Fact F60 is an 
immediate consequence of Fact F37(ii). 


F58: [OpOxTh93] For every integer n > 3, there is an integer & such that every 3- 
connected simple graph with at least k edges has a minor isomorphic to W,, or [3 n. 


F59: [DiOpOxVe97] For every integer n > 3, there is an integer & such that every 
3-connected matroid with at least k elements has a minor isomorphic to U2 ,,Un—2,.n, 
M(K3.n),M*(K3.n),M(Wn), W”, or an n-spike. 


F60: For all positive integers n, a 2-connected matroid with more than sn? elements 
has a minor isomorphic to U1, or Un—1n. 
The next result generalizes a well-known consequence of Euler’s Polyhedron Formula, 
namely, that if G is a simple planar graph, then |E(G)| < 3|V(G)| — 6. Fact F62 is 
a far-reaching matroid generalization of Fact F61 that hints at the matroid structure 
theorem discussed in Remark R7. 
F61: [Ma67] For every integer n exceeding one, if G is a simple graph with no K,,- 
minor, then |E(G)| < (2" — 1)|V(G)|. 
F62: (Growth-Rate Theorem [GeKuWh09]) For a minor-closed class M of matroids, 
one of the following holds: 
(i) there is a real constant c; such that |E(M/)| < cir(M) for all simple matroids 
in M; 
(ii) M contains the class of graphic matroids and there is a real constant cy such that 
|E(M)| < c9(r(M))? for all simple matroids M in M; 
(iii) there is a prime power q and a real constant cs such that M contains the class of 
GF(q)-representable matroids and |E(M)| < c3q’™) for all simple matroids M 
in M; or 


(iv) M contains all simple rank-2-matroids. 
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6.6.11 Removable Circuits 


A result of Mader gave conditions under which a simple k-connected graph has a 
cycle whose edges can be deleted without destroying k-connectedness. The natural 
matroid generalization of this fails for k = 2 even for cographic matroids. However, 
loose analogs of Mader’s result hold for 2- and 3-connected matroids and these give new 
results for graphs. 


DEFINITIONS 


D32: A cycle C of a k-connected graph G is removable if the graph obtained from 
G by deleting all the edges of C' is k-connected. 


D33: For k in {2,3}, a circuit D of a k-connected matroid M is removable if M\D 
is k-connected. 


FACTS 


F63: [Ma74] If G is a simple k-connected graph with minimum degree at least k + 2, 
then G' has a removable cycle. 


The hypothesis of Mader’s result implies that |E(G)| > $(k + 2)|V(G)|. The next two 
facts show that imposing appropriate lower bounds on the number of elements in a 
matroid guarantees the existence of removable circuits. 


F64: [LeOx99] Let M be a 2-connected matroid with at least two elements and C’ be 
a largest circuit of M. If |E(M)| > 3r(M)+3-—-c(M), then M has a circuit C that is 
disjoint from C’ such that M\C is 2-connected and r(M\C) = r(M). In particular, if 
r(C’) = r(M) and |E(M)| > 2r(M) + 2, then M has a removable circuit. 


F65: [LeOx99a] Let M be a 3-connected matroid with at least two elements and C’ 
be a largest circuit of M. If 

3r(M)+1 when c(M) = r(M) +1, 
4r(M)+1-—c(M)_ otherwise, 


then M has a circuit C that is disjoint from C’ such that M\C is 3-connected and 
r(M\C) =r(M). 


The next two facts are obtained by applying the last two results to graphs. 


F66: Let G be a 2-connected loopless graph and C” be a largest cycle in G. If 
|E(G)| > 3|V(G)| — c(G), then G has a removable cycle having no common edges with 
C’. In particular, if G is hamiltonian and |E(G)| > 2|V(G)|, then G has a removable 
cycle. 


F67: Let G be a simple 3-connected graph and C” be a largest cycle of G. Suppose 
that 
3|V(G)| — 2 if G is hamiltonian, 


|E(G)| = ae —3-—c(G) otherwise. 


Then G has a cycle C that has no common edges with C” such that G\C is 3-connected. 
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F68: [GovaMc97] Let G be a 2-connected graph with minimum degree at least four. 
If G has no minor isomorphic to the Petersen graph, then G has two edge-disjoint 
removable cycles. 


F69: [Mc05a] Let G be a 2-connected graph that is not a multiple edge. If G has no 
minor isomorphic to Ks, then G has a bond C* such that G/C* is 2-connected. 


For 2-connected graphs, the condition that the graph is simple in Mader’s result (F63) 
can be replaced by a higher bound on the minimum degree. 


F70: [Si98] Let G be a 2-connected graph with minimum degree at least five. Then G 
has a removable cycle. 


Example E31 below shows that the last result does not generalize to all matroids. The 
next result implies that it generalizes to regular and hence cographic matroids and 
prompts the problem as to whether it extends to binary matroids. 


F71: [GoJa99] Let M be a 2-connected binary matroid in which every cocircuit has at 
least five elements. If M does not have minors isomorphic to both F7 and F7, then M 
has a removable circuit C such that r(M\C) = r(M). 


REMARKS 


R8: The last sentence of Fact F66 is easily deduced directly, but the result in the case 
when G is non-hamiltonian seems far less obvious. 


R9: Facts F64 and F65 can also be applied to the bond matroids of, respectively, 
2-connected loopless graphs and 3-connected simple graphs to give necessary conditions 
for such a graph G to have a bond C* for which G/C* is, respectively, 2-connected and 
loopless, or 3-connected and simple. 


R10: Arthur Hobbs provided much of the impetus for the study of removable cycles 
by asking whether every 2-connected Eulerian graph with minimum degree at least four 
contains a removable cycle. 


EXAMPLES 


E28: [LeOx99] Consider the simple graph that is constructed as follows: begin with 
K5,5 having as its two vertex classes {1, 2,3, 4,5} and {6,7, 8,9, 10}; for every 3-element 
subset X of {1,2,3,4,5} and of {6,7,8,9,10}, add two new vertices vx and wx each 
joined to all the members of X and to nothing else. Then the resulting graph G is 
2-connected, having every cycle of length at least four and having every bond of size at 
least three. Thus M*(G) is simple and 2-connected, having every cocircuit of size at 
least four. But 1/*(G) has no removable circuit because G has no bond C* for which 
G/C* is 2-connected. Thus the generalization of Fact F63 to cographic matroids fails 
when k = 2. 


E29: Jackson [Ja80] and, independently, Robertson (in [Ja80]) answered Hobbs’s ques- 
tion negatively by producing the modified Petersen graph G) in Figure 6.6.8(a). 
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(a) G, (b) G, 


Figure 6.6.8: Neither M(G) nor M*(G2) has a removable circuit. 


E30: [GovaMc97] For the dual problem, the graph G2 in Figure 6.6.8(b) is 2-connected 
but has no bond C* such that G2/C* is 2-connected. This motivated a conjecture, which 
McGuinness proved in Fact F69. 


E31: For r > 3, the uniform matroid U;,2, is 2-connected, has all its cocircuits and 
circuits of cardinality r+ 1, and has no removable circuits. 


PROBLEMS 


P2: [GoJa99] Is there an integer ¢ such that every 2-connected binary matroid in which 
every cocircuit has at least ¢ elements has a removable circuit? 


P3: [GoJa99] If M is a 2-connected binary matroid in which every cocircuit has at least 
5 elements, then does M have a removable circuit? 


6.6.12 Minimally k-connected Graphs and Matroids 


For k > 2, a k-connected graph for which no single-edge deletion is k-connected has 
many vertices of degree k. For & in {2,3}, this fact has some matroid analogs that lead 
to new graph results. 


DEFINITIONS 


D34: For k > 2, a k-connected graph G is minimally k-connected if no single-edge 
deletion of G is k-connected. 


D35: For k in {2,3}, a k-connected matroid M is minimally k-connected if no 
single-element deletion of M is k-connected. 


D36: Let M be a 2-connected matroid. A cocircuit C* of M is nonseparating if 
M\C* is 2-connected. 


EXAMPLES 
E32: Ifm>k > 2, then Kym is minimally k-connected. For all n > 3, the n-spoked 


wheel W,, is minimally 3-connected. The cycle matroids of K3, and W, are minimally 
3-connected matroids. 
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E33: The duals of the matroids F7 and F, are both minimally 3-connected. 


FACTS FOR ARBITRARY CONNECTIVITY 


F72: [Ma72] For all k > 2, every cycle of a minimally k-connected graph meets a vertex 
of degree k. 


F73: [Ma79] For all k > 2, the number of vertices of degree k in a minimally k- 
connected graph G is at least 
(k — 1)|V(G)| + 2k 
2k—-1 : 


F74: [Ox81b] For all k > 2, the number of vertices of degree k in a minimally k- 


ne |E(G)|=|V(G)|+1 
connected graph G is at least —“~~S23——. 


REMARKS 


R11: Fact F73 was proved when k = 2 by Dirac [Di67] and Plummer [P168], indepen- 
dently, and when & = 3 by Halin [Ha69]. The same authors proved Fact F76 below. 


R12: The bound in Fact F74, which was obtained from Fact F71 by using an elementary 
matroid argument, frequently sharpens the bound in Fact F72. 


R13: [Ma96] The graph that is obtained from a path P of length three by adding 
three vertices joined to each vertex of the path is minimally 4-connected but has a cycle 
meeting only one vertex of degree 4. Thus Fact F72 cannot be sharpened in general 
although it can be improved for k < 3. 


FACTS FOR SMALL CONNECTIVITY 


F75: If M is a 3-connected matroid and M = M(G) for some loopless graph G without 
isolated vertices, then a subset C* of E(M) is a nonseparating cocircuit of M if and 
only if C™ is the set of edges meeting at some vertex of G. 


F76: [Di67, P168, Ha69] For k in {2,3}, every cycle of a minimally k-connected graph 
meets at least two vertices of degree k. 


F77: [Ox81la, Ox81b] For k in {2,3}, let M be a minimally k-connected matroid with 
at least four elements. Then 


(i) every circuit of M meets at least two k-element cocircuits; and 

(ii) M has at least a) k-element cocircuits. 
F78: [Wu98] Let M be a minimally 3-connected binary matroid with at least four 
elements. Then 

(i) every circuit of M meets at least two 3-element nonseparating cocircuits; and 


r*(M)+2 
2 


(ii) M has at least 3-element nonseparating cocircuits. 
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F79: [Ha69a] Let G be a minimally 3-connected graph. Then aIV(e)| < |E(G)|. 
Moreover, 


2\V(G)|-2 if |V(G) 


Es fee —~9 if |V(G) 


The only graphs attaining equality in these bounds are W,, for 3 <n < 6 and K3,, for 
m > 4. 


F80: [Ox81a] Let M be a minimally 3-connected matroid with at least four elements. 
Then 

2r(M) if r(M) <5; 
3r(M)—6 if r(M)>6. 


The only binary matroids attaining equality in these bounds are M(W,,) for 3 <n <6 
and M(K3,m) for m > 4. 


On combining Fact F73 and Fact F77(ii) and using a small amount of additional argu- 
ment, one gets the following: 


F81: [Ox81b] Let G be a minimally 3-connected graph with n3 vertices of degree 3. 


Then 
2|V(G)|+7 when JV(@I < |E(G)| < 9|V(G)|-3, 
na > 5 Diary = 5 , 
a> PQ ae es =3 < |E(G)| < 3|V(G)| -9. 


9|V(G)| 
5 


Applying Fact F77(ii) and Fact F78(ii) to cographic matroids gives the following: 
F82: Let G be a graph. 


(i) If G is 2-connected and loopless having no single-element contraction that is 2- 
connected and loopless, then the number of 2-cycles in G is at least |V(G)]. 


(ii) Let G be a simple 3-connected graph for which no single-edge contraction is both 
simple and 3-connected. Then the number of 3-cycles in G such that G/C is 


2-connected and loopless is at least Weber 


6.6.13 Conclusion 


Many areas of the interaction between graphs and matroids have not been discussed 
above. The most notable omission relates to the Tutte polynomial and, in particular, 
to colorings and flows. The interested reader is referred to the surveys in [BrOx92] and 
[We99]. In spite of this omission, numerous examples of this interaction are provided 
above. These include examples of matroid results that produce new graph results when 
applied to graphic or cographic matroids, graph results that have generalizations or 
analogs for matroids, and graph results that arise by viewing graphs from a matroid 
perspective. The already strong ties between matroid theory and graph theory are 
continuing to grow, most notably with the ongoing work of Geelen, Gerards, and Whittle 
generalizing Robertson and Seymour’s Graph Minors Project to matroids representable 
over finite fields. 
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Glossary for Chapter 6 


adjacency matrix — of a simple graph G: the 0-1 matrix Ag whose rows and columns 
correspond to the vertices of G, with an entry being 1 if and only if the corresponding 
row and column vertices are adjacent. 

__, [*-order: inductively defined with Aj = J as the identity matrix, A; = A as the 
usual adjacency matrix, and A; as the matrix with 1 in the (7,7) position if the 
corresponding vertices are at distance | from each other and 0 otherwise. 

algebraic connectivity of a graph whose Laplacian has the eigenvalues A; <--- < Xn: 
the eigenvalue Az. 

almost transitive automorphism group — for an infinite graph G: a group of auto- 
morphisms of G that acts with only finitely many orbits. 

almost transitive graph: a graph G whose full automorphism group Aut(G) is almost 
transitive. 

s-arce: a directed walk in a graph of length s in which consective edges are distinct. 

arc-transitive graph: a graph G (undirected) whose automorphism group induces a 
transitive group action on the set of ordered edges of G. 

asymmetric graph: a graph whose automorphism group is trivial. 

automorphism group — of a graph G: the set of all automorphisms of a graph, made 
with the operation of composition into a group, usually denoted Aut(G). 

automorphism — of a graph: an isomorphism of the graph onto itself. 

___, bounded ~— for an infinite graph G: an automorphism such that there is a uniform 
bound on the distances between every vertex and its image. 

basic figure: a vertex-disjoint union of elementary figures. 

basis — of a matroid M: a maximal set containing no circuit of M. 

bicycle — in a graph: a subgraph that is both a circ and a cut. 

binary vector — representing a subset E’ of edges in an undirected graph: a row vector, 
whose i is 1 if the i** edge of the graph is in E’ and is otherwise 0. 

block — of objects under a permutation group action: a subset B of the set X of objects 
on which a permutation group P acts, such that for every permutation 7 € P, the 
image 7(B) either coincides with B or is disjoint from B. 

___, nontrivial: a block other than 9, a singleton set, or the entire set of objects on 

which a permutation group is acting. 
bond — of a graph G: a minimal set of edges whose deletion from G increases the 
number of connected components. 
branch, — at a vertex v of a tree T: a maximal subtree having v as a leaf. 
branch, — of a spanning tree: an edge of the spanning tree. 
cage: a smallest 3-valent graph with a given girth. 
Catalan numbers: the sequence of numbers defined by the recursion 
Co=1, Cy = CoCn-14+CiCn_2+-::+Cyn_-1Co forn>1 
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Cayley digraph — for a group A with connection set X of elements: the graph whose 
vertices are the elements of group A and such that, for each element a € A and each 
connection x € X, there is a directed edge from vertex a to vertex ax. Commonly 
denoted C(.A, X). 

Cayley graph: any graph isomorphic to the underlying undirected graph of a Cayley 
digraph. 

Cayley graphy: a Cayley digraph. 

Cayley graphs: where the connection set X is stipulated to be a generating set. 

Cayley graphy: where the connection set X is stipulated to be balanced, which means 
that 2 € X if and only if x-! € X. 

characteristic polynomial — of a graph: the determinant det(xJI— A) of its adjacency 
matrix. 

chord — of a spanning tree: an edge of the cotree. 

ClI-graph: a Cayley graph C(A,) such that whenever C(A,%¥) = C(A, ¥’), there 
exists an automorphism a € Aut(A) such that X’ = a(X). 

CI-group: a group A such that every Cayley graph on A is a Cl-graph. 

circ — in a graph: a circuit or union of edge-disjoint circuits of the graph. 

circuit — in a graph: a subgraph isomorphic to any of the cycle graphs Cy. 

__, directed — in a directed graph: a circuit in which all the edges are oriented in 
the same direction. 

___, removable - of a k-connected matroid M: for k in {2,3}, a circuit of M such 
that the deletion of C from M is k-connected. 

circuit matrix — of a graph (directed or undirected): the matrix in which each row is 
a circuit vector, with one row for each circ in the graph. 

circuit space — of a directed graph G: the set of all circuit vectors and their linear 
combinations over the real field, denoted by C(G). 

circuit subspace — of an undirected graph G: the set of all circs of the graph, denoted 
by C(G). 

circuit vector — of an undirected graph: the binary m-vector representing a circ of the 
graph. 

circuit vector — in a directed graph: an m-vector representing a circ of the graph; the 
signs of the elements in the vector depend on the orientation assigned to each of the 
circuits in the circ. 

circulant graph: an undirected Cayley graph on the cyclic group Z,. 

closed under duality — of a class M of matroids: the dual of every member of M is 
also in M. 

closure — of a set X in a matroid M: the maximal subset of E(M) that contains X 
and has the same rank as X. 

coalescence — of two (disjoint) graphs G, and G2, with distinguished vertices v; and 
v1: the graph formed from their union by identifying the vertices v1; and v1; also 
called amalgamation at a vertex. 

cocircuit — of a matroid M: a circuit of the dual matroid of M. 

cocktail party graph CP(n): the regular graph of degree 2n — 2 with 2n vertices; 
another name for the n-dimensional octahedral graph. 

complement of a subgraph G’ — ina graph G: the graph G” = (V(G), E(G)—E(G’)). 

component — of a graph: a maximal connected subgraph. 

conjugate subgroups — in a group G: subgroups H,; and H2 for which there exists a 
subgroup H such that H; = HH2H!. 

connected graph: a graph in which there is a path between every pair of vertices. 

connection set: see Cayley graph. 
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contraction — of a set T from a matroid M: the matroid M/T on E(M) — T whose 
circuits are the minimal nonempty sets in {C —-T:C €C(M)}. 

cospanning tree — of a graph G with respect to a spanning tree T: the complement 
of T in G; this is generally called the cotree of T. 

cotree — of a spanning tree T in a graph G: the complement of T in G. 

cut matrix — of a graph (directed or undirected): the matrix in which each row is a 
cut vector and its number of rows is equal to the number of cuts in the graph. 

cut vector — in a directed graph: the m-vector representing a cut of the graph; the 
signs of the elements in the vector depend on the cut orientation. 

cut vector — in an undirected graph: a binary m-vector representing a cut of the graph. 

cut (Vj, V2) — in a graph G = (V,E): the set of edges with one end vertex in V, and 
the other in Vp = V — Vj. 

cutset,; — in a connected graph G: a set of edges whose removal increases the number 
of components. 

cutset, — in a connected graph G: a cut whose removal results in a graph with exactly 
two components. 

___, directed — of a directed graph: a cut in which all the edges are oriented in the 
same direction. 

cutset space or cutset subspace, — of a directed graph G: the set of all cut vectors 
of G and their linear combinations over GF'(2). 

cutset space or cutset subspace, — of a directed graph G: the set of all cut vectors 
of G and their linear combinations over the reals. 

cycle matroid — of a graph G: the matroid on the edge-set of G whose circuits are the 
edge-sets of the cycles of G. 

degree — of a vertex: the number of edges incident on that vertex. 

deletion — of a set T from a matroid M: the matroid M\T on E(M)—T whose circuits 
are the circuits of M contained in E(M) — T. 

dependent set — of a matroid M: a set containing a circuit of M. 

digraph, labeled: a digraph with labels, typically v1, v2,...,Un, assigned to the ver- 
tices. Two labeled digraphs with the same set of labels are considered the same only 
if there is an isomorphism from one to the other that preserves the labels. 

direct sum — of matroids M; and M2 on disjoint sets: the matroid on E(M))U E(M2) 
whose circuits consist of every set that is a circuit of M, or of Mo. 

distance-regular graph — with parameters PF 55 0 <i,j,k <d: a graph such that for 
any pair of vertices whose distance is k, the number of vertices at distance 7 from 
the first and distance j from the second is Di; 

divisor of a graph G - with respect to any product operation q: either of the coordinate 
factors A or B, when graph G is expressed as a graph product AgB. 

___, proper: a divisor of a graph other than itself and the trivial graph Kk). 

doubly transitive group: a permutation group that acts transitively on ordered pairs 
of elements. 

dual, of a graph G (Poincare dual): a graph obtained from a cellular imbedding of G 
on a surface, by drawing a dual vertex in each region, and then drawing a dual edge 
through each edge of G (see §7.6), so as to join dual vertices. 

dual, of a graph G (Whitney dual): see Definition D51 of §6.4. 

dual M* — of a matroid M: the matroid on E(M) whose set of bases consists of the 
set of complements of bases of M. 

edge-automorphism: an edge-isomorphism from a graph to itself. 

edge-group: the permutation group on the edge-set of a graph consisting of the edge- 
automorphisms. 
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edge-isomorphism: a bijection from the edge-set of one graph to the edge-set of 
another graph that maps every pair of adjacent edges to a pair of adjacent edges. 

edge-transitive graph: a graph whose automorphism group induces a transitive group 
action on the edge set of the graph. 

eigenvalues — of a graph: the roots of its characteristic polynomial. 

eigenvalues-diameter (lower) bound — for the number of eigenvaues of a graph G: 
diam(G) + 1. 

elementary abelian p-group: a group of the form Z7. 

elementary figure: a subgraph isomorphic either to a K2 or to a cycle graph C,. 

empty graph: a graph with no edges. 

end — of an infinite graph: an equivalence class of rays such that no two subrays can 
be separated by a finite subgraph. 

endomorphism — of a graph: a homomorphism of the graph to itself. 

Euler totient function ¢(n): the number of elements between 0 and n that are 
relatively prime to n. 

even graph: a graph in which the degree of every vertex is even. (Such graphs are 
more commonly called eulerian graphs.) 

excluded minor — of a minor-closed class of matroids: a matroid that is not in the 
class but has all its proper minors in the class. 

factor: see divisor. 

1-factor — of a graph: a subgraph in which every vertex has degree 1. 

1-factorization — of a graph: a partitioning of its edge-set into 1-factors. 

flat — of a matroid M: a maximal subset of E(M) of a fixed rank. 

fundamental circuit matrix — of a connected graph with respect to a spanning 
tree: the (m — n+ 1)-rowed submatrix of the circuit matrix in which each row is a 
fundamental circuit vector with respect to the spanning tree, and will be denoted 
by By; in a directed graph, the orientation of the fundamental circuit is chosen to 
agree with the orientation of the chord defining the fundamental circuit. 

fundamental circuit — with respect to a chord c and a spanning tree T of a graph: 
the unique circuit produced by adding chord c to spanning tree T. 

fundamental cutset matrix — of a connected graph with respect to a spanning tree: 
the (n — 1)-rowed submatrix of the cut matrix in which each row is a fundamental 
cutset vector with respect to the spanning tree; in a directed graph, the orientation 
of a fundamental cutset is chosen to agree with the orientation of the edge defining 
the fundamental cutset. 

fundamental cutset — of a graph with respect to an edge: the unique cutset < Vi, V2 >, 
where V; and V3 are the sets of vertices of the two trees that result when the edge 
is removed from the spanning tree. 

generalized dicyclic group: an abstract group generated by an abelian, but not 
elementary abelian, subgroup A of index 2 and an element b of order 4 such that 
conjugation by 6b inverts every element of A. 

geodesic: a geodetic double ray. 

geodetic: said of a path, ray, or double ray that contains a shortest path joining any 
two of its vertices. 

graph product, — of two graphs: the cartesian product. 

graph product? — of two graphs: a graph that results from applying any graph product 
operation q. 

graph, labeled: a graph with labels, typically v1, v2,...,un, assigned to the vertices. 
Two labeled graphs with the same set of labels are considered the same only if there 
is an isomorphism from one to the other that preserves the labels. 
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graphical regular representation of a group G: a graph whose automorphism group 
is isomorphic to G and acts regularly on the vertex set of the graph. 

growth — of an infinite graph G: liminf,_,..[d(k)/a*], where d(k) is the number of 
vertices of G at distance k from a fixed vertex and a > 1 is a real number. 

GRR: graphical regular representation. 

half-transitive: vertex-transitive and edge-transitive, but not arc-transitive. 

Hamilton decomposition — of a regular graph: a partition of the edge-set into Hamil- 
ton cycles (when the degree is even) or into Hamilton cycles and a 1-factor (when 
the degree is odd). 

Hamilton-connected graph: a graph such that for any two vertices u,v, there is a 
Hamilton path whose terminal vertices are u and v. 
Hamilton-laceable graph: a bipartite graph with parts A and B such that for any 
u € A and v € B, there is a Hamilton path whose terminal vertices are u and v. 
Hamming graph H(d,n): the graph whose vertices are the d-tuples (#1,...,¢q@) with 
1 < 2; < n; two vertices are joined if, as d-tuples, they agree in all but one coordinate. 
(The distance between two vertices is then the number of coordinates in which, as 
d-tuples, they differ.) 

hereditary collection — of sets: a collection A of sets such that every subset of a 
member of A is also in A. 
Hoffman polynomial — for an r-regular, connected graph: the polynomial h(x) = 
nq] one the product being taken over all distinct eigenvalues not equal to r. 
homomorphism of general graphs G and H: a pair of mappings f : Ve — Vy and 
f : Ea — Ex such that the endpoint-set of each edge e € Eg is mapped onto the 
endpoint-set of the image edge f(e) € Ex. 

homomorphism of simple graphs G and H: a mapping f : Ve — Vy such that 
whenever the vertices u and v are adjacent in G, the vertices f(u) and f(v) are 
adjacent in H. 

hyperplane — of a matroid M: a maximal subset of E(M) that does not contain a 
basis of M. 

incidence matrix, — of a graph: a matrix whose rows correspond to the vertices and 
whose columns correspond to the edges; the 77 entry is 2 if edge 7 is a self-loop and 
vertex 7 is its endpoint, 1 if edge 7 is a proper edge and vertex 7 is an endpoint, and 
0 otherwise. 

incidence matrix, — of a graph: the n-rowed submatrix of the cut matrix in which 
each row is an incidence vector. 

incidence set — of a vertex: the set of edges incident on that vertex. 

incidence vector — for a directed graph: the cut vector representing the set of edges 
incident on a vertex of the graph, with the orientation of the cut chosen to be away 
from the vertex. 

incidence vector — for an undirected graph: the binary cut vector representing the 
set of edges incident on a vertex of the graph. 

independent set — of a matroid M: a set containing no circuit of M. 

induced subgraph on an edge subset E’ Cc E(G): the subgraph of G with edge-set 
E’ and vertex-set consisting of the endpoints of the edges in E’. 

induced subgraph on a vertex subset V’ Cc V(G): the graph with vertex set V’ 
and edge-set consisting of those edges whose endpoints are in V’. 

infinite connectivity of a graph G — denoted k.(G): the cardinality of a smallest 
set of vertices whose deletion leaves a graph with at least two infinite components. 

isolated vertex: a vertex with degree zero. 


Glossary for Chapter 6 723 


isomorphic factorization — of a graph G: a partition of the edge set of G so that the 
subgraphs induced by the edges in each part are mutually isomorphic. 

isomorphic matroids: matroids M, and M2 for which there is a 1-1 function ¢ from 
E(M;) onto E(M2) such that C is a circuit of M; if and only if (C) is a circuit of 
Mg. 

isomorphic permutation groups: a pair of isomorphic groups whose actions on their 
respective sets are the same, up to a bijection from one object set to the other. 

isomorphism of general graphs G and H: a pair of bijections f : Ve — Vy and 
f : Eq — Ey such that the endpoint-set of each edge e € Eg is mapped onto the 
endpoint-set of the image edge f(e) € Ex. 

isomorphism of simple graphs G and H: a bijection f : V¢ — Vy such that vertices 
f(u) and f(v) are adjacent in H if and only if vertices u and v are adjacent in G. 

Johnson graph J(d,n): the graph whose vertices are the C) subsets of {1,...,n}; two 
vertices are joined if, as subsets, their intersection has cardinality d—1. (The distance 
bewteen two vertices is & if, as subsets, they have an intersection of cardinality d—k.) 

Kuratowski graph: either of the two graphs in Figure 6.4.11, which characterize 
nonplanarity. 

Laplacian matrix: a square matrix whose rows and columns correspond to the vertices 
of a graph, such that a diagonal entry is the degree of the corresponding vertex; an 
off-diagonal entry is —1 if the corresponding vertices are adjacent and 0 otherwise. 

line graph of a graph G, denoted by L(G): a graph whose vertex-set is the edge-set 
of G, with two vertices in L(G) adjacent if, as edges of G, they have an endpoint in 
common. 

__, generalized L(G; a,,...,a@,) — for a graph G with n vertices and nonnegative 
integers a1,...,@,: the graph formed by taking disjoint copies of the line graph 
L(G) and cocktail party graphs CP(a,),...,CP(a,); if a vertex in L(G) corre- 
sponds to the edge joining v; to v; in G, it is joined to all vertices in CP(a;) and 
CP(a;). 

loop — of a matroid M: an element e of E(M) for which {e} is a circuit. 

matroid M: a finite set E(M), the ground set of M, and a collection C(M) of non- 
empty incomparable subsets of E(M) called the circuits of M such that if C, and 
Cz are distinct members of C(M) and e € C1 M C2, then there is a member C3 of 
C(M) such that C3 Cc (C1 U C2) — {e}. 

__, binary: a matroid that is isomorphic to the vector matroid of a matrix over the 
2-element field GF(2). 

__, bond — of a graph G: the matroid on the edge-set of G whose circuits are the 
bonds of G. 

___, cographic: a matroid that is isomorphic to the bond matroid of some graph. 

___, 2-connected: a matroid in which, for every two distinct elements, there is a 
circuit containing both. 

___, 3-connected: a 2-connected matroid that cannot be written as a 2-sum. 

__, graphic: a matroid that is isomorphic to the cycle matroid of some graph. 

__, planar: a matroid that is isomorphic to the cycle matroid of a planar graph. 

___, regular: a matroid that is representable over all fields. 

___, representable — over a field F: a matroid that is isomorphic to the vector matroid 
of some matrix over F. 

___, simple: a matroid in which all circuits have at least three elements. 

__, uniform U;,,: for 0 <m <n, the matroid on {1,2,...,} in which the circuits 
consist of all (m + 1)-element subsets. 
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maximally distant trees: two spanning trees JT; and T> such that d(T,T2) > 
d(T;,T;), for every pair of spanning trees T; and 7). 

metric ray (double ray): a ray (double ray) with positive straightness. 

metric type: describes a ray in an infinite graph that is an a-essential ray for some 
automorphism a. 

minimally k-connected graph: a k-connected graph for which no deletion of an edge 
remains k-connected. 

minimally k-connected matroid: for k in {2,3}, a k-connected matroid for which 
no single-element deletion is k-connected. 

minimum polynomial of a graph G: the monic polynomial q(x) of smallest degree, 
such that q(Ag) = 0. 

minor — of a graph G: a graph that can be obtained from G by a sequence of edge 
deletions, edge contractions, and deletions of isolated vertices. 

minor — of a matroid M: a matroid that can be obtained from M by a sequence of 
deletions and contractions. 

___, proper — of a matroid M: a minor of M that is not equal to M. 

minor-closed — class of matroids: one in which every minor of a member of the class 
is also in the class. 

nonseparating cocircuit — of a 2-connected matroid M: a cocircuit whose deletion 
from M remains 2-connected. 

null graph: a graph with no vertices and hence no edges. 

nullity — of a graph G having n vertices, m edges and p components: nullity is equal 
to m—n-+p and is denoted p(G). 

orientation of a cut (Vi, V2) — in a directed graph: the direction, either from Vi to 
V2 or from V2 to V,, that we choose for the cut. 

orientation of a circuit — in a directed graph: the direction we choose to traverse the 
circuit. 

orthogonal complements — of a vector space: two subspaces whose intersection is 
the zero vector. 

orthogonal subspaces of a vector space: subspaces such that the inner product of 
every vector in one subspace with every vector in the other subspace is equal to 
Zero. 

painting — of a graph: a partitioning of the edges into three sets R (red), Y (yellow), 
and B (blue), and the distinguishing of one edge in the set Y. 

Paley graph: a Cayley graph formed on the additive group of a finite field GF(q), 
q = 1(mod 4), where the connection set is the set of quadratic residues in GF'(q). 
parallel elements e and f of a matroid M: elements such that {e, f} is a circuit of 

M. 
parameter matrix, ['’-order: the matrix P, with the distance-regularity parameter 
Pi, in the (i, 7) entry. 
permutation group: a nonempty set P of permutations (on the same set X of objects), 
such that P is closed under composition and inversion. 
__, doubly transitive: a permutation group that acts transitively on ordered pairs 
of objects. 
__, primitive: a transitive permutation group whose only blocks are trivial. 
___, regular: a permutation group that is both transitive and semiregular. 
___, semiregular: a permutation group all of whose vertex-stabilizers are trivial. 
___, transitive: a permutation group such that for any two objects of the set on which 
it acts, some permutation maps one object onto the other. 
prime graph — under a given product operation: a graph having no proper divisor. 


jth 
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primitive graph: a graph whose automorphism group acts as a primitive permutation 
group on the vertex-set. 

primitive group: a transitive permutation group that has no nontrivial blocks. 

principal subgraphs G, and G2 — of a graph G: see Definition D46 of 86.4. 

product operation — on two graphs G and H: any operation such that the vertex-set 
GH is the cartesian product of Vg and Vz, and such that the edge-set is determined 
exclusively by the adjacency relations in G and Hs. 

quadratic residue — in a finite field: an element of the form x?. 

rank — of a graph G having n vertices and p components: the number of edges in the 
complement of a spanning forest; the rank is equal to n — p and is denoted by p(G); 
usually called the cycle rank. 

rank — of a set A in a matroid: the cardinality of a maximal independent subset of 
A. The rank of a matroid M is the cardinality of a maximal independent subset of 

ray — in an infinite graph: a one-way infinite path. 

__, a-essential — in an infinite graph: a ray that is mapped onto one of its subrays 
by a positive power of the automorphism a. 
___, double — in an infinite graph: a two-way infinite path. 

reduced incidence matrix — of a graph: the submatrix of the incidence matrix con- 
taining any (n — 1) incidence vectors. 

regular action — of a permutation group: see permutation group, regular. 

regular matrix: see Definitions D49 and D50 of 86.4. 

s-regular graph: a graph that contains at least one s-arc and whose automorphism 
group acts regularly on its set of s-arcs. 

relatively prime graphs: graphs having no common proper divisor. 

removable circuit — of a k-connected matroid M: for k in {2,3}, a circuit of M such 
that the deletion of C from M is k-connected. 

removable cycle — of a k-connected graph G: a cycle of G such that the deletion of 
the edges of C' from G leaves a k-connected graph. 

ring sum of two sets £; and FE»: the set consisting of elements that belong to Ey or 
to E2, but not to both EF; and E>; denoted by Ey, @ Ep. 

ring sum of two vectors (21, 22,%3,..-,Zi,---;Um) and (Yi, Y2, Y3,---5Yis-+ +3 Ym): 
the vector Z = (21, 22, 23,---,2i;+--;2m), Where z; = x; ® y; and ® is the logical 
exclusive-or operation (1 ®0=1,0@1=1,0®@0=0,and1@1=0). 

semisymmetric graph: an edge-transitive graph with constant valence (i.e., a regular 
graph) that is not vertex-transitive. 

spanning forest — of a graph G having p components: a collection of p spanning trees, 
one for each component of G. 

spanning set — of a matroid M: a subset of E(M) containing a basis of E(M). 

spanning tree — of a connected graph: a tree that contains all the vertices of the graph. 

spectrum of a graph: the multiset of eigenvalues; for a graph with n vertices, there 
are n eigenvalues. 

stabilizer of a vertex u of a graph G: the subgroup of Aut consisting of the permuta- 
tions that fix vertex u. 

straightness of a ray or double ray D: the number lim infg, (1,1) 00 d(u, v)/dp(u, v), 
where u,v are vertices of D. 


strip: a connected graph G that admits a connected subgraph H and an automorphism 
a such that OH and H — a(#) are finite and a(H UOH) C H. 
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strongly regular graph — with parameters (n,7r,A,j): an r-regular n-vertex graph 
such that any pair of adjacent vertices is mutually adjacent to \ other vertices, and 
such that any pair of nonadjacent vertices is mutually adjacent to yw other vertices. 

2-sum of matroids: for 2-connected matroids M, and M2 on disjoint sets each having 
at least three elements, let p; be an element of M;; the 2-sum with respect to p 
and p2 is the matroid on (E(M) — {pi}) U (E(M2) — {p2}) whose circuits are the 
circuits of M, avoiding p,, the circuits of Mz avoiding po, and all sets of the form 
(C1 — {pi}) U (C2 — {p2}) where C; is a circuit of M; containing p;. 

3-sum of matroids: for binary matroids M, and M2 each having at least seven ele- 
ments such that E(M,) NM E(M2) is a 3-element circuit T of M, and M2 that does 
not contain a cocircuit of either matroid; the 3-sum is the matroid on (E(M,)N 
E(M2)) —T whose flats are those sets F — T such that FN E(M;) is a flat of M; for 
each 2. 

symmetric difference — of two sets E; and E2: the set consisting of only elements 
that belong to FE; or to Ey, but not to both Ey, and E,; denoted by E, @ EF. 

symmetric group S,,: the group of all permutations acting on the set {1,2,...,n}. 

system of imprimitivity: collection of images of a nontrivial block under the action 
of a transitive permutation group. 

ternary matroid: a matroid that is isomorphic to the vector matroid of a matrix over 
the 3-element field GF'(3). 

torsion subgroup: a subgroup of an infinite group, all of whose elements have finite 
order. 

totally unimodular matrix: a matrix over the real numbers for which the determi- 
nant of every square submatrix is in {0,1,—1}. 

tournament: a digraph in which, for each pair u, v of distinct vertices, either there 
exists an arc from wu to v or an arc from v to u but not both. 

__, strong: short for strongly connected tournament. 

___, strongly connected: a tournament such that for each pair u, v of vertices, there 
exist directed paths from u to v and from v to u. 

transitive action — of a permutation group: see permutation group, transitive. 

s-transitive graph: a graph that contains at least one s-arc and whose automorpism 
group acts transitively on its set of s-arcs. 

translation: an endomorphism of a graph that fixes no finite nonempty subset of the 
vertex set. 

tree — in a graph: a connected subgraph of the graph containing no circuits. 

__, 1-4: a tree in which each vertex has degree 1 or 4. 

__, 1-rooted 1-4: a 1-4 tree rooted at a vertex of degree 1. 

__, binary: a root vertex and at most two principal subtrees that are themselves 
binary trees. Each principal subtree must be specified as either the left subtree 
or the right subtree. 

___, homeomorphically reduced: a tree with no vertices of degree 2. 

__, labeled: a tree in which labels, typically v1, v2,...,Un, have been assigned to the 
vertices. Two labeled trees with the same set of labels are considered the same 
only if there is an isomorphism from one to the other that preserves the labels. 

__, left-right: a binary tree in which each vertex has either 0 or 2 children. 

__, ordered: a root vertex and a sequence ¢t1,to,..., tm of m > 0 principal subtrees 
that are themselves ordered trees. The root vertex of an ordered tree is joined 
by an edge to the root of each principal subtree. 

___, reduced: short for tree, homeomorphically reduced. 
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___, rooted: a tree in which one vertex, the root, is distinguished. Two rooted trees 
are considered the same only if there is an isomorphism from one to the other 
that maps the root of the first to the root of the second. 

trivial graph: a graph with a single vertex and no edge. 

unimodular matrix: a matrix of real numbers, the determinant of every square sub- 
matrix of which is equal to 1, —1, or 0. 

union — of matroids M,, M2,...,M, on a common set E: the matroid on E whose 
independent sets consist of all sets of the form J; U Jz U...U I, where J; is an 
independent set of M; for all 7. 

unit — of a ring: an element with a multiplicative inverse. 

vector matroid of a matrix: the matroid on the set of column labels of the matrix 
whose circuits are the minimal linearly dependent multisets of columns. 

vector space of a graph G: the set of all subsets of edges of G; also, the set of all 
vectors representing the subsets of edges of G; more commonly called the edge space 
of G. 

vertex-edge incidence matrix — of a graph: see incidence matriz. 

vertex-transitive graph: a graph whose automorphism group acts transitively on its 
vertex set. 

wheel, W,,: for n > 2, the graph with n+ 1 vertices that is obtained by joining each 
vertex of an n-cycle, called the “rim”, to one newly added vertex called the “hub” by 
an edge, called a “spoke”. 

wheel, W,,: a graph with n vertices, of which n — 1 form a cycle (the rim), with the 
remaining vertex (the hub) adjacent to all the rim vertices. 

whirl, W”: for r > 2, the matroid on the set of edges of W,. whose circuits are all the 
cycles of W, except the rim along with all sets consisting of the rim plus a single 
spoke. 

wreath product — of permutation groups G and H acting on sets S and T,, resp.: a per- 
mutation group on S x T of which each element 7 satisfies m(s,t) = (a(s), Gas) (t)), 
where a € G and (; € H for each t € T. 
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INTRODUCTION 


The need to imbed (draw) finite graphs on surfaces arises in various aspects of 
mathematics and science. Often the simplest surface in which such a graph can be 
imbedded is sought. Some generalizations of surfaces are briefly considered. 


7.1.1 Surfaces 


2-Manifolds and 2-Pseudomanifolds 


DEFINITIONS 


D1: The open unit disk, the closed unit disk, and the unit half-disk are the 
respective subsets 


{(z,y) |e’? +y? <1}, {@y) |e? +y? <1}, and {(z,y)|2 20,07 +y? <1} 
of the Euclidean plane, together with the inherited Euclidean topology. 
D2: An open disk, a closed disk, and a half-disk are any topological spaces homeo- 


morphic, respectively, to the open unit disk, the closed unit disk, or to the unit half-disk. 
A disk usually means a closed disk. 
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D3: A pinched open disk is a topological space obtained from k copies of open disks 
by identifying their respective centers to a single vertex, as shown in Figure 7.1.1. 


Figure 7.1.1: Three disks pinched together. 


D4: A 2-manifold is a topological space in which each point has a neighborhood that 
is homeomorphic either to an open disk or to a half-disk. 


D5: The boundary of a 2-manifold M is the subspace of those points in M that do 
not have neighborhoods homeomorphic to open disks. 


D6: A surface is a 2-manifold, often taken in context to be connected. 
D7: A closed surface is a compact surface without boundary. 


D8: If we relax the definition of a 2-manifold to allow the neighborhoods to be home- 
omorphic not only to open disks or half-disks but also to pinched open disks, then the 
resulting topological space is called a 2-pseudomanifold. 


D9: A pseudosurface is a 2-pseudomanifold (usually taken to be connected). It may 
be obtained from a 2-manifold by successively identifying finitely many pairs of vertices. 
FACTS 


F1: The boundary components of a compact surface are closed curves. That is, each 
boundary component is homeomorphic to the unit circle. 


F2: Every pseudosurface can be obtained from some 2-manifold by iteratively identi- 
fying finitely many pairs of points. 

EXAMPLES 

E1: The Euclidean plane is a non-compact surface. 

E2: The closed disk is a compact surface with a non-empty boundary. 

E3: The half-disk is a non-compact surface with a non-empty boundary. 


E4: The pinched torus is a pseudosurface obtained from a sphere by identifying two 
of its points, as at the left of Figure 7.1.2. 
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E5: A jellyfish pseudosurface (also called the spindle pseudosurface) is obtained 
from two spheres by pairwise identifying some number n of points on one sphere with 
n points on the other, as shown at the right of Figure 7.1.2. 


Figure 7.1.2: The pinched torus and a jellyfish pseudosurface. 


Some Standard Surfaces 


DEFINITIONS 


D10: A sphere (usually denoted by So) is any surface homeomorphic to the unit 
sphere {(x,y,z) |v? +y? +27 = 1}. See Figure 7.1.3 (left). 


D1i1: A cylinder (Or us) is any surface which is homeomorphic to the unit 
cylinder {(x,y,z) | x? +y? =1,-1< 2 <1}. See Figure 7.1.3 (right). 


ol 


Figure 7.1.3: The sphere So and the cylinder. 


D12: A projective plane (usually denoted by N;) is a closed surface homeomorphic 
to the surface obtained from the closed unit disk by identifying pairs of boundary points 
that are diametrically opposite relative to the center of the disk. 


D13: A Mobius band (or Mébius strip) is any surface that is homeomorphic to the 
surface obtained from a unit square {(z,y)| 1 <a«<1,-1< y < 1} by pasting the 
vertical sides together with the matching (—1, y) > (1,—y). See Figure 7.1.4. 


Figure 7.1.4: A Mobius band is a non-orientable surface with boundary. 
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D14: A torus (usually denoted by $1) is a closed surface homeomorphic to the 
subset of the Euclidean three-dimensional space obtained by rotating a circle {(z, y, z) | 
(2 — 2)? + y? = 1, z =0} around the y-axis. See Figure 7.1.5 (left). 


Figure 7.1.5: The torus S; and the Klein bottle No. 


D15: The Klein bottle (usually denoted by N2) is a closed surface homeomorphic 
to the surface obtained from the unit cylinder {(z,y,z) | 2? + y? = 1,-1 < z < 1} by 
identifying the pairs of points {(a, y, —1), (x, —y, 1)} on the two boundary components. 
See Figure 7.1.5 (right). 

FACTS 


F3: An annulus can be obtained by excising the interior of a disk from a sphere. 


F4: A Mobius band can be obtained by excising the interior of a disk from a projective 
plane. 


F5: The torus can be obtained by identifying the pairs of points {(z,y,—1), (a, y, 1)} 
on the two boundary components of the unit cylinder 


{(x,y,z) | 2? +y? =1,-1<2z< 1} 


EXAMPLES 
E6: The sphere, the torus, and the Klein bottle are closed surfaces. 


E7: A closed disk is a compact surface with one boundary component, while an annulus 
is a compact surface with two boundary components. 


E8: A Mobius strip is a compact surface with one boundary component. 


Surface Operations and Classification 


DEFINITIONS 


D16: The connected sum S#S' of two surfaces S and S’ is obtained by excising the 
interior of a closed disk in each surface and then gluing the corresponding boundary 
curves. 


734 Chapter 7. Topological Graph Theory 


D17: Adding an orientable handle to a surface S means forming the connected 
sum S#$}, as in Figure 7.1.6. 


Figure 7.1.6: (left) Handle; (right) adding a handle to S, to obtain S3. 


D18: The orientable surface with g handles or the g-torus is the connected sum 
of g copies of a torus. It is denoted by Sy. 


D19: Adding a crosscap to a surface S means forming the connected sum S#Nj,. 


D20: The non-orientable surface with k crosscaps, denoted by Nx, is the con- 
nected sum of & copies of the projective plane Nj. 


D21: A 2-manifold is non-orientable if it contains a subspace that is homeomorphic 
to the Mobius band. Otherwise it is orientable. 


D22: The genus 7(5S) of a closed orientable surface homeomorphic to S, is the 
number g of handles. 


D23: The crosscap number or non-orientable genus 4(S) of a closed non- 
orientable surface homeomorphic to Ny, is k. 


D24: The Euler characteristic y(S) of a closed surface is defined by these formulas: 
x(S,): = 2-29 and x(N;,) =2—k 


D25: A surface with k holes is obtained by removing the interiors of k disjoint disks 
from a closed surface. 


D26: A simple closed curve on a surface is a separating curve if its excision splits 
the surface into two components. Otherwise it is a non-separating curve. 


D27: A separating curve C' on a surface S is a contractible curve if the closure of 
one of the components of S — C is a disk. 


D28: A curve C on a surface S is an essential curve if it is not contractible. 


D29: A simple closed curve on a surface is orientation reversing if its regular 
neighborhood is a Mobius band. 


Section 7.1. Graphs on Surfaces 735 


FACTS 


F6: An equivalent way to add a handle to an orientable surface is to remove the 
interiors of two disjoint disks and then to match the boundary components of a cylinder 
to the resulting boundary components, so as to preserve the orientation. 


F7: The connected sum is well defined (up to homeomorphism of topological spaces) 
and is commutative and associative. 


F8: A 2-manifold is orientable if and only if it is homeomorphic to a two-sided subspace 
of Euclidean 3-space. 


F9: A closed 2-manifold is orientable if and only if it is homeomorphic to a surface in 
Euclidean 3-space. 


F10: Classification of Closed Surfaces: Each closed surface is homeomorphic to 
one and only one of the following surfaces: S,,g > 0, or Nx, k > 1. 


F11: Classification of Compact Surfaces: Each compact surface with non-empty 
boundary components is isomorphic to a closed surface with holes. Each compact 
surface is completely specified by its orientability, an integer giving the genus or crosscap 
number, and the number b of holes. 


F12: SoH#Sq! = So+9'5 NF Nx > Ne+k’s and Nr#S: = Nk+29 


F13: There are four mutually exclusive types of closed curves on surfaces: 
(a) separating and contractible 
(b) separating and non-contractible 
(c) non-separating and orientation preserving 


(d) non-separating and orientation reversing 
EXAMPLES 


E9: The sphere and the torus are orientable surfaces. Both are realizable in 3-space. 


E10: Since the Klein bottle and the projective plane are non-orientable closed surfaces, 
it follows that they cannot be realized in 3-space. 


E11: The Mobius strip is a non-orientable surface with boundary, and it can be realized 
in 3-space. 


E12: Whereas the Jordan curve theorem asserts that every closed curve on the sphere 
separates the sphere, the Schonfliess theorem asserts the stronger result that every closed 
curve on a sphere bounds a disk. 


E13: The pinched torus can be obtained by contracting (“pinching”) a non-separating 
closed curve on a torus to a point. 
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7.1.2 Polygonal Complexes 


DEFINITIONS 


D30: A polygon is oriented if one of the two possible directions of traversal (i.e., 
clockwise or counterclockwise) of its boundary has been designated as preferred. 


D31: Two topological spaces X and Y can be pasted together along homeomorphic 
subspaces by identifying the points of those subspaces under a homeomorphism. 


D32: A polygonal comple is a structure obtained from a set of oriented polygons 
by pasting some of these polygons to each other and to themselves along their sides 
(which also results in the identification of corners). Within a polygonal complex, 


e each polygon is called a face or a 2-cell; 


e the image of arbitrarily many polygon sides that have been pasted together is 
called an edge or a 1-cell;, 


e the image of arbitrarily many polygon corners that have been pasted together is 
called a vertex or a O-cell. 


D33: The 1-skeleton of a polygonal complex is the graph that is formed by its vertices 
and edges. 


D34: Each edge e of a polygonal complex is given a preferred direction of traversal, 
and a traversal of that edge in the reverse direction within a walk in the 1-skeleton 
is denoted et. 


D35: A polygonal complex is consistently oriented at edge e if within the union 
of the oriented boundary walks, it is not traversed twice in the same direction. Thus, a 
complex is consistently oriented at edge e if that edge results from a polygon side that 
was not pasted to another side, or if that edge results from pasting two sides together 
so that the traversal directions are opposite. 


D36: A polygonal complex is oriented if it is consistently oriented at every edge. 


D37: The underlying topological space of a polygonal complex is the quotient space 
for the union of all the polygons after all the identifications. 


D38: A polygonal complex is said to realize any topological space that is homeo- 
morphic to its underlying space. 


D39: Occurrences of an edge e or its inverse within a walk are called stgned edges. 


D40: The oriented boundary walk of a face of a polygonal complex is the closed 
walk in the 1-skeleton that results from traversing the face boundary in the direction of 
orientation. (This walk is unique up to the choice of a starting/stopping vertex.) 


D41: The signed boundary walk of a face of a polygonal complex is the list of the 
signed edges that occur on an oriented boundary walk of that face. 


D42: The boundary-walk specification of a polygonal complex is a list of the signed 
boundary walks of the faces. 
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D43: The vertex variant of the boundary specification of a polygonal complex whose 
1-skeleton is a simple graph gives the boundary walks as cyclic lists of vertices. 


D44: A fundamental polygon for a closed surface is a polygon whose edges are 
pairwise identified and pasted so that the resulting polygonal complex has only one face 
and so that it realizes that surface. 


D45: A specification of a fundamental polygon with 2n sides is its signed bound- 
ary walk. 


D46: The standard fundamental polygon for the orientable surface Sj, is spec- 
ified as aybya, by ‘agbeaz 'by' aks Agbga,"b,*. 


D47: The standard fundamental polygon for the non-orientable surface N;, is 
specified as A101,a2A2...AKAK- 


FACTS 


F14: A polygonal complex can be described combinatorially as the set of its signed 
boundary walks. 


F15: A polygonal complex can realize any compact surface or pseudosurface. 


F16: A polygonal complex realizes a pseudosurface or 2-manifold if and only if each 
side of each polygon is glued to exactly one other side; it realizes a 2-manifold if, in 
addition, every vertex has a topological neighborhood that is homeomorphic to a disk 
(this additional restriction serves to eliminate pinched disks). 


EXAMPLES 


E14: A book with n leaves, n > 3 (or an n-book) is a polygonal complex obtained 
by choosing a side in each of n polygons, often squares, and pasting all the chosen sides, 
as illustrated in Figure 7.1.7. The edge corresponding to the common side is called the 
spine of the book. 


Figure 7.1.7: The 3-book is a polygonal complex that is not a surface. 


E15: The 3-book with spine a can be specified as {ab1c1d1, abecod2, ab3c3d3}. 


E16: The polygonal complex {abc,aeh~'g~', bfi-te—!, cgj-f—', hij} is orientable 
but is not oriented. Reversing the orientation of the first polygon to c-'b~ta~+ would 
make the complex oriented. 
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E17: The Mobius band can be specified as {abed,efgb~', fid~th}, in which case the 
1-skeleton is K3,3. 


E18: If we add a hexagon aeh~'c~'g~1i to the Mébius band specification in Example 
E17, the resulting polygonal complex realizes a projective plane. 


E19: The polygonal complex {abb~'a~!bb~'} realizes the pinched torus with b as its 
pinch point. 


E20: The standard fundamental polygon for the sphere So has the form {a a; ‘}. 


7.1.3. Imbeddings 


DEFINITIONS 


D48: A topological realization of a graph G is obtained by first assigning to each 
of its edges a closed interval and then identifying endpoints of intervals according to the 
coincidences of the corresponding endpoints of edges of the graph. 


D49: An immersion of a topological space is a continuous mapping that is locally 
one-to-one; that is, each point of the domain has a neighborhood that is mapped home- 
omorphically into the codomain. 


D50: An imbedding is an immersion that is globally one-to-one. 


D51: An imbedding of a graph G means an imbedding of a topological realization 
of G. 


D52: A face of the imbedding is a connected component of the complement of the 
image. 


D53: A cellular imbedding or 2-cell imbedding of a graph into a surface is an 
imbedding such that the interior of each face is an open disk; thus, the complement of 
the image of the imbedding is a union of open disks. 


D54: A strongly cellular imbedding is an imbedding such that the closure of each 
face is a closed disk; that is, no two points on the boundary of any face are identified. 


D55: The minimum genus of a graph G (or sometimes, simply genus) is the 
minimum of the set of integers g such that G is imbeddable in the orientable surface 
Sg. It is denoted by Ymin(G) or by 7(G). 


D56: A minimum genus imbedding of a graph G (or sometimes, simply genus 
imbedding) is an imbedding of G into a closed surface of minimum genus. 


D57: The maximum genus of a graph G is the maximum of the set of integers g such 
that G has a cellular imbedding in the orientable surface S,. It is denoted by ymax(@). 


D58: A maximum genus imbedding of a graph is an imbedding into a closed surface 
of maximum genus. 
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D59: The minimum crosscap number of a graph G (or sometimes, simply crosscap 
number) is the minimum of the set of integers k such that G is imbeddable in the non- 
orientable surface N;,. It is denoted by Fmin(G) or by 4(G). 


D60: A minimum crosscap imbedding is an imbedding into a closed non-orientable 
surface of minimum crosscap number. 


D61: The maximum crosscap number of a graph G is the maximum of the set of 
integers k such that G has a cellular imbedding in the non-orientable surface Nz. It is 
denoted by Fmax(G). 


D62: A maximum crosscap imbedding of a graph is 2-cell imbedding into a closed 
non-orientable surface of maximum crosscap number. 


FACTS 


F17: Every finite graph has a topological realization that can be imbedded in Euclidean 
3-space. 


F18: A disconnected graph has no cellular imbedding. 
F19: [Yo63] Every connected graph has a minimum genus imbedding that is cellular. 


F20: [PPPV87] If a connected graph is not a tree, then it has a minimum crosscap 
imbedding that is cellular. 


F21: [Du66] Let 9’ < g < g”. Ifa graph admits a 2-cell imbedding in the surfaces S,) 
and Sy then it also admits a 2-cell imbedding in Sy. 


F22: Let k’ <k < k”. Ifa graph admits a 2-cell imbedding in the surfaces Ny and 
Nx then it also admits a 2-cell imbedding in S,. 


F23: [Wh33] Each planar 3-connected graph admits an essentially unique imbedding 
in the sphere. This is not generally true for imbeddings into other surfaces, not even 
for genus imbeddings. 


F24: [Th89] The problem of determining the minimum genus of a graph is NP-hard. 


F25: [Mo99] For a given graph and a fixed surface there exists a linear-time algorithm 
that either finds an imbedding of the graph in that surface or finds an obstruction for 
such an imbedding. The algorithm is not good for practical purposes since it subsumes 
the knowledge of all forbidden graphs for a given surface. The collection of such graphs 
may be quite large for a surface of moderate size genus. 


F26: Euler polyhedral equation: Each cellular imbedding of a graph with v vertices, 
e edges, and f faces into a surface S satisfies the relation 


v—e+f=x(S) 


F27: [AuBrYo63] For any graph G, 7(G) < 27(G) +1; however, the gap may be 
arbitrarily large. 


F28: [At68] Any graph can be imbedded in a 3-book. 


F29: Every simple graph can be immersed in the plane by spacing the vertices evenly 
around the unit circle and joining adjacent vertices with line segments. 
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EXAMPLES 


E21: Figure 7.1.8 shows two imbeddings of the complete graph Ky, on the torus, one 
non-cellular and the other cellular. 


aoa od ——-> 
— mS 


Figure 7.1.8: Two toroidal imbeddings of K4. 


E22: The vertex-variant specification (1234) (5678) (1265) (2376) (3487)(4158) for the 
cube graph Q3 corresponds to the following imbedding: 


4 


Figure 7.1.9: A standard imbedding of the cube graph Q3 in the sphere. 


E23: Two non-equivalent imbeddings of the cube graph Q3 in the torus given by the 
following vertex-variant specifications 


(123765) (341587) (234876) (126584) (148762) (123785) (326584) (567341) 


are shown in Figure 7.1.10. 


Figure 7.1.10: Two hexagonal imbeddings of Qs in the torus. 
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7.1.4 Combinatorial Descriptions of Maps 
DEFINITIONS 


D63: A graph can be defined alternatively as a combinatorial structure (V, £) with 
ground set S, as follows: 


e The elements of the set S$ are called half-edges. 


e FE is a partition of S into cells of size two, such that each half-edge is paired 
with what amounts to the other half of the same edge. This partition is often 
represented as the set of orbits of an involution T. 


e V isa partition of the half-edges according to the vertex at which they are incident. 


D64: A rotation at a vertex is a cyclic permutation on the set of half-edges at that 
vertex. 


D65: The surface rotation at a vertex v of a graph imbedding is the cyclic ordering 
of the half-edges at v on the surface. If the surface is orientable, this ordering is taken 
to be consistent with the orientation. 


D66: A (global) rotation (also called a rotation system) on a graph is an assign- 
ment of a rotation at each vertex. This corresponds to a permutation p on the set of 
half-edges whose orbits are the rotations at the vertices. 


D67: The (global) surface rotation of an imbedded graph is the set of surface 
rotations at all the vertices. 


D68: The induced imbedding of a global rotation p on a graph is an imbedding of 
that graph whose global surface rotation is p. (The face tracing algorithm below serves 
as proof that such an imbedding exists. It is obviously unique.) 


D69: A face tracing for a global rotation on a graph is a list of the boundary walks 
of the faces of an induced imbedding. 


D70: The signature of a graph G = (V,E) is a subset A C Eg, whose edges 
are called switches. They represent the edges whose traversal switches the sense of 
orientation in an imbedding. 


D71: A generalized rotation is a pair (p, A) composed of a global rotation and a 
signature. 


ALGORITHM 

We suppose that a global rotation p and an involution 7 on the set of half-edges of 
a graph G are given as input. We want to do a face tracing. To make this easily 
understood, we use notation of the form e and e~! for two half-edges paired by the 
involution 7, i.e., for the two different ends of the same edge. To each cycle of the 
rotation p, we visualize a vertex at which the half-edges within that cycle are simulta- 
neously incident in the graph G. 
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Algorithm 7.1.1 Face-Tracing Algorithm 


Input: half-edge list E+, involution 7, rotation p 
Output: list of all face-boundaries of the induced imbedding 


{Initialize} Mark all half-edges unused 
While any unused half-edges remain 
Choose next (lex order) unused half-edge y from E* 
Start new cycle by writing left paren  “(” 
Li=y 
Repeat 
Write x next in current cycle 
x:=p(t(x)) (next half-edge) 
Until « = y 
Close current cycle by writing right paren “)” 
Continue with next iteration of while-loop 


The algorithm for a generalized rotation is slightly more complicated, since it involves 
reversal of cycles. See, for example, Chapter 4 of [GrTu87]. 


EXAMPLES 


E24: A convenient way to apply the Face-Tracing Algorithm uses a table that lists the 
half-edges incident at each vertex, in the cyclic order of the rotation there. For instance, 
this table presents an imbedding of the graph kK in the sphere So. 


- et dt PIG O ea yet, d le ao »€ )(b id at) 
U2. C™ i e~ T=(a tq, + cre t ete t 
ee a (ata (bb (ote ara-\(ere)(F*F-) 


The composition permutation pr has a disjoint cycle representation with four 3-cycles, 
which correspond to the boundary walks of the four triangular faces. Using notation 
that clearly associates corresponding half-edges avoids the need to write the involution. 


E25: The following table presents an imbedding of Ky in the torus $j. 


vy. at bt et 
v2. @ da 
vg. CC f— e 
vg. O ft de 


The imbedding has one 4-sided face and one 8-sided face. 


E26: At the left of Figure 7.1.11 is an imbedding of the dipole Ds on the sphere. At 
the right are shown the three polygons of that imbedding, prior to pasting. Since D3 is 
not a simple graph, the specification of that imbedding as a set of boundary walks 


f=(c"), g=(ca"), h=(a,b-*) 
uses edges, not vertices, as does the specification by global rotation 


p=u:(abe) v:(a-*c7*b-+) 
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a 
- C 
E> Sa 
a b 
Figure 7.1.11: A spherical imbedding of the dipole Ds. 


E27: At the left of Figure 7.1.12 is an imbedding of the dipole Ds on the torus. At 
the right is shown the one polygon of that imbedding, prior to pasting its sides. 


Figure 7.1.12: Toroidal imbedding of the dipole D3. 


As a set of boundary walks, it has the specification 
fH a@e 0" .60") 
and by global rotation the specification 
p=u:(abe) v:(a-'b-'c1) 


E28: At the left of Figure 7.1.13 is an imbedding of the dipole Ds on the Klein bottle. 
At the right is shown the one polygon of that imbedding, prior to pasting its sides. 


Figure 7.1.13: Klein bottle imbedding of the dipole D3. 


As a set of boundary walks, it has the specification 
f= (Ge be a 07") 
and by global generalized rotation the specification 


p=u:(abe) v:(a'b-'e") A= {b} 
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FACTS 


F30: The set of global rotations on a graph is in bijective correspondence with the set 
of oriented, cellular imbeddings of that graph. 


F31: Generalized rotations correspond to cellular imbeddings into arbitrary closed 
surfaces. 


F32: The imbedding of a graph specified by a generalized rotation is non-orientable if 
and only if there is a cycle in G containing an odd number of switches. 
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INTRODUCTION 


The study of graph minimum genus and maximum genus has been among the most 
important and interesting topics in the research in topological graph theory. Besides its 
theoretical importance, the study has found many impressive applications in such areas 
as VLSI design, computer algorithms and complexity, and computer graphics. 


7.2.1 Definitions and Basic Facts 


The graphs in our discussion may have multiple adjacencies or self-adjacencies. A 
graph with no multiple adjacencies and self-adjacencies is called a stmple graph. Unless 
stated explicitly otherwise, any graph in our discussion is assumed to be connected. 
An edge e in a graph is a cut-edge if removing e disconnects the graph. A graph is 
vertez-k-connected (resp. edge-k-connected) if it remains connected after removing 
any k —1 vertices (resp. any k — 1 edges). We sometimes shorten “vertex-k-connected” 
to k-connected. Let C be the set of all cut-edges of a graph G. Each connected 
component of G — C is called an edge-2-connected component of G. Clearly, each 
edge-2-connected component of G' is either an edge-2-connected graph or a single vertex. 

A theorem of Brahana [Brah32] asserts that any orientable surface is homeomorphic 
to the sphere with g handles, where g is called the genus of the surface. An embedding 
p(G) of a graph G in an orientable surface S is a continuous one-to-one function p from a 
topological representation of the graph G into the surface S. Each connected component 
of S — p(G) is called a face of the embedding p(G). The genus of the embedding p(G) 
is defined to be the genus of the surface S. An embedding is cellular if the interior 
of each face of the embedding is homeomorphic to a 2-dimensional open disk. Our 
discussion will be restricted to cellular graph embeddings. 
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DEFINITIONS 


D1: The minimum genus ymin(G) (or simply the genus y(G)) of a graph G is the 
minimum integer g such that there exists an embedding of G into the orientable surface 
Sq of genus g. 


D2: The maximum genus Jmax(G) of a graph G is the maximum integer g such that 
there exists an embedding of G into the orientable surface of genus g. 


D3: The number |E| —|V|+ 1 is called the cycle rank (or the Betti number) of 
the graph G, denoted 3(G). Intuitively, this is the number of edges remaining after the 
edges of a spanning tree are removed. 


FACTS 


F1: The genus of any embedding of a graph G is an integer between 0 and |6(G)/2], 
where {(G) is the cycle rank of the graph G. 


F2: [BHKY62, NSW71] Let {B,, Bo,--- ,B,} be the collection of edge-2-connected 
components of a graph G. Then 


k k 
min (G) — S- “min (B;) and max (G) om Ss max (B;) 
t=1 1=1 


These results are commonly called additive properties. 


F3: [Euler Polyhedral Equation] (for a proof, see [GrTu87]) An embedding of a 
graph G with vertex set V, edge set FE, face set F', and genus g satisfies the relation: 


|V| — |E| + |F] = 2-29 


REMARK 


R1: According to Fact F2, in most cases we need to concentrate only on the minimum 
genus and maximum genus of edge-2-connected graphs. 


EXAMPLE 


E1: Both the minimum genus and the maximum genus of a tree are equal to 0; the 
minimum genus of the complete graph K,4 of four vertices is 0 while the maximum genus 
of Ky is equal to 1. 


Ear Decomposition 


DEFINITION 


D4: An ear decomposition D = [P,, P2,--- ,P,| of a graph G is a partition of the 
edge set of G into an ordered collection of edge-disjoint simple paths P,, P,, ---, P, 
such that P; is a simple cycle and P;, i > 2, is a path with only its endpoints in common 
with P, +---+ P;_,. Each path P,; is called an ear. 


FACT 
The class of edge-2-connected graphs has the following nice characterization. 


F4: [Whit32] A graph G has an ear decomposition if and only if G is edge-2-connected. 
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Edge Insertion and Deletion 


The operations of edge insertion and edge deletion have turned out to be important and 
useful in the study of graph embeddings. 


DEFINITIONS 


D5: Let p(G) be an embedding of a graph G. We say a new edge e is inserted into 
p(G) if the two ends of e are inserted into face corners in p(G) to make an embedding 
for the graph G+ e. The operation is called edge insertion. 


D6: The edge deletion operation acts inversely to edge insertion: let p(G’) be an 
embedding of the graph G’ and let e be an edge in G’ that is not a cut-edge. If the two 
sides of e belong to two different faces of p(G’), then deleting e from p(G’) “merges” 
the two faces without changing the embedding genus; if the two sides of e belong to 
the same face in p(G’), then deleting e from p(G’) “splits” the face into two faces and 
decreases the embedding genus by 1. 


FACTS 


F5: If the two ends of edge e are inserted into the corners of the same face f in 
p(G), then the edge e “splits” the face f into two faces and leaves the embedding genus 
unchanged. In this case, the two sides of the new edge e belong to two different faces 
in the resulting embedding for G +e. See Figure 7.2.1(a) for illustration. 


ED).CA CB. 


Figure 7.2.1: Inserting a new edge into an embedding. 


F6: On the other hand, if the two ends of e are inserted into the corners of two different 
faces f,; and fz in p(G), then the edge e “merges” the faces f; and f2 into a single larger 
face and increases the embedding genus by 1. In this case, the two sides of the new 
edge e belong to the same face (i.e., the new larger face) in the resulting embedding for 
G+e. Topologically, this operation can be implemented as follows: first we cut along 
the boundaries of the two faces f; and fz and leave two holes on the surface. Then we 
add a handle to the surface by pasting the two ends of a cylinder with two open ends to 
the boundaries of the two holes on the surface, respectively. The new edge e now runs 
along the new handle. See Figure 7.2.1(b) for illustration. 


F7: Inserting an edge to a graph embedding can never decrease the embedding genus, 
and deleting an edge that is not a cut-edge from a graph embedding can never increase 
the embedding genus. 
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7.2.2 Kuratowski-Type Theorems 


Any characterization of graph embeddability in terms of a finite set of forbidden 
subgraphs has been called a “Kuratowski-type” result, in celebration of Fact F8. 


DEFINITIONS 


D7: Let e = [u,v] be an edge in a graph G. Denote by G/e the graph obtained from 
G by removing the edge e then identifying the two ends u and v. We call this operation 
an edge contraction. 


Ds: A graph A is a minor of another graph G if H can be obtained from a subgraph 
of G by contracting edges. 


D9: A class F of graphs is minor closed if for each graph G in F, all minors of G 
are also in F. 


D10: Let F¥ be a minor closed graph class. A graph G isa minimal forbidden minor 
for F if G is not in F but every proper minor of G is in F. The set of all minimal 
forbidden minors for the graph class F is called the set of minimal forbidden minors 
for F. 


FACTS 


F8: [Kura30] |[Kuratowski’s Theorem] A graph G is planar (i.e., the minimum genus 
of G is 0) if and only if G contains no subgraphs homeomorphic to either Ks or K3,3. 


F9: [Wagn37] A graph is planar if and only if it has neither Ks nor K3,3 as its minor. 


F10: Every minor of a graph G can be obtained from G by a sequence of operations 
of edge contractions and edge deletions (we assume here that isolated vertices are auto- 
matically removed whenever they are created in the process). This follows immediately 
from Definition D8. 


F11: The Kuratowski theorem is equivalent to the statement that a graph G is planar 
if and only if G has neither Ks nor K33 as its minor. Therefore, Ks and K3 3 are 
essentially the only two forbidden minors for the class of planar graphs. 


Minimum Genus 


Erdés and Kénig [Koni36] raised the question whether there is a Kuratowski-type the- 
orem for the class of graphs that are embeddable (not necessarily 2-cellular embed- 
dable) in a fixed surface S. Some special cases were vastly generalized by the pow- 
erful results of Robertson and Seymour in their study of graph minor theory. The 
study of graph minor theory has induced significant progress in the research of graph 
theory. In particular, an impressive series of Robertson and Seymour’s work (see 
[RoSe85, RoSe88, RoSe90a, RoSe90b, RoSe95]) has led to the confirmation in Fact 
F14 of a well-known conjecture by Wagner [Wagn37]. 
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FACTS 


F12: [GlHu78] There is a finite set F, of graphs such that a graph G is embeddable 
in the projective plane Nj, (i.e., the non-orientable surface of crosscap number 1) if 
and only if G has no minor in F;. A complete list of these graphs can be found in 
[Arch81, GHW79]. 


F13: [ArHu89] For any non-orientable surface N, there is a finite set Fy of graphs 
such that a graph G is embeddable in the non-orientable surface N, if and only if G has 
no minor in Fy. 


F14: [RoSe88](Formerly known as Wagner’s Conjecture) Any minor-closed class of 
graphs has a finite set of minimal forbidden minors. 


F15: For every integer g > 0, the class of graphs of minimum genus at most g is minor 
closed. 


F16: [RoSe90b] For every integer g > 0, the set of minimal forbidden minors for the 
class of graphs of minimum genus at most g is finite. 


REMARKS 


R2: A constructive proof for Fact F16 was developed by Mohar [Moha99]. There has 
been further effort to simplify the proof [Thom97b]. On the other hand, it has remained 
as a challenge, even for every small g such as g = 1, to give a good estimation on the 
number of graphs or the size of the graphs in the set of minimal forbidden minors in 
Fact F16. Seymour [Seym93] has shown that the size of the set of minimal forbidden 
minors for graphs of minimum genus bounded by g is bounded by 


(69+9)° 
92 


R3: Proof that the class of planar graphs is minor closed is not difficult. Let G be a 
planar graph and let p(G) be a planar embedding of G. Contracting an edge e of G on 
the planar embedding p(G) can be accomplished by continuously “shrinking” the edge 
e on the plane until the two ends of e are identified. This gives a planar embedding 
of the contracted graph G/e. Moreover, by Fact F7, edge deletion does not increase 
embedding genus. We conclude that every minor of a planar graph is also planar, i.e., 
the class of planar graphs is minor closed. 


Maximum Genus 


We point out that a class of graphs defined in terms of maximum genus is in general not 
minor closed. For example, the bouquet By of two self-loops (i.e., the graph with a single 
vertex and two self-loops) is a minor of the “dumbbell” D (i.e., the graph consisting of 
an edge [u,v] plus two self-loops on u and v, respectively). However, it is easy to verify 
that Ymax(Bo) = 1 while ymax(D) = 0 (see Facts F18 and F21 below). 


DEFINITIONS 


D11: Let G be a graph and let v be a degree-2 vertex with two neighbors u and w 
in G (u and w could be the same vertex). We say that a graph G’ is obtained from G 
by smoothing the vertex v if G’ is constructed from G by removing the vertex v then 
adding a new edge connecting the vertices u and w. 
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D12: Two graphs G; and Gp are homeomorphic if they become isomorphic after 
smoothing all degree-2 vertices. It is easy to see that two homeomorphic graphs have 
the same minimum genus and the same maximum genus. 


D13: A graph is a cactus if it can be constructed from a tree T and a subset S' of 
vertices in T’, by replacing each vertex in S by a cycle. 


D14: A necklace of type (r,s) is obtained from a cycle C2,4, of 2r +s vertices by 
doubling r non-adjacent edges in C 2,1, (or, equivalently, by adding an extra multiple 


edge to each of these adjacencies), and adding a self-loop at each of the other s vertices. 
Figure 7.2.2 gives a type (4,0) necklace and a type (1,3) necklace. 


Figure 7.2.2: Left: a type (4,0) necklace; right: a type (1,3) necklace. 


FACTS 
F17: An edge-2-connected graph G has maximum genus 0 if and only if G is a cycle. 


F18: A graph G has maximum genus 0 if and only if no vertex is contained in two 
different cycles in G. 


F19: [NSW71] A graph G has maximum genus 0 if and only if G is a cactus. 
F20: The maximum genus of any necklace is 1. 


F21: [ChGr93] An edge-2-connected graph G has maximum genus 1 if and only if G 
is homeomorphic to either a necklace or one of the graphs in Figure 7.2.3. 


F22: More generally, a graph G has maximum genus | if and only if all except one of its 
edge-2-connected components are either a cycle or a single vertex, and the exceptional 


edge-2-connected component of G is homeomorphic to either a necklace or one of the 
graphs in Figure 7.2.3. 


SBSerg kA 


Figure 7.2.3: Graphs of maximum genus 1 that are not necklaces. 
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7.2.3. Planarity and Upper-Embeddability 


There has been extensive research in the study on graphs of minimum genus 0 and 
on graphs G of maximum genus |6(G)/2]. 


DEFINITIONS 


D15: A graph G whose maximum genus is | 3(G)/2] is called an upper-embeddable 
graph. 


D16: An ear decomposition D = [P;, P2,--- , P,| of a graph G is 3-connected if r > 3, 
P,+P2+ P3 is homeomorphic to the complete graph Ky, and every subgraph P; +--- P; 
of G, 3 <i< vr, is homeomorphic to a 3-connected graph. 


D17: The edge complement G— T' of a spanning tree of a graph G is called a co-tree. 
Note that the number of edges in a co-tree is exactly equal to the cycle rank 6(G) of 
the graph G. 


D18: A component H of the co-tree G — T is called an even component (resp. odd 
component) if the number of edges in H is even (resp. odd). 


D19: For a given spanning tree T of the graph G, the deficiency €(G,T) of T is 
defined to be the number of odd components of the co-tree G — T. 


D20: The deficiency £(G) of the graph G is defined to be the minimum of £(G, T) 
over all spanning trees T. 


D21: A spanning tree T of G is called a Xuong tree if the deficiency €(G,T) of T 
is equal to the deficiency €(G) of the graph G. 


D22: We say that two edges are adjacent if they share a common endpoint. 


D23: Let A be a subset of edges of a graph G, and let G— A be the graph obtained 
from G by deleting the edges in A. (The graph G — A need not be connected.) Define 
C.(G — A) and C.(G — A) to be the numbers of connected components in G — A with 
odd cycle rank and with even cycle rank, respectively, and let v(G, A) = C.(G— A) + 
2C,(G—A)—|A|—1. The Nebesky nu-invariant v(G) is defined to be the maximum 
of v(G, A) over all edge subsets A of the graph G. 


FACTS ABOUT PLANARITY 


The study of 3-connected planar graphs has played an important role in the study of 
planar graphs. A number of important properties of planar embeddings of 3-connected 
planar graphs can be derived based on a special ear-decomposition of 3-connected 
graphs. 


F23: [Chen90] Every 3-connected graph has a 3-connected ear decomposition. 


Based on Fact F23 and using induction, we can derive the following two well-known 
results on 3-connected planar graphs. 


F24: [Whit33] There is essentially only one way to embed a 3-connected planar graph 
in the plane. 
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F25: [Tutt60] Every 3-connected planar graph has a planar embedding in the geometric 
plane in which every face, except the outer face, is a convex polygon. 


F26: [Fary48] Every planar graph has a planar embedding in the geometric plane in 
which every edge is a straight line segment. 


FACTS ABOUT UPPER-EMBEDDABILITY 


Early study on graph upper-embeddability was focused on derivation of upper- 
embeddability of special graph classes. It became clearer later that most of these results 
could be obtained from effective characterizations of graph maximum genus. There have 
been a number of successful characterizations of graph maximum genus, which are now 
described. 


F27: [Xuon79a] The maximum genus 7max(G) of a graph G is equal to (6(G)—&(G)) /2. 


F28: [ChKa99] A spanning tree T of G can be constructed so that the co-tree G — T 
contains at least ymax(G) pairs of adjacent edges. Therefore, ymax(G) is bounded by 
(8(G) — €(G))/2. This implies Fact F27. 


F29: [Nebe8la] v(G) = €(G). In consequence, the maximum genus ymax(G) of the 
graph G is equal to (6(G) — v(G))/2. 


F30: [NSW71, Jung78, Xuon79b, Nebe8la] A graph G is upper-embeddable if and 
only if €(G) < 1, or equivalently, if and only if G has a spanning tree T such that the 
co-tree G—T has at most one odd component. (This result was obtained independently 
by a number of researchers.) 

F31: [Kund74] Every edge-4-connected graph contains two edge-disjoint spanning 
trees. 


F32: Every edge-4-connected graph is upper-embeddable. 


F33: A number of other upper-embeddable graph classes have been identified, including 
the following: 
[Nebe81b] locally connected graphs, 


PaXu79] cyclically edge-4-connected graphs, 
SkNe89] k-regular vertex-transitive graphs of girth g with k > 4 or g > 4, 
Skov91] loopless graphs of diameter 2, 


HuLi00a] (44 + 2)-regular graphs and (2k)-regular bipartite graphs. 


Readers are referred to the original papers for more detailed definitions and results. 


EXAMPLES 


E2: The graphs of maximum genus 1 in Figure 7.2.3 all have cycle rank 3. In conse- 
quence, all these graphs are upper-embeddable. 


E3: The complete graphs K,, are upper-embeddable for all n > 1. 
E4: The complete bipartite graphs K,,,m are upper-embeddable for all n,m > 1. 


REMARK 


R4: The maximum genus of a graph can also be characterized in terms of ear decom- 
positions of the graph. We refer interested readers to [ChKa99] for more details. 


Section 7.2. Minimum Genus and Maximum Genus 753 


7.2.4 Lower Bounds 


The study of lower bounds for minimum genus and maximum genus has been an 
interesting topic in topological graph theory. The research has led to better understand- 
ing of these important graph invariants. Moreover, the study has played an important 
role in the study of some well-known graph problems, such as the famous Heawood 
problem. 


DEFINITIONS 


D24: A c-coloring of a graph G is a function from the vertices of G to a set of ¢ 
“colors” such that no two adjacent vertices are assigned the same color. 


D25: The chromatic number of a graph G is the minimum integer c such that 
the graph G has a c-coloring. The chromatic number of a surface S is the maximum 
integer h such that there is a simple graph G of chromatic number h and G' is embeddable 
in the surface S. 


Lower Bounds on Minimum Genus 


FACTS 


F34: Ifa graph G with n vertices and m edges contains no cycle of size less than d, 


then os 
2 Oe gi 


min G = 
min (G) (aa) 5 
F35: If G is a simple graph with n vertices and m edges, then 
m n 
min G oe ae 1 
Ymin(G) 2 E— at 


F36: Ifa simple graph G of n vertices and m edges has a triangulated embedding, 
then the embedding is a minimum genus embedding and 


m nr 
min(G ae ee. ee, 1 
Ymin(G) = at 


F37: [RiYo68] For the complete graph K,, of n vertices, with n > 3, 


asi [ee 


Proving this was most of the proof of the well-known Heawood problem, conjectured 
by Heawood in 1890 [Heaw90], and confirmed by Ringel and Youngs [RiYo68]. 


F38: [RiYo68] Ringel-Youngs Theorem (formerly the Heawood Conjecture) The 
chromatic number of the surface S, of genus g, for g > 0, is equal to 


pa 


This assertion for g = 0 is the Four-Color theorem [ApHa76]. 


F39: If G is a graph of chromatic number c, then ymin(G) > (c? — 7¢ + 12)/12. 
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Lower Bounds on Maximum Genus 


The study of lower bounds on the maximum genus for graphs concentrates on graphs 
that are not upper-embeddable. In view of Fact F32, such graphs have edge-connectivity 
less than 4. There is no useful lower bound for maximum genus in terms of number of 
vertices or of cycle rank. In order to derive meaningful lower bounds for graph maximum 
genus, a critical structural property of necklaces must be excluded, since necklaces can 
have arbitrarily large cycle rank, and yet have maximum genus at most 1. 


FACTS 


F40: [ChGr95] A graph of any fixed maximum genus M greater than zero and arbi- 
trarily large cycle rank can be constructed from a graph G of maximum genus M — 1, 
by replacing an edge e of G with the result of subdividing a proper edge that uniquely 
joins its endpoints, and splitting the necklace open at the new vertex into a “chain”, 
so that the univalent vertices of the chain are attached where the endpoints of edge e 
were. Moreover, this is essentially the only way to construct graphs of very large cycle 
rank while keeping the maximum genus small. 


F41: [CKG96] Suppose G is a simple graph of minimum degree at least 3. Then 


vimst@) = [> 


This bound is tight, in the sense that there are infinitely many simple graphs G of 
minimum degree 3 whose maximum genus is arbitrarily close to 8(G)/4 [CKG96]. 


F42: Let G be an edge-2-connected simple graph of minimum degree at least 3. Then 


vmet@) = [> 


This was first proved by Kanchi and Chen, using Fact F27, and then by Archdeacon, 
Nedela, and Skoviera, using Fact F29. These results are reported in [Ar+02]. 


F43: [Ar+02] Let G be an edge-3-connected graph. Then ynax(G) > [6(G)/3]. 


F44: [CAG96] There exists an infinite class of edge-3-connected simple graphs G whose 
maximum genus is equal to [6(G)/3]. Thus, the bound in Fact F43 is tight. 


REMARKS 


R5: The facts for edge-k-connected graphs just above can be translated without much 
trouble to k-connected graphs, since a k-connected graph is edge-k-connected. 


R6: Lower bounds on graph maximum genus have been derived in terms of graph 
connectivity, independent number, girth, and chromatic number. See [HuLi00b, LiLi00]. 


7.2.5 Algorithmic Issues 


Algorithms and complexity on planar graph problems have been extensively studied. 
In particular, there is a linear time algorithm that either constructs a planar embedding 
for a given graph or reports that the graph is not planar [HoTa74]. Based on graph pla- 
nar embeddings, many difficult graph problems become relatively easier. For example, 
there is a linear time algorithm that tests the isomorphism of planar graphs [HoWo74]. 
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Minimum Genus Algorithms 


The complexity of constructing minimum genus embeddings for graphs of bounded min- 
imum genus has been studied in the past 30 years. The first polynomial-time algorithm 
for the problem for graphs of minimum genus bounded by a constant g was proposed in 
1979, with time complexity O(n°\) [FMR79]. See [MyKol1] for recent updates and 
comments on this line of research. 

Fellows and Langston [FeLa88] indicated that the graph minimum genus problem of 
graphs of bounded minimum genus can be solved in polynomial time based on Robertson 
and Seymour’s results in graph minor theory. In fact, they showed a much stronger result 
that for any minor closed graph class C, there is a polynomial time algorithm that tests 
the membership for the class C. We describe their results here. 


DEFINITION 


D26: A graph G is an apex graph if it contains a vertex v such that G — v is planar. 


FACTS 


F45: [RoSe95] Let H be a fixed graph. There is a polynomial-time algorithm that 
decides for any given graph G whether H is a minor of G. 


F46: For any minor-closed graph class C, there is a polynomial-time algorithm that 
tests the membership for the class C. 


F47: For any fixed integer g, there is a polynomial-time algorithm that tests whether 
a given graph has minimum genus at most g. The set of minimal forbidden minors 
for the class of graphs of minimum genus at most g is unknown (even though by Fact 
F16, we know such a finite set exists). An algorithm for constructing a minimum-genus 
embedding of graphs of bounded minimum genus was developed by Mohar. 


F48: [Moha99] For each fixed integer g, there is a linear-time algorithm that, for a 
given graph G, either constructs an embedding of genus bounded by g for G or reports 
that no such embedding exists. 


F49: [Thom89] The problem of deciding for a graph G and integer k, decide whether 
Ymin(G) < k is NP-complete. 


F50: [Thom97a] The problem of deciding for an integer k and a graph G of maximum 
degree 3 has minimum genus at most k is NP-complete. 


F51: It is easy to test in polynomial time whether a given graph is an apex graph. 


F52: [Moha98] The problem of deciding whether an apex graph has minimum genus 
at most k is NP-complete. 


Maximum Genus Algorithms 


On the other hand, the construction of graph maximum genus embeddings turns out to 
be easier. 
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FACTS 


F53: [FGM88] There is a polynomial-time algorithm that constructs a maximum genus 
embedding for a given graph. Based on Xuong’s characterization of maximum genus 
(Fact F27), Furst, Gross, and McGeoch construct a maximum-genus embedding for a 
given graph by reducing the problem to the linear matroid parity problem, which was 
known to be solvable in polynomial time [GaSt85]. 


F54: [Chen94] For any fixed integer g, there is a linear-time algorithm to decide whether 
a given graph has maximum genus g; if so, the algorithm constructs a maximum-genus 
embedding for the graph. 


F55: [Chen94] For any fixed integer g, there is a linear-time isomorphism algorithm 
for graphs of maximum genus at most g. 


F56: [GrRi91] Starting from any embedding of a graph, there is a sequence of edge 
deletion-then-reinsertion operations that never decreases the embedding genus and even- 
tually leads to a maximum-genus embedding. Thus, there are no graph embeddings that 
are “strictly locally maximal” but not globally maximum with respect to embedding 
genus. On the other hand, there are graph embeddings that are not minimum genus 
embeddings but are “strictly locally minimal” that represent arbitrarily deep traps, 
in the sense that one must ascend arbitrarily higher in genus, before one reaches an 
embedding from which there is a monotonic descent to the global minimum genus. 
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INTRODUCTION 


This chapter explores the natural problem of constructing a surface-by-surface in- 
ventory of the imbeddings of a fixed graph, which was introduced by Gross and Furst 
[GrFu87]. The present scope includes several interesting extensions of that problem. 


7.3.1 Ranges and Distributions of Imbeddings 


An imbedding is taken to be cellular, unless it is clear from context that a non- 
cellular imbedding is under consideration. We regard two cellular imbeddings as “the 
same” if they have equivalent rotation systems. Moreover, a graph is taken to be 
connected unless the context implies otherwise. Minimum genus and maximum genus 
are presented in $7.2. We are concerned here with the entire genus range. 


DEFINITIONS 


D1: The minimum genus of a graph G, denoted Ymin(G), is the smallest positive 
integer g such that the graph G has an imbedding in the orientable surface Sy. 


D2: The maximum genus of a graph G, denoted ymax(G), is the largest integer g 
such that the graph G has a cellular imbedding in the orientable surface S,. 


D3: The genus range of a graph G is the integer interval [Ymin(G), Ymax(G)]. 
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D4: The j*” orientable imbedding number of a graph G, denoted gj(G), is 
the number of equivalence classes of orientable imbeddings of G into the orientable 
surface S; or equivalently (see §7.1), the number of rotation systems for graph G' that 
induce an imbedding in Sj. 


D5: The genus distribution sequence of a graph G is the sequence whose j*” entry 
is gj(G). 


D6: The genus distribution polynomial is 
Ia(x) = > 9j(G)2! 
j=0 


D7: The minimum crosscap number of a graph G, also known as the mini- 
mum non-orientable genus, is the smallest integer k such that the graph G has an 
imbedding in the non-orientable surface Nz. It is denoted ¥,,;,,(G). 


D8: The maximum crosscap number of a graph G, also known as the maximum 
non-orientable genus, is the largest integer k such that the graph G has a cellular 
imbedding in the no-norientable surface Nz. It is denoted F,44(G). 


D9: The crosscap range of a graph G is the integer interval [F,,:n(G), Ymax(@)]- 
D10: The j*” crosscap imbedding number of a graph G, denoted @;(G), is the 
number of equivalence classes of non-orientable imbeddings of G into the non-orientable 


surface N;. 


D11: The crosscap distribution sequence of a graph G is the sequence whose j*” 
entry is %;(G). 


D12: The crosscap distribution polynomial is 


Ia(y) = ye ,(G)y! 


D13: The boundary walk of a face of an imbedding is called an fb-walk. 


FACTS 


F1: Let G be a graph. Then the total number of equivalence classes of orientable 
imbeddings equals 


So 9i(G) = J] [deg(v) - 1)! 
j=0 veEV(G) 


since the sum on the left and the product on the right both count every imbedding of 
G exactly once. Moreover, the polynomial evaluation I[g(1) gives this same number. 


F2: [Du66] [Interpolation Theorem] For every integer j within the genus range of 
a graph G, ie., whenever Ymin(G) < 7 < Ymax(G), the number g;(G) of orientable 
imbeddings of G' is positive. 
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F3: [St78] For every integer j within the crosscap range of a graph G, i.e., whenever 
Vmin(G) < 3 < Vmax(G), the number %;(G) of non-orientable imbeddings of G is 
positive. 


F4: Let G be a graph. Then the total number of equivalence classes of imbeddings 
(orientable and non-orientable) equals 


Co 


So 9(G) + S53(G) = 2° TT [deg(v) — 1}! 
j=0 


j=l veEV(G) 


since the sum on the left and the product on the right both count every imbedding 
of G exactly once. The factor of 2° on the right accounts for the possible choices of 
orientation on every edge not in a designated spanning tree for G. 


EXAMPLE 


E1: All of the examples of genus distributions in Table 7.1 can be calculated by 
considering the corresponding rotation systems (see §7.1). Consideration of symmetries 
expedites the calculations. 


Table 7.1: Genus distributions of some familiar graphs. 


graph G | g9o(G) m(G) g92(G) 93(G)__ga(G) 
Ka 2 14 0 0 0 
bouquet By 4 2 0 0 0 
dipole: Ds 2 2 0 0 0 
Kos 0 40 24 0 0 
Kg x C3 2 38 24 0 0 


REMARK 


R1: Complementary to the graph-theoretic problems concerned with counting the 
imbeddings of a given graph over a range of surfaces are the map-theoretic problems of 
counting the maps on a given surface, taken over all possible imbedded graphs, or over 
all graphs with some prespecified property. (See §7.6). 


7.3.2 Counting Noncellular Imbeddings 


This section describes how the problem of calculating distributions of noncellular 
imbeddings reduces to counting cellular imbeddings. Explicit discussion of methods for 
achieving such a reduction are scarce in the literature. 


DEFINITIONS 


D114: A semicellular graph imbedding is an imbedding G — S whose regions are 
planar, but which may have more than one boundary component. 


D15: A graph imbedding G > S is strongly noncellular if any of its regions is 
nonplanar. 
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D16: A closed curve that separates a region of a noncellular graph imbedding G > S$ 
is boundary-separating if there is at least one boundary component of the region on 
each side of the separation. 


D17: A closed curve in a region of a noncellular graph imbedding G — S is strongly 
noncontractible if cutting it open and capping off the holes with disks reduces the 
genus of the region. 


D18: Given a semicellular graph imbedding G > S, the underlying cellular imbed- 
ding is obtained by cutting each non-cell region open along a maximal family of 
boundary-separating closed curves and capping the holes with disks. 


D19: Given a noncellular graph imbedding G > S, a planarizing curve for a 
nonplanar region is a separating closed curve such that all of the boundary components 
lie to one side of the separation and all of the handles lie to the other. The concept is 
that the resulting component with all the handles contains no part of the graph and is 
discarded. 


D20: Given a strongly noncellular graph imbedding G — S, the underlying semi- 
cellular imbedding is obtained by cutting each non-cell region open along a maximal 
family of boundary-separating closed curves and capping the holes with disks. 


FACTS 


F5: Every semicellular graph imbedding has an underlying cellular imbedding that is 
unique up to homeomorphism. 


F6: The semicellular orientable imbeddings of a graph are in bijective correspondence 
with partitions of the regions of the underlying cellular imbedding. 


F7: Every nonplanar region of a noncellular graph imbedding has a planarizing curve. 


F8: Every strongly noncellular imbedding has an underlying semicellular imbedding 
that is unique up to homeomorphism. 


F9: The strongly noncontractible imbeddings of a graph are in bijective correspondence 
with the set of functions from the regions of the underlying semicellular imbedding to 
the nonnegative integers. 


F10: A strongly noncellular imbedding G > Sp4, of a graph in a surface can be 
obtained from a semicellular imbedding into G > S,, by partitioning the number k, and 
next selecting one face of the imbedding into S,, for each of the parts of the partition, 
and then increasing the genus of each selected face by the value of the associated part 
of the partition. 


EXAMPLES 
E2: The graph Ke x C3 has six vertices, each of degree 3. Thus, by Fact F1, the total 


number of orientable cellular imbeddings is 64 = 2°. The cellular genus distribution 
sequence is 2,38, 24,0,0,..., as given in Table 7.1. 
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E3: Each of the two imbeddings of K2 x C3 in So has five faces, as shown in Figure 
7.3.1. Five faces can be partitioned into four nonempty parts in five ways. (In general, 
one can use Stirling subset numbers for these partition-number calculations.) Thus, 
each imbedding in Sp yields five different possible semicellular imbeddings in S;. Thus, 
there are ten semicellular noncellular imbeddings of K2 x C3 in $1, plus 38 cellular 
imbeddings, as mentioned in Example E2, for a total of 48 semicellular imbeddings 
in S}. 


Figure 7.3.1: The graph Kk x C3. 


E4: Each semicellular imbedding of K2 x C3 in the surface Sz corresponds to a partition 
into three parts of the five faces of a cellular imbedding in Sp or to a partition into two 
parts of the three faces of a cellular imbedding into S;. Using the Stirling numbers 
{>} = 25 and ih = 3, and using the cellular genus distribution sequence from Example 
E2, we calculate that the number of semicellular (but noncellular) imbeddings into S) 
equals 2-25 + 38-3 = 164. Adding in the 24 cellular imbeddings in S2, we obtain a 
total of 188 semicellular imbeddings in 5S. 


7.3.3. Partitioned Genus Distributions 


Often a graph G is obtained by “pasting” two smaller graphs (called amalgamands) 
together, which means matching some subgraph in one of them to an isomorphic sub- 
graph in the other. To calculate the genus distribution of G from those of the two 
smaller graphs, it is necessary to partition the genus distributions of the smaller graphs 
according to the incidence of fb-walks on the respective subgraphs that are matched 
together. In this subsection, we describe the partitioning for pasting on a single vertex 
or on a single edge. 


DEFINITIONS 


D21: A rooted graph is a tuple (G, 21, %2,...,£n) in which G is a graph and 21,...,%n, 
are either vertices or edges that have been designated as roots. 


D22: The most frequently encountered graph amalgamations are of four types. 
Types (i) and (ii) are called vertex amalgamations. Types (iii) and (iv) are called 
edge amalgamations. 


(i) Let (G,u) and (G’,u’) be singly vertex-rooted graphs. One kind of amalgamated 
graph is formed from the disjoint union GU G’ by merging the vertices u and wu’. 
It is usually construed to have no root. 

(ii) Let (G, u,v) and (G’,s,t) be doubly vertex-rooted graphs. Another kind of amal- 


gamated graph is formed from the disjoint union GU G" by merging the vertices 
v and s. It is construed to have roots u and t. 
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(iii) Let (G,d) and (G’,d’) be singly edge-rooted graphs. A third kind of amalgamated 
graph is formed from the disjoint union GUG’ by either way of merging the edges 
dand d’. 

(iv) Let (G,d,e) and (G’,x,y) be doubly edge-rooted graphs. A fourth kind of amal- 
gamated graph is formed from the disjoint union GU G’ by either way of merging 
the edges e and x. It is construed to have roots d and y. 

The concept of amalgamation can be generalized to allow the roots to be arbitrary 
subgraphs, in which case one also specifies an isomorphism from a root of the first 
amalgamand to a root of the second amalgamand. 


D23: Each type of root structure is associated with its own kind of partitioned genus 
distribution. 


(i) Let (G,u) be a vertex-rooted graph such that root u is 2-valent. Then for each 
J = Ymin(G),.--,Ymax(G), the g;(G) imbeddings G > 5; are partitioned into the 
d;(G, u) imbeddings such that two different fb-walks are incident on root-vertex u 
and the s;(G,u) imbeddings such that the same fb-walk is twice incident on uw. 


— 
_ 
me 

wa 


Let (G,b) be an edge-rooted graph such that both endpoints of root-edge e are 
2-valent. Then for each j = Ymin(G),.-.,%maz(G), the g;(G) imbeddings G — S; 
are partitioned into the d,;(G,b) imbeddings such that two different fb-walks are 
incident on root-edge b and the s;(G, b) imbeddings such that the same fb-walk is 
twice incident on b. 


wa 


Let (G,u,v) be a doubly vertex-rooted graph such that both roots are 2-valent. 
Then for each 7 = Ymin(G),.-.,%maz(G), the g;(G) imbeddings G > 5S; are 
partitioned into four subtypes: 


(iii 


e the dd;(G,u) imbeddings such that two different fb-walks are incident on 
root-vertex u and two different fb-walks are incident on root-vertex v; 

e the ds;(G,u) imbeddings such that two different fb-walks are incident on 
root-vertex u and the same fb-walk is twice incident on v; 

e the sd;(G,u) imbeddings such that the same fb-walk is twice incident on 
root-vertex u and two different fb-walks are incident on root vertex v; 

e the ss;(G,u) imbeddings such that the same fb-walk is twice incident on 
root-vertex u and the same fb-walk is twice incident on v. 


(iv) Let (G, b,c) be a doubly edge-rooted graph such that both endpoints of both root- 
edges are 2-valent. Then for each 7 = Ymin(G),.--,Y%maz(G), the g;(G) imbeddings 
G — S; are partitioned into four subtypes: 


e the dd;(G, b,c) imbeddings such that two different fb-walks are incident on 
root-edge b and two different fb-walks are incident on c; 

e the ds;(G,u) imbeddings such that two different fb-walks are incident on 
root-edge b and the same fb-walk is twice incident on c; 

e the sd;(G,u) imbeddings such that the same fb-walk is twice incident on 
root-edge b and two different fb-walks are incident on root c; 

e the ss;(G,u) imbeddings such that the same fb-walk is twice incident on 
root-edge u and the same fb-walk is twice incident on c. 


Each of the names dd;, ds;, sd;, and ss; associated with the subtypes of imbeddings is 
called a partial, and the number of imbeddings of a subtype is called the value of the 
corresponding partial. 
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REMARK 


R2: In practice, it is usually necessary to further partition double-root partials into 
subpartials that indicate how many fb-walks incident on one root are also incident on 
the other root. 


EXAMPLE 


E5: We consider the complete graph K4 with two vertices u and v as roots inserted 
at the midpoints of two non-adjacent edges, as in Figure 7.3.2. Then we partition the 


Figure 7.3.2: The doubly vertex-rooted graph (K4, u,v). 


genus distribution go(K4) = 2, g1(K4) = 14 as follows: 


do(Kau)=2 so(Kau)=0 
ddo (Ka, u, v) =2 dso(K4,u,v) = 0 sdo(K4, u, v) =0 s89(Ky4, u,v) =0 
d;(K4, u) =8 81(K4, u) =6 


dd,(K4,u,v) =4 ds,(K4,u,v) =4 sdo(Ka,u,v) =4 88o(Ka,u,v) =2 


These numbers can be confirmed by using the face-tracing algorithm of the 16 imbed- 
dings. Using symmetries reduces the required effort. 


7.3.4 Graph Amalgamations 


Bar Amalgamations 


DEFINITIONS 


D24: A bar-amalgamation of two disjoint vertex-rooted graphs (G,u) and (H,v) 
is obtained by running a new edge e between the roots u and v, as illustrated in Fig- 
ure 7.3.3. Notation: (G,u) *bar (H,v). The isomorphism type of a bar-amalgamation 
depends on the choice of root-vertices in the two graphs. 


Figure 7.3.3: A bar-amalgamation of Ky and Ks; —e. 


D25: The convolution of the sequences (a;) and (b;) is the sequence whose k*® term 


is 
k 
s aides 
i=0 
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FACTS 


F11: [GrFu87] The genus distribution of a bar-amalgamation of the graphs (G,u) 
and (H,v) is representable as the convolution of their respective genus distributions, 
multiplied by a scalar equal to the product deg(u)deg(v). 


F12: For any two vertex-rooted graphs (G,u) and (H,v), with ve: u—> v, 
Ymin ((G, u) *. (H,v)) = Ymin((G, ¥) *bar (H, v)) 


F13: For any two vertex-rooted graphs (G,u) and (H,v), with ve: uv, 


Ymaz((G,U) *bar (H,v)) < Ymae((G,u) *. (A,v)) < Ymae((G, U) *bar (A, v)) +1 


F14: The genus distribution sequence of a vertex-amalgamation (G, uw) *, (H,v) domi- 
nates the genus distribution sequence of the bar-amalgamation (G, u) *bar (H,v). That 
is, every term of the former is at least as large as the corresponding term of the latter. 


EXAMPLES 

E6: The bar-amalgamation [4 *par 43,3 has the following genus distribution sequence: 
9 (0, 80, 608, 336, 0, 0,...) 

E7: The bar-amalgamation K3 *pa, K3 has the genus distribution sequence 


4,0,0,0,... 


Vertex Amalgamations 


Quite commonly, an interesting sequence of graphs can be specified recursively, so that 
after starting off with a base graph (X,s,t), subsequent graphs are obtained by it- 
eratively amalgamating copies of some other graph (Y,u,v), as we describe in this 
subsection. 


DEFINITIONS 


D26: A production for an amalgamation of two doubly vertex-rooted graphs is a rule 
of the form 


pi(G, U, v) * pi (G", bs v’) ars S- Oe Vit jtke (G * G', U, v') 
L 


where p; and Di; are types of subscripted partials and where each of the summands on 
the right is a subscripted partial q;+;+%, preceded by the number ay of imbeddings 
corresponding to that subscripted partial. The genus of the surface of every imbedding 
must be at least 7+ 7, the sum of the genera of the two imbeddings on the left of 
the production. The added term ky reflects the fact that the genus of some resultant 
imbeddings may be larger than that sum. 


D27: The doubled path DP,, is obtained from an n-edge path by doubling every 
edge. 
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FACT 


F15: The following two productions are sufficient to calculate the genus distribution 
of the doubled path DP,, recursively. To simplify the calculation, we have allowed the 
letter x to stand for either d or s. 


zd;(G, u,v) *dd;(G’,w’,v') —> 4adj.;(G* G’,u,v’) + 205:4541(G * G, u,v’) 
r8;(G,u,v) *dd;(G’,u',v') —> 6rdi,;(G* G',u,v’) 


EXAMPLE 


E8: We use Fact F15 to calculate the partitioned genus distributions and genus distri- 
butions of the doubled paths DP2, DP3, and DP,, starting from dd(DP,) = 1. 


ae 4xrdg(DP2) + 2x81(DP2) 


xrdp = 64, xd, = 96, rs, = 32, rsq = 24 
Jo = 64, n= 128, 92 = 24 


(DP) 
pgd(DP») rdp = 4, rs, = 2 
gd( DP») go =4, 91 =2 
xtdo(DP2) *ddo(DP,) —> 4-4-1ado(DP3)+2-4-1xs,(DP3) 
28,(DP») *ddo(DP,) —> 6-2-1rd,(DP3) 
pgd(DP3)  : ado = 16, xd; = 12, vs, = 8 
gUDP3)  : go = 16, 91 = 20 
ady(DP3) *ddg(DP,) —> 4+16-1xdo(DP,) +2-16-1xs,(DP,) 
ad,(DP3) *ddg(DP,) —> 4+12-1xd\(DP,) +2-12-1xs9(DP,) 
2s,(DP3) *ddo(DP}) —> 6-+8-1rd\(DP;) 
(DP) 
(DP) 


7.3.5 Genus Distribution Formulas for Special Classes 


Even at the outset of the program to provide explicit calculations of imbedding 
distributions, it was clear that a variety of techniques would be needed. Different 
topological and combinatorial methods seem to be needed for every class of graphs. 


DEFINITIONS 


D28: The n-rung closed-end ladder L,, is the graph obtained from the cartesian 
product P, x K2 by doubling the edges v1 x Ke and vp x Ke at both ends of the path, 


as illustrated in Figure 7.3.4. 


Figure 7.3.4: The 3-rung closed-end ladder Ls. 
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D29: The cobblestone path J, is the graph obtained by doubling every edge of the 
n-vertex path P,, as illustrated in Figure 7.3.5. 


CSSe 


Figure 7.3.5: The cobblestone path Js. 


D30: The n-bouquet B,, is the graph with one vertex and n self-loops, as illustrated 


in Figure 7.3.6. 
"O "CO A 


Figure 7.3.6: Some bouquets. 


D31: The dipole D,, is the graph with two vertices, n edges, and no self-loops. 


D32: An outerplanar imbedding of a graph G is an imbedding such that there is a 
face whose boundary walk contains every vertex of G. 


D33: An outerplanar graph is a graph that has an outerplanar imbedding. 
D34: <A Halin graph is the graph that results from an ordered plane tree when a 


cycle is drawn in the plane through all the leaf vertices, as shown in Figure 7.3.7 in the 
order that they occur on a preorder traversal. 


Figure 7.3.7: A Halin graph. 


FACTS 


F16: [FuGrSt89] The closed-end ladders have the following formula for their genus 
distributions: 


=144 41-1) 2n+2—3i ; a 
G (En) = DT) cee. TOE | 
> : otherwise 


The following table shows the genus distributions for some of the smaller ladders. 
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go 91 g2 g3 ga total 


Ly 2 2 0 0 0 4 
Iz 4 12 0 0 oO 16 
[3 8 40 16 0 O- 64 
Ly 16 112 128 O O- 256 
L5 32 288 576 128 O 1024 


F17: [FuGrSt89] The cobblestone paths have the following formula for their genus 
distributions: 


GAR SH aes ee aga s) fori >Oandn>1 
i i 
The following table shows the genus distribution for some of the smaller cobblestone 
paths. 


go 91 g2 total 
J, 4 2 0 6 
Jz 16 20 0 36 
Jz 64 128 24 216 
J4 256 704 336 1296 


F18: [GrRoTu89] The bouquets have the following formula for their orientable imbed- 
ding numbers: 


93 (Bn) = (m— 1). 2°» en—aj41 (m2) 


where the numbers 


ex (n) = 


\" Ee Non 


where po is an arbitrary fixed cycle of length 2n 


permutation 7 has & cycles, and 
5 full involution 8) [7 = po o 8] 


— 


are given by the formula of Jackson [Ja87]. The closed formula above for g; (B,,) leads 
to the following recursion. 


Initial conditions: 


g;(Bn) = 0 for 7 <0 or n<0 
g;(Bo) = 9;(Bi1) =0 for 7 >0 


4 forj7=0 
g;(B2) = 2 for 7 = 1 
0 for j>2 


Recursion for n > 2: 


(n + 1)9;(Bn) = 4(2n — 1)(2n — 3)(n — 1)? (n — 2)9;(Bn—2) 
+ 4(2n — 1)(n — 1)g;(Bn-1) 
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This recursion enables us to calculate numerical values. 


9g; (Bn) 7 =0 1 2 Total 
n=0 1 1 
1 1 Lyd 
2 4 2 3!=6 
3 40 80 5! = 120 
4 672 3360 1008 7! = 5040 
5 16128 161280 185472 9! = 362880 


F19: Rieper [Ri90] elaborated upon the use of group characters in his analysis of 
the genus distribution of dipoles, which are graphs with two vertices and no self-loops. 
Andrews, Jackson, and Visentin [AnJaVi94] took a map-theoretic approach to dipole 
imbeddings. 


F20: Stahl [St9la] calculated genus distributions for small-diameter graphs. 


F21: Kwak, Kim, and Lee [KwKiLe96] took a distributional approach in studying a 
class of branched coverings of surfaces. 


F22: Using doubly vertex-rooted and doubly edge-rooted graphs, Gross, Khan, and 
Poshni (see [GrKhPo10], [Grlla], [KhPoGr10], and [PoKhGr10]) developed quadratic- 
time algorithms for calculating the genus distribution of chains, cycles, and twisted 
cycles of arbitrarily many copies of a given graph of known genus distribution. 


F23: [Gr11b] and [PoKhGr11] give quadratic-time algorithms for the genus distribution 
of any 3-regular and 4-regular outerplanar graphs, respectively. 


F24: Outerplanar graphs have treewidth 2 (see [Bo98}]). 


F25: [Grl2a] gives a quadratic-time algorithm for the genus distribution of any 3- 
regular Halin graph. 


F26: Halin graphs have treewidth 3 (see [Bo98]). 


F27: [GrkKol2] gives a quadratic-time algorithm for any cubic series-parallel graph 
and, more generally, for any graph of treewidth 2 and maximum degree at most 3. 


F28: [Gr12b] gives a quadratic-time algorithm for any class of graphs of fixed treewidth 
and bounded degree. It is not a practical algorithm. 


REMARKS 


R3: Ladder-like graphs played a crucial role in the solution of the Heawood map- 
coloring problem. (See [Ri74].) McGeoch [McG87] calculated the genus distribution of 
circular ladders and of Mobius ladders. Tesar [Te00] calculated the genus distribution 
of Ringel ladders. 


R4: The computations of imbedding distributions of ladders and cobblestone paths 
were subsequently generalized by Stahl [St91a] to linear families. 


R5:  Riskin [Ri95] took a distributional approach in studying a class of polyhedral 
imbeddings. 
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R6: Stahl [St97] studied the zeroes of a class of genus polynomials. 


R7: Among the important properties of bouquets to topological graph theory is that 
every regular graph can be derived by assigning voltages (possibly permutation voltages) 
to a bouquet. (See [GrTu77] or [GrTu87].) 


7.3.6 Other Imbedding Distribution Calculations 


Including non-orientable surfaces in the inventory requires some additional theory, 
partly because the possible twisting of edges complicates the recurrences one might 
derive. Yet another enumerative aspect of graph imbeddings regards as equivalent any 
two imbeddings that “look alike” when vertex and edge labels are removed. 


DEFINITIONS 


D35: The total imbedding distribution of a graph G is the bivariate polynomial 


Ia(a,y) = Ia(a) + Taly) = 5 on(G)x* + $2 ae (@)y* 
k=0 k=1 


D36: Given a general rotation system p for a graph G and a spanning tree T’, the 
entries of the overlap matrix M,,r = [m,,;] are given for all pairs of edges e;,e; of 
the co-tree G — T by 


1 ift A J and pure (p) is nonplanar 
eos Tbe; +e; 
“J \1 ifi=j and edge i is twisted 
0 otherwise 


The notation pure(p)|r+e,+e, means the restriction of the underlying pure part of the 
rotation system p to the subgraph T' + e; + e;. 


D37: The imbeddings 1; :G—- S and tg: G > S are congruent if there exist a 


graph automorphism a: G + G and a surface homeomorphism h: S — S such that 
the diagram in Figure 7.3.8 is commutative. We write 11 & lo. 
————— > § 


‘| i 


G ——> $ 


by 


Figure 7.3.8: Commutativity condition for imbedding congruence ¢1 ~ ta. 
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FACTS 


F29: When non-orientable imbeddings of a graph G are also to be considered, the 
total number of imbeddings increases by a factor of 2°(@)—1, since each of the 6(G) 
edges in the complement of a spanning tree may be twisted or untwisted. 


ig(1,1) = 2° TT [deg(v) - 1)! 
vEV(G) 


F30: [Mo89] Let G be a graph, T a spanning tree of G, and p a general rotation system 
of G. Then 


ac ae 2g if S(p) = S, (induced surface orientable) 
® kif S(p) = N; (induced surface non-orientable) 


Thus, the genus of the surface induced by a rotation system can be calculated without 
doing face-tracing. 


F31: [ChGrRi94] Calculating the surface type by face-tracing requires O(n) time for 
a graph with n edges, and calculating the rank of the overlap matrix deteriorates to 
O(n?) time. However, regrouping the total set of imbeddings according to rank of the 
overlap matrix sometimes facilitates calculation of the total imbedding distribution. 


F32: [ChGrRi94] Closed-end ladders have the following total imbedding distribution 
polynomials. 


ts tate tip=ntl r gin Qintl 
I, (@,y) = 2” x grt II [round (=) + round ( 3 ) 7 


dig SO hai 
— Iz, (y*) + In, (2) 


F33: [ChGrRi94] Cobblestone paths have the following total imbedding distribution 
polynomials. 


‘ tate tip=nt1 r Qin Qintl 
Ij, (2, y) = 2” | ae gntr-lyntinr U [round (=) + round ( 3 ) ql 
ds str = 


— 13, (y*) + I, (@) 


REMARKS 


R8: [MuRiWh88] counted congruence classes of imbeddings of K,, into oriented sur- 
faces. The key to counting congruence classes was to convert the cycle index of Aut(G) 
acting on V@ into the cycle index for the induced action on the rotation systems. 


R9: [KwLe94] counted congruence classes of imbeddings into non-orientable surfaces. 
One of their underlying ideas is to regard an edge-twist as the voltage 1 (mod 2) and 
to construct the orientable double cover. Then the graph automorphisms act on the 
induced rotation systems. 


R10: [KwSh02] developed a formula for the total imbedding distributions of bouquets. 
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EXAMPLES 


E9: In the illustrative calculation of Figure 7.3.9, the spanning tree has edges 4, 5, 
and 6. Thus, the rows and columns correspond to co-tree edges 1, 2, and 3. Since the 
rank of the matrix is 3 and the imbedding is non-orientable, the imbedding surface must 
be N3 (by Fact F30). 


0 
4 0 
; 


o-oo 


{ 
0 
0 
2 overlap matrix 


Figure 7.3.9: Sample calculation of the overlap matrix. 


E10: In deriving Fact F32, [ChGrRi94] chose a tree T in the ladder graph with a path 
as a co-tree, as in Figure 7.3.10. This yielded a “tridiagonal” overlap matrix, which is 
a convenient property in rank calculations. 


xx00000 

2 3 4 5 6 xxx0000 
Oxxx000 

1 7 00xxx00 
000xxx0 

0000xxx 

00000x x 


Figure 7.3.10: Ladder Lg, spanning tree, and tridiagonal overlap matrix. 


E11: In deriving Fact F33, [ChGrRi94] chose a tree T in the cobblestone path again 
with a path as a co-tree, as in Figure 7.3.11. This again yielded a “tridiagonal” overlap 
matrix. 


Figure 7.3.11: Cobblestone path J; and spanning tree. 


E12: In regard to Remark R8, Figure 7.3.12 shows how the 16 different orientable 
imbeddings of the complete graph Ky are partitioned into congruence classes. 


2 like this in sphere 6 like this in torus 8 like this in torus 


Figure 7.3.12: Partitioning the 16 imbeddings of K4 into congruence classes. 
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7.3.7 The Unimodality Problem 


DEFINITIONS 


D38: A sequence {a,,} is unimodal if there exists at least one integer M such that 
AQm—1 <Q@m_ for all m< M and 


am > Am+1 forall m>M 
D39: A sequence {a} is strongly unimodal if its convolution with any unimodal 
sequence yields a unimodal sequence. 
FACTS 


F34: <A typical unimodal sequence first rises and then falls, as illustrated in Figure 
7.3.13. 


Figure 7.3.13: A unimodal sequence has no false maxima. 


F35: [KeGe71] An equivalent criterion for strong unimodality is that 
az, > @m+41Gm—1 for all m 


F36: The genus distributions of the closed-end ladders [FuGrSt89], cobblestone paths 
[FuGrSt89], and bouquets [GrRoTu89] are all strongly unimodal. 


F37: [GMTW13a] Arbitrarily long chains of copies of various small graphs, including 
the complete graph Ky, the 4-wheel W4, the Mobius ladder ML, and the circulant 
graph circ(7: 1,2), have strongly unimodal genus distributions. 


F38: [GMTW13b] Iterated claw graphs have real-rooted genus polynomials, which 
implies that these genus polynomials are strongly unimodal. 
REMARKS 


R11: We observe that an imbedding of the bouquet B,, has n+1 faces if in the sphere 
So, 2 —1 faces if in the torus $;, n — 3 faces if in the surface $2, and so on. Intuitively, 
this suggests that the genus distribution of the bouquet B, might resemble the sequence 


of Stirling cycle numbers 
2n 2n 2n 
nt+1}’? |n—-1]’ [n—-3)]? 


which is a strongly unimodal sequence. 


R12: [St91la] The resemblance to Stirling numbers holds also for various graphs of 
small diameter, including partial suspensions of trees and of cycles. 


R13: [St90] The genus distribution of the bouquet B, is asymptotically proportional 
to this sequence. The proof uses group character theory. 

RESEARCH PROBLEM 

RP1: Decide whether the genus distribution of every graph is strongly unimodal. 
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7.3.8 Average Genus 


DEFINITIONS 


D40: The average genus of a graph G, denoted yYavg(G), is the average value of the 
genus of the imbedding surface, taken over all orientable imbeddings. 


D41: The cycle rank of a connected graph G is the number |Eg| — |Ve| + 1; this is 
denoted 6(G) and conceptually best understood as the number of edges in the co-tree 
of a spanning tree for G. 


D42: [GrK1Ri93] A necklace of type (r,s) is obtained from a 2r + s-cycle by doubling 
r disjoint edges and then adding a self-loop at each of the s vertices that is not an 
endpoint of a doubled edge. 


D43: Let e be an edge of a graph. We say that we attach an open ear to the interior 
of edge e if we insert two new vertices u and v and then double the edge between them. 
The two new vertices are called the ends of that open ear. 


D44: We attach a closed ear to the interior of edge e if we insert one new vertex w 
in its interior and then attach a self-loop at w. The vertex w is called the end of that 
closed ear. 


D45: We say that r open ears and s closed ears are attached serially to the edge e 
if the ends of the ears are all distinct, and if no ear has an end between the two ends of 
an open ear. 


FACTS About Average Genus 


F39: [GrKIRi93] The average genus of a graph with nontrivial genus range can lie 
arbitrarily close to the maximum genus. 


F40: [Grk1Ri93] The average genus of a graph is at least as large as the average genus 
of any of its subgraphs. 


F41: [ChGrRi95] For any 3-regular graph G, 


1 
Yave (G) 2 3 Ymax (G) 


F42: [(ChGrRi95] For any 2-connected simple graph G other than a cycle, 


1 
rora(@) = =8(G) 


F43: [Ch94] Isomorphism testing of graphs of bounded average genus can be achieved 
in linear time. 


F44: [GrFu87] The average genus of the bar-amalgamation of two graphs G and H 
equals Yave(G) + Yave(H). 


F45: [ChGr92b] Let G be a 2-connected graph, and let G; be a graph obtained by 
serially attaching ears to an edge of G. Then 


Navel Gy) < Yave (G+) < Vail G) el 
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FACTS About Small Values of Average Genus 


F46: A graph has average genus 0 if and only if at most one cycle passes through any 
vertex. This follows from [NRSW72]. 


F47: The maximum genus of a necklace is 1. This follows from [Ku79]. 
F48: [GrKI1Ri93] The average genus of any necklace of type (r,s) is 
(2) (3) 
2 3 
F49: [GrK1Ri93] Each of the six smallest possible values of average genus is realizable 
by a necklace. Figure 7.3.14 indicates these values and shows a graph realizing each of 


them. 
i 1 5 2 19 3 
3 G3 2 +4 9 3 27 4 2 


Figure 7.3.14: Realizations of the six smallest positive values of average genus. 


F50: [ChGr93] Except for necklaces, there are exactly eight 2-connected graphs of 
average genus less than one. The bouquet Bs, the dipole D4, and the complete graph 


Ky, have average genus 
4 Se g 9 5 7 


a. 6" 
respectively. Figure 7.3.15 shows the other five such graphs and their average genus. 


«I ee Oe yaa ote 


Figure 7.3.15: Five sporadic 2-connected graphs with average genus less than one. 
F51: Facts F44 and F50 together yield a complete classification of all graphs of average 
genus less than one. 


F52: [ChGr93] There are exactly three 2-connected graphs with an average genus of 1. 


«<p UL 


Figure 7.3.16: The three 2-connected graphs with average genus equal to one. 
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FACTS about close values of average genus 


F53: [GrK1Ri93] Arbitrarily many mutually nonhomeomorphic 2-connected graphs 
can have the same average genus. 


F54: [ChGr92a] For each real number r, only finitely many 3-connected graphs have 
average genus less than r. 


F55: [ChGr92a] For each real number r, only finitely many 2-connected simple graphs 
have average genus less than r. 
FACTS about limit points of average genus 


F56: [GrK1Ri93] The number 1 is an upper limit point of the set of possible values of 
average genus. 


F57: [ChGr92a] The set of possible values of average genus for 3-connected graphs has 
no limit points. 


F58: [ChGr92a] The set of possible values of average genus for 2-connected simple 
graphs has no limit points. 


F59: [ChGr95] Lower limit points of average genus do not exist. 


REMARKS 


R14: Fact F48 provides a means for constructing upper limit points. In fact, all limit 
points arise from this construction. 


R15: Additional results on average genus are given by [MaSt96], [Sc99], [St95a], and 
[St95b]. 


7.3.9 Stratification of Imbeddings 


Superimposing an adjacency structure on the distribution of orientable imbeddings 
appears to offer some insight into the problem of deciding whether two given graphs are 
isomorphic. 


DEFINITIONS 


D46: Two orientable imbeddings 1; and v2 of the same graph G are V-adjacent if 
there is a vertex v of G such that moving a single edge-end at v is sufficient to transform 
a rotation system representing J; into a rotation system representing 2. 


D47: Two orientable graph imbeddings 1; and t2 of the same graph G are E-adjacent 
if there is an edge e of G such that moving both edge-ends of e can transform a rotation 
system representing 4; into a rotation system representing Le. 


D48: For any graph G, the stratified graph SG has as its vertices the orientable 
imbeddings of G. Its edges are the V-adjacencies and the E-adjacencies. 
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D49: The induced subgraph of SG on the set of imbeddings into the surface S; is 
called the j** stratum of SG and is denoted $jG. 


D50: A complete isomorphism invariant for a graph is a graph invariant that has 
a different value on each isomorphism type of graph. 


FACTS 
F60: The number of vertices of the j*" stratum SjG is g;(G). 


F61: [GrTu79] There may be false minima in the stratified graph, that is, local minima 
that are not global minima. 


F62: [GrRi91] The false minima may have arbitrarily great depth. 


F63: [GrRi91] No false maxima exist, so that it is possible to ascend from any imbed- 
ding to a maximum imbedding, even though strict ascent might not always be possible. 


F64: [GrTu95] For every vertex of the stratified graph SG, the induced subgraph on 
its set of neighbors in SG is a complete isomorphism invariant of the graph. 


REMARKS 


R16: This is consistent with [Th89], which proves that the minimum genus problem 
is NP-complete. 


R17: This is consistent with [FuGrMc88], which establishes a polynomial-time algo- 
rithm for maximum genus. 


R18: [GrTu95] also demonstrated how two graphs with similar genus distributions 
may have markedly different imbedding strata. These findings support the plausibility 
of a probabilistic approach to graph isomorphism testing, based on the sampling of 
higher-order imbedding distribution data. 
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INTRODUCTION 


In the voltage graph construction, a small graph with algebraic labels (called volt- 
ages) on its edges specifies a large graph with global symmetries. A Cayley graph for 
a group can be specified by assigning group elements to the self-loops of a one-vertex 
graph (a bouquet). In this sense, voltage graphs are a generalization of Cayley graphs. 


7.4.1 Regular Voltage Graphs 


The usual purpose of a voltage graph is to specify an undirected graph. Accordingly, 
even though the voltage graph construction formally employs directions on the edges as 
a formal convenience, the terminology adopted concentrates on the undirected object. 


DEFINITIONS 


The regular voltage graph construction now described was introduced in [Gr74]. The 
definition of a Cayley graph here is as in §7.5. See §6.2 for an algebraic perspective. 


D1: Let G = (V,£) bea digraph and 6 a group. A regular voltage assignment for 
G in the group B is a function a: E > B that labels each edge e with a value a(e). 


e The pair (G,a: E — B) is called a regular voltage graph. 
e Graph G is called the base graph and group B is called the voltage group. 
e The label a(e) is called the voltage on edge e. 
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D2: The covering digraph G® (formerly, derived digraph) associated with a given 
regular voltage graph (G = (V, E),a: E — B) is defined as follows: 
e V(G°) = V° =V x B, the cartesian product. 
° E(G*) = E°=ExB. 
e If the edge e is directed from vertex u to vertex v in G, then the edge e, = (e, b) 
in G® is from the vertex up = (u,b) to the vertex Upa(e) = (v, ba(e)). 


NOTATION: Vertices and edges of the derived graph are usually specified in the subscript 
notation, rather than in cartesian product notation. The only standard exception to 
this convention is to avoid double subscripting. 


TERMINOLOGY NOTE: The digraph G® is usually called, simply, the covering graph 
(formerly, derived graph). Moreover, its underlying (undirected) graph is also denoted 
G* and is also called the covering graph. Such shared terminology avoids excessively 
formalistic prose. In context, no ambiguity results. 


D3: The Cayley graph C(A, X) for a group A with generating set X has the elements 
of A as vertices and has edges directed from a to ax for every a € A and x € X. We 
will assume that vertices are labeled by elements of A and that edges are labeled by 
elements of X. Although an involution x (i.e., an element of order 2 in the group A) 
gives rise to a directed edge from a to ax and also one from az to a, for all a, sometimes 
we will choose to identify these pair of edges to a single undirected edge labeled x. In 
86.2, such a pair is always represented by a single edge. 


EXAMPLES 


E1: Figure 7.4.1 shows how a Cayley graph for the cyclic group Zs is specified by 
assigning the elements 1 mod 5 and 2 mod 5 to the two self-loops. 


Figure 7.4.1: A voltage assignment in Zs for the Cayley graph Ks. 


E2: Figure 7.4.2(i) is a regular voltage graph (G,a: E — Zs), and Figure 7.4.2(ii) is 
the corresponding covering graph. 


Each a-edge of the covering graph G® joins two u-vertices in G%, because edge a of 
the base graph is a self-loop at vertex u. Since the voltage on edge a is 1 mod 3, each 
subscript increments by 1 in a traversal of an a-edge from tail to head. Since edge b of 
the base graph G goes from vertex u to vertex uv, each of the b-edges in the covering 
graph G® crosses from a u-vertex to a u-vertex. The subscripts on tail and on head are 
equal on all the b-edges, because edge b carries voltage 0. 
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voltages in Z, 


Figure 7.4.2: A regular voltage assignment and the covering graph. 


TERMINOLOGY 


Involutions in the voltage group. Assigning an involution x as the voltage to a 
self-loop e at a vertex v in the base graph causes the e-edges in the covering digraph to 
be paired. That is, the directed edge e, from vertex vp to vertex vp, is paired with the 
directed edge ep+44 from vy4_ to Vp. 


TERMINOLOGY NOTE: The term covering graph also refers to the undirected graph 
that is obtained by identifying these pairs of directed edges to a single edge, as one may 
do with (topological) Cayley graphs. 


REMARK 


R1: The earliest application of voltage graphs was to construct imbeddings of large 
graphs on surfaces, often in connection with minimum genus or symmetric maps. 


Fibers 


DEFINITIONS 


D4: Let G° be the covering graph for a regular voltage graph (G = (V, FE),a: E > B). 
e The vertex subset {v} x B = {uy : b € B} is called the (vertex) fiber over v. 


e Similarly, the edge subset {e} x B = {e, : b € B} is called the (edge) fiber over e. 


D5: Let (G = (V,F),a: E > B) bea regular voltage graph. The graph mapping from 
the covering graph G® to the voltage graph G given by the vertex function and edge 
function 

Urru eprre 


respectively, is called the natural projection. (Thus, the natural projection is given 
by “erasure of subscripts”.) 


EXAMPLE 


E2, continued: In Figure 7.4.2, the subset {uo, wi, u2} of V(G%) (what we were calling 
the “u-vertices”) is the vertex fiber over u. The subset {bo, bi, b2} of E(G°) (what we 
were calling the “b-edges”) is the edge fiber over b. 
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FACT 


F1: It is clear from the definition of the covering graph that the vertex-set of the 
covering graph is partitioned into |V| fibers, each with |B] vertices. Similarly, the edge- 
set of the covering graph is partitioned into || fibers, each with |B] vertices. 
Bouquets and Dipoles 


For economy of description, it is helpful to use a base graph with as few vertices as 
possible. 


DEFINITIONS 
D6: The bouquet B,, is the one-vertex graph with n self-loops. 
D7: The dipole D,, is the two-vertex graph with n edges joining the two vertices. 


EXAMPLE 


Voltage graph theory is intuitively spatial. Instead of cluttering the drawings with 
cumbersome labels, one uses graphic features to represent the partitions into fibers. 


E3: Figure 7.4.3 illustrates how graphic features are used. For instance, in each covering 
graph, a particular fiber and its corresponding voltage assignment are displayed in bold. 


0 
110 111 111 110 
2 2 0 
0 
1 1 190 191 
101 1 
: , 0 0 fe) 00 
000 001 001 000 
1 100 
‘OO: &> 
2 001 
(i) voltages in Z; (ii) voltages in Z; (iii) voltages in 25 


Figure 7.4.3: Three regular voltage assignments and their covering graphs. 


Figure 7.4.3(i) derives Ks with Zs-voltages on Bg, as in Figure 7.4.1. This time, we 
have “suppressed the mainscript” on the vertex fiber in the covering graph, and shown 
only the subscripts. Moreover, we have suppressed the directions in the covering graph. 
Figure 7.4.3(ii) derives K3,3 with Zs3-voltages on D3. Using hollow and solid vertex 
graphics enables us to label all the vertices in the covering graph by their subscripts, 
without loss of information. Similarly, the edge graphics enable us to verify readily 
that the edges in each edge fiber join vertices whose labels differ by the correct amount. 
Figure 7.4.3(iii) derives the union of two isomorphic copies of the cube graph Q3 with 
Z3-voltages on D3. This phenomenon is examined in §7.4.2. 


Additional elementary examples appear in [GrTu87] and in [GrYe06]. 
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FACTS 


F2: The Cayley graph of a group with generating set {x,,...,x7,} is (naturally) iso- 
morphic to the covering graph specified by the bouquet By with voltages {x1,...,x%} 
on its respective self-loops. 

F3: The complete graph K2,4 1 can be covering by assigning the voltages 1,2,...,n 


from the cyclic group Zan+41 to the edges of the bouquet B,,. (This is a special instance 
of Fact F2.) 


F4: The complete graph K2, can be covering by assigning the voltages 1,2,...,n from 
the cyclic group Zz, to the edges of the bouquet B,,, if one compresses the pairs of 
edges that arise from the involution n mod 2n. 


F5: The symmetric complete bipartite graph K,,,,, can be covering by assigning voltages 
0,1,...,2—1 in the cyclic group Z, to the n edges of the dipole Dy. 


F6: The d-dimensional cube graph Qg can be covering by assigning the d elementary 
vectors in Z to the edges of the dipole Da. 


Net Voltages 


DEFINITIONS 


Ds: A walk in a voltage graph is any walk, as if the voltage graph were undirected. 
This means that some of its edge-steps may proceed in the opposite direction from the 
direction on the edge it traverses. 


D9: The voltage sequence on a walk W = v9, €1, 11, €2,---;€n; Un is the sequence 
of voltages aj, ...,@n, encountered, where a; = a(e;) or a(e;)~', depending on whether 
edge e; is traversed in the forward or backward direction, respectively. 


D10: The net voltage on a walk in a voltage graph is the product of the algebraic 
elements in its voltage sequence. 


EXAMPLE 


E2, continued: We return to the voltage graph of Figure 7.4.2, reproduced here for 
convenience. 


voltages in Z3 


The walk W = u,c,v,b~,u,c,u has net voltage 1+0+1=2 mod 3. We observe that 
in the walk uo, co, v1, by , ui, C1, V2 in the derived graph, which follows the same c, b~,c 
edge pattern as walk W, the subscript increases by 2 from initial vertex to final vertex. 
This phenomenon is examined in 87.4.5. 
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The Local Group 


DEFINITION 


D11: The local group at a vertex v of a voltage graph (G = (V, F),a: E > B) is the 
subgroup of all elements of B that occur as the net voltage on a closed walk that starts 
and ends at vertex v. It is denoted Loc,. 


FACTS 


F7: [AlGr76] If the voltage group is abelian, then the local group is the same at every 
vertex. If it is non-abelian, then the local group at a vertex is conjugate to the local 
group at any other vertex. 


F8: [AlGr76] For any regular voltage graph (G,a : E — 6), the number of components 
of the derived graph equals the index [B : Loc,] of the local group in the voltage group. 


F9: [AlGr76] The components of the derived graph are mutually isomorphic. 


EXAMPLE 


E3, continued: The local group for Figure 7.4.3(iii) is the subgroup of 3-tuples with 
evenly many 1’s. This subgroup has index two in Z3. Thus, there are two components 
to the derived graph. 


REMARKS 


R2: If we select a root vertex r in the base graph, then the component of the derived 
graph containing the vertex ro (here, zero denotes the group identity, even for a non- 
abelian group) serves as a preferred component of the derived graph. 


R3: The theory of the local group and multiple components was developed by [AlGr76] 
in the terminology of topological current groups and general algebra. 


Natural Automorphisms 


The natural action of the group on any of its Cayley graphs generalizes to a natural 
action of a voltage graph on the derived graph. 


DEFINITION 


D12: Let (G,a: E > B) bea voltage graph, and let x € B. The natural automor- 
phism yp, : G* > G* is given by the rules 


Upt> Ugp and ep, +1 egy 


Thus, if edge e runs from vertex u to vertex v of the base graph, then edge ey runs from 
vertex up to vertex Uzpq(e) in the derived graph. 
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FACTS 
See [GrTu87] for details. 


F10: The group of natural transformations is fiber preserving. That is, each vertex 
and edge of the derived graph is mapped to another vertex or edge, respectively, within 
the same fiber. (This is immediate from the definition of a natural transformation.) 


F11: The group of natural transformations acts transitively on the vertices within each 
vertex fiber and transitively on the edges within each edge fiber. 


F12: Let (G,a: E > B) be a voltage graph. A component of the derived graph G® 
that contains a vertex in the fiber over v is mapped to itself by the natural automorphism 
(x if and only if « € Locy. 


EXAMPLE 


E3, continued: The natural automorphism ~,,, for Figure 7.4.3(iii) maps a compo- 
nent of the derived graph to itself if and only if zyz has evenly many 1’s. 


7.4.3 Permutation Voltage Graphs 


The permutation voltage graph construction of [GrTu77] uses the objects permuted 
by a permutation group as the subscripts, rather than using the group elements as in 
the regular voltage graph construction. This leads to increased generality (Fact F15). 


DEFINITIONS 


D13: Let G = (V,E) be a digraph. A 5,,-permutation voltage assignment for G 
is a function a: E > %,, that labels each edge with a permutation in the symmetric 


group. 
e The pair (G,q) is called a ©,,-permutation voltage graph. 


e Graph G is called the base graph and group &,, is called the permutation 
voltage group. 


e The permutation label a(e) is called the voltage on edge e. 


D14: The (~,,-permutation) derived digraph G° associated with a permutation 
voltage graph (G = (V, E),a: E > X,) is defined as follows: 


e V(G°) =V°%=V x {1,...,n}, the cartesian product. 
© (GY) =F* =f Xuan} 


e If the edge e is from vertex u to vertex v in G then the edge e; = (e,j) is from 


the vertex uj = (u, j) to the vertex vqj) = (v, a(J)). 
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EXAMPLE 


E4: Figure 7.4.4 shows a 43-permutation voltage graph and the corresponding covering 
digraph. 


U 


(123) 


(i) (ii) 


Figure 7.4.4: (i) A %3-voltage graph; (ii) the covering digraph. 


The edge fiber over self-loop 6 at vertex u of the base graph forms the single cycle 
(u1, b1, U2, b2, U3, bs), because b has voltage (123). The edge fiber over self-loop d forms 
two disjoint cycles (v1, d1v2, d2)(v3,d3), because the voltage on d is (12)(3). Since edge 
c goes from vertex u to vertex v in the base graph, each edge in the fiber over c crosses 
from the vertex fiber over u to the vertex fiber over v, and the vertex subscripts are 
permuted in accordance with the voltage (123) on edge c. 


FACTS 


F13: [Gr77] Every regular graph of even degree 2k is specifiable by assigning permu- 
tation voltages to the bouquet B,. (See also [SiSk85].) 


F14: From Babler’s theorem [Ba38] (see Fact F6 of §5.4), it follows that every 2k- 
edge-connected (2k + 1)-regular graph is specifiable by assigning permutation voltages 
to the bouquet By, if one permits a 1-factor to represent the fiber resulting from an 
involution. 


F15: Any graph derivable by regular voltages is also derivable by permutation voltages. 
This follows from the fact that the right regular representation of any group can be 
imbedded in a symmetric permutation group. 


REMARKS 


R4: Analogous to the regular case, the vertex-set and edge-set of a &,-permutation 
derived graph are partitioned, respectively, into |V| vertex-fibers, each with n vertices, 
and into |E| edge-fibers, each with n edges. Analogously, natural projection is by erasure 
of subscripts. 


R5: Further elementary examples of permutation voltage graphs are given in [GrTu87]| 
and in [GrYe06]. 


R6: In labeling a voltage graph drawing with permutations, one must specify whether 
the voltages are to be regarded as permutation voltages or as regular voltages. For 
permutation voltages, there are n vertices or edges in each fiber. For regular voltages, 
there are n! vertices or edges. 
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R7: In particular, it is possible to label the bouquet By with permutations in the 
wreath product Zn @wr Ze (with a cycle shift and a de Bruijn permutation) so that the 
permutation derived graph is the n-dimensional de Bruijn graph and the regular derived 
graph is the wrapped butterfly graph. 


7.4.4 Representing Coverings with Voltage Graphs 


Covering spaces are a topological abstraction of Riemann surfaces. In fact, every 
covering space of a graph can be specified by assigning voltages. The advantage of spec- 
ifying a covering graph by voltages, rather than by the classical abstract descriptions, is 
that the derived graph over a voltage graph has every vertex and edge labeled according 
to its fiber, in a manner that lends itself to topological and combinatorial intuition. 


Coverings and Branched Coverings of Surfaces 


DEFINITIONS 


D15: Let S and S be surfaces, and let p: S — S be a continuous function, such that 
the following condition holds: 


Every point of S' has an open neighborhood U such that each component of 
p‘(U) is mapped homeomorphically by p onto U. 


Then p: S > Siscalleda covering projection and the surface S is called a covering 
space of S. 


D16: Let p: § > S be a covering projection. For each point x € S, the set p *(a) is 
called the fiber over x. 


D17: Let S and S be closed surfaces, and let B be a finite set of points in S such that 
the restriction of the mapping p: S > S to S — UB is a covering projection. Then 


e The mapping p: S + S is called a branched covering. 
e The space $ is called a branched covering space of S. 
e The set B is called the branch set. 


e The images of points in the branch set are called branch points. 


EXAMPLES 


E5: The complex function e3” is a covering projection of the unit circle in the complex 
plane onto itself. The fiber over a point e is the set {e’”, e#+?7/3, e'*+47/31 Moreover, 
the function e*” is a branched covering of the unit disk in the complex plane onto itself, 
in which {0} is the only branch point. 


E6: The classical Riemann surfaces are branched coverings of the complex plane. 


E7: [A120] Every closed orientable surface is a branched covering of the sphere. 
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E8: Consider the unit sphere So = {(x,y,z) € R®: 2*+y?+2? = 0} and the antipodal 
mapping (x,y,z) ++ (—x,—-y,—z). The quotient mapping induced by the antipodal 
homeomorphism is a covering projection of Sg onto the projective plane. Moreover, the 
restriction of this covering projection to the annular region of Sp between the “Tropic 
of Cancer” and the “Tropic of Capricorn” is a covering projection of this annular region 
onto a Mobius band. 


REMARK 


R8: The branch set in any covering of manifolds has codimension 2. Thus unless a 
graph is imbedded in a surface, there is no branching. 


Using Voltage Graph Constructions 


A few basic facts serve as a guide to the use of voltage graph constructions. 


FACTS 


Let (G = (V, E),a: E > B) bea regular voltage graph. Then the following statements 
hold. 


F16: |V(G*)|=|V(G)|- |B] and |E(G*)| =|£(G)| - |B). 
F17: In the fiber over a vertex v € V(G), every vertex v, has the same degree as v. 


F18: A proper coloring of the base graph can be lifted to a proper coloring of the 
voltage graph, in the following sense: every vertex in the fiber over a vertex v € V(G) is 
assigned the same color as v. (A graph with self-loops is considered to have no proper 
colorings.) 


F19: [GrTu79] Let T be a spanning tree of a graph G. If a graph G can be constructed 
by assigning 6-voltages to G,, then it is possible to do so by completing an assignment 
of arbitrary voltages from 6 to the edges of T. 


EXAMPLES 


E9: To represent the Petersen graph as a regular covering space of the dumbbell graph, 
we observe that the Petersen graph has 10 vertices and 15 edges. The only nontrivial 
common divisor of 10 and 15 is 5, so using Fact F16, we seek a base graph with 2 vertices 
and 3 edges. There are four such connected graphs. In accordance with Fact F17, the 
base graph must be 3-valent regular, which narrows the possibilities to two graphs. One 
of these two, the dipole D3, is 2-colorable. By Fact F18, it cannot be a base graph for 
the Petersen graph. This leaves the dumbbell graph, shown in Figure 7.4.5, as the only 
possible base graph. 


We seek Z5-voltages on the dumbbell graph, since Z; is the only group of order 5. By 
Fact F19, we may start by assigning the voltage 0 to the edge d. Figure 7.4.5 shows the 
completed assignment. 
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voltages in Zs 


Figure 7.4.5: A regular voltage assignment for the Petersen graph. 


E10: By similar considerations, we can demonstrate that the complete graph Ky, is 
not nontrivially derivable as a voltage graph. It has 4 vertices and 6 edges, so the only 
nontrivial common divisor is 2. By the exact same progression of steps as in Example 
9, we narrow the possible candidates down to the dumbbell graph, and we narrow the 
possible voltage groups to Za. 


Assigning 0 to a self-loop in the base leads to an edge fiber of self-loops. Assigning 
the involution 1 to both self-loops leads to a 4-vertex 6-edge graph with two double 
adjacencies, whose elimination yields a 4-cycle, not K4. 


Action of the Group of Covering Transformations 


The subsection is confined to exploring the sense in which voltage graphs provide all 
possible covering graphs. Prior acquaintance with algebraic topology is helpful. 


DEFINITIONS 


D18: Let p: S>Sbea covering projection. A homeomorphism h on S is called a 
covering transformation if ph = p. One sometimes says that such a homeomorphism 
his fiber preserving, since its restriction to any fiber is a permutation of that fiber. 


D19: A group H of covering transformations on a covering projection p : S38 is 
said to act freely if no transformation in H except the identity has a fixed point in S. 


D20: A group H of covering transformations on a covering projection p : S > Sis 
said to act transitively if its restriction to each fiber acts transitively. 


D21: A regular covering projection is a covering projection p : S + S such that 
there exists a group of freely acting covering transformations. In this case, the domain 
S is called a regular covering space of S. 


TERMINOLOGY: The phrase “covering space of a graph G” is used to describe a covering 
space of a topological realization of the graph G, e.g., in 3-space. It also refers to any 
graph G such that there is a graph map G + G whose topological realization is a 
covering projection. 


EXAMPLE 


E10, continued: The three functions e*” 4 e*,e! 1 e'*+27/3, and e@” 1 ett 47/3 
form a group of covering transformations. This group acts freely, and it acts transitively 
on the fibers over every point of the unit circle. Thus, the complex function e*” is a 
regular covering projection of the unit circle onto itself. 
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FACTS 


F20: [Gr74] Let (G,a) be a regular voltage graph. Then the derived graph G® is a 
regular covering graph of G. 


F21: [GrTu77] Let X be a topological realization of a graph G be any graph, and 
let X be a regular covering space of X. Then X is homeomorphic to the topological 
realization of the derived graph corresponding to a regular voltage assignment on G. 
(“Every regular covering space of a graph is realizable by a regular voltage assignment.” ) 


F22: [GrTu77] Let (G,a) be a permutation voltage graph. Then the derived graph 
G® is a covering graph of G. 


F23: [GrTu77] Let X be a topological realization of a graph G be any graph, and let 
X bea covering space of X. Then X is homeomorphic to the topological realization of 
the derived graph corresponding to a permutation voltage assignment on G. (“Every 
covering space of a graph is realizable by a permutation voltage assignment.” ) 


F24: Let (G, a) be aregular voltage graph,whose voltage group B has order n. Then the 
corresponding derived graph is isomorphic to the derived graph of the ©,,-permutation 
voltage graph (G, @), where b ++ 6 is the right regular representation of the group B em- 
bedded in %,,. (“Every regular voltage assignment can be represented as a permutation 
voltage assignment.” ) 


F25: Let (G,qa) be any regular or permutation voltage graph, and let e be a directed 
edge of graph G. If the direction of e is reversed, and if the voltage a(e) is replaced 
by its algebraic inverse a(e)~', then the resulting derived graph is isomorphic to the 
derived graph G. 


F26: Let p: C —>C bea covering projection of the unit circle onto itself, such that 
each point in the image is covered & times. Then p : C > C extends to a branched 
covering of the unit disk itself, in which 0 is the only branch point in the codomain and 
{0} is the branch set in the domain. 


7.4.5 The Kirchhoff Voltage Law 


DEFINITIONS 


D22: Let W = v9, €1, U1, €2, ---) En; Un be a walk in a regular voltage graph (G,a: E > 
B), and let a1,...,dn be its voltage sequence. Let b € B. Then the walk 

We = (vo, b), (e1, bax), (v1, bax), (ea, ba1a2), ee (En; ba, ag eal Gn); (Un; ba,az aa An) 
is called a lift of the walk W. 


D23: Let W = vo, €1, U1, €2,-.-,€k, Uk be a walk in a permutation voltage graph (G,a: 
E+ %,), and let m,..., be its voltage sequence. Let 7 € {1,...,n}. Then the walk 


(v0.3), (e1,m(3)), (v1,m(3)), (e2, n2(m(J))), 
+5 (ex me(-> > (1m1(9)))), (vis me + (m2(9)))) 


is called a lift of the walk W. 
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D24: Let W be a closed walk in a voltage graph. If the net voltage on W is the 
identity of the voltage group, then we say that the Kirchhoff voltage law (KVL) 
holds on W. 


FACTS 


F27: [Gr74] [GrTu77] Let W be a closed walk in a voltage graph. If the Kirchhoff 
voltage law holds on W, then every lift W, of W is a closed walk in the derived graph. 


F28: [Gr74] Let W be a closed walk in a regular voltage graph (G,a: E > B), with 
net voltage c. Let c have order k in the voltage group B. Then the concatenation of the 
sequence of lifts Wy, Wye, Wac2,---, Wy-e-1 is a closed walk in the derived graph G®. 


NOTATION: Under the hypotheses of Fact F28, the set of lifts of the walk W is concep- 
tualized as partitioned into Bl sequences of lifts, as in the conclusion, whose concate- 
nations are closed walks in the derived graph. This set of closed walks formed by such 


concatenation is denoted W~. 


F29: [GrTu77] Let W be a closed walk in a permutation voltage graph (G,a: E > =), 
with net voltage 7. Let 7 have order & in the voltage group U,. Then the concatenation 
of the sequence of lifts W;,W1 (3), Wn2(j),---»Waye-1() 18 a closed walk in the derived 
graph G®. 


NOTATION: Under the hypotheses of Fact F29, the set of lifts of the walk W is conceptu- 
alized as partitioned into [ sequences of lifts, as in the conclusion, whose concatenations 
are closed walks in the derived graph. This set of closed walks formed by such concate- 


nation is denoted W%, as in the regular case. 


7.4.6 Imbedded Voltage Graphs 


Imbedded voltage graphs and their duals, called current graphs, are used to specify 
the imbeddings of graphs on surfaces. Imbedded voltage graphs are used extensively 
in calculations of maximum and minimum genus of a graph (see §7.2), in calculating 
the minimum genus of a group (see §7.5), and in constructing regular maps on surfaces 
(see §7.6). 


DEFINITIONS 


D25: Let (G,a) be a voltage graph such that the graph G is (cellularly) imbedded in 
a closed surface S. Then the pair (G > S,a) is called an imbedded voltage graph; 
also, (G > S,a) is called the base imbedding, and S is called the base surface. 


D26: Let 0 be the set of closed walks of the faces of an imbedded voltage graph 
(G + S,a). Then the union © of the sets W*, where W € Q), is called the set of lifted 
boundary walks. 


D27: Let 2 be the set of lifted boundary walks in the derived graph G® for an imbedded 
voltage graph (G — S,a). The cellular 2-complex S® that results from fitting to each 
closed walk in Qa polygonal region (whose number of sides equals the length of that 
closed walk) is called the derived surface. The imbedding G — S® is called the 
derived imbedding. 
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D28: Let (G > S,a) be an imbedded voltage graph. To extend the natural projection 
p:G => G to the surfaces, the natural projection p is extended from the set of lifted 
boundary walks in the imbedding G* — S@ to the regions they bound (with branching 
as needed), in accordance with Fact F28. The resulting extended function is called the 
natural projection. 


D29: A monogon is a face whose boundary walk has length equal to 1. 


D30: A digon is a face whose boundary walk has length equal to 2. 


FACTS 


F30: Let (G — S,a) be an imbedded voltage graph. Then the derived surface is a 
closed surface, and the derived imbedding is a cellular imbedding; moreover, if the base 
surface S is orientable, then so is the derived surface. 


F31: [GrAl74, Gr74] Let (G > S,a) be an imbedded voltage graph. If the Kirchhoff 
voltage law holds on the boundary walk of every face of the base imbedding, then the 
natural projection p: S33 Sisa covering projection. If KVL does not hold, then the 
natural projection is a branched covering. 


EXAMPLES 


E11: Figure 7.4.6 shows an imbedded ordinary voltage graph in which the base graph 
is the bouquet By, the base surface is the torus S;, and the voltages are in the cyclic 
group Z;. The derived graph is the complete graph K; and the derived surface is the 
torus S;. There is only one base face, and KVL holds on its boundary walk. Thus, 
each of the derived faces has the same number of sides as the base face, and the natural 
projection is a covering projection. 


Up (oh U, Uy d, U3 A, U4 d, Up 


& 
voltages in Zs 


() a wi) kos 


Uy 


Figure 7.4.6: (i) An imbedded voltage graph; (ii) its derived imbedding. 


E12: Figure 7.4.7 shows another imbedded ordinary voltage graph in which the base 
graph is the bouquet B2 and the voltages are in the cyclic group Zs, but the base surface 
is the sphere Sp. The derived graph is the complete graph Ks. 


There are three base faces, i.e., two monogons and one digon, and KVL does not hold 
on any of their boundary walks; indeed, the net voltage on each boundary walk has 
order 5 in the group Z5. Thus, each of the derived faces has 5 times as many sides 
as the base face, so there are two 5-gons and one 10-gon; the natural projection is a 
branched covering projection, with a branch point in each base face. Since the Euler 
characteristic of the derived surface is —2 = 5—10+3, it follows that the derived surface 
is So. 
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Figure 7.4.7: An imbedded voltage graph Bz > So. 


E13: [Gr74] Every Z-metacyclic group with presentation 

(s,t [eer ea i= st) 
such that m is odd and n is even is a toroidal group. Various such results on the genus 
of a group have been derived with the aid of voltage graphs. 


7.4.7 ‘Topological Current Graphs 


The origin of voltage graphs was in the pursuit of a unified explanation (see [GrA174] 
and [GrTu74]) of the 300-page Ringel—Youngs solution [RiYo68] (see also [Ri74]) to the 
Heawood map-coloring problem [He:1890], which is to calculate the chromatic number 
of every closed surface except the sphere. Several extensions of voltage graph theory 
have augmented its utility. 


DEFINITIONS 


D31: Let G be a digraph with vertex-set V and edge-set FE’, imbedded in a surface 
S. A regular current assignment for G in a group B is a function a from F to B. 
The function value a(e) is called the current on edge e. The pair (G > S,q) is called 
a regular current graph, and B is called the current group. 


D32: Let (G = (V,E) — S,a: E > B) be a regular current graph. Its dual is the 
imbedded voltage graph whose base imbedding is G* — S, the dual of the imbedding 
G — S (which involves reversed orientation from the primal imbedding surface, if S$ is 
orientable). For each primal directed edge e € E, we define a*(e*) = a(e) to be its 
voltage. 


D33: The derived imbedding of a current graph (G > S,a: E > B) bea 
current graph is the derived imbedding of its dual, that is, of the imbedded voltage 
graph (G* > S,a‘*). 


D34: Let v be a vertex in a current graph. If the net current at v is the identity of the 
current group, then we say that the Kirchhoff current law (KCL) holds at v. (In 
an abelian group, the net current is the sum of the inflowing currents. In a non-abelian 
group, one calculates the product in the cyclic order of the rotation at v.) 


EXAMPLE 


E14: In Figure 7.4.8, all three drawings shown are on tori, with the left side of the 
rectangle pasted to the right, and the top pasted to the bottom. The derived imbedding 
is pasted with a 2 twist, so that like labels match. We observe that KVL holds on both 
faces of the imbedded voltage graph. In accordance with duality, KCL holds at both 
vertices of the corresponding current graph. 
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Figure 7.4.8: Deriving an imbedding from a topological current graph. 


REMARKS 


R9: In Gustin’s original conception [Gu63], a current graph was a 3-regular graph 
whose vertices were marked with instructions for traversing a small family of closed 
walks that doubly covered its edges, and whose edges were marked with algebraic la- 
bels. Ringel and Youngs augmented Gustin’s “nomograms” (Youngs’s terminology) into 
numerous varieties of combinatorial current graph, each with a distinct set of defining 
rules, whatever helped in their endeavors to solve the Heawood problem. The com- 
mon feature of all varieties was that recording the algebraic elements traversed along 
those closed walks yielded generating rows for rotations systems (see §7.1) that specified 
minimum-genus imbeddings for complete graphs. 


R10: The theory of topological current graphs [GrA174] generalized the various com- 
binatorial current graphs referred to in Remark 9 into a single unified construction, 
applicable not just to complete graphs, but to a wide variety of graphs with symme- 
tries, and it identified the underlying construction as a branched covering. The theory 
of regular voltage graphs [Gr74] separated the base graph from its imbedding, which 
facilitated a divide-and-conquer approach to constructing imbeddings, as in Examples 
E11 and E12. The theory of permutation voltage graphs [GrTu77] expanded the con- 
struction. 


R11: When one dualizes the natural projection of the derived imbedding onto an 
imbedded voltage graph, there is a natural projection of the dual of the derived graph 
onto the current graph, which is a folded covering in the sense of [Tu36]. The rela- 
tionship of the dual derived graph to the current graph was studied by [PaPiJa80] and 
[GrJaPaPi82]. A different perspective on simultaneous consideration of voltages and 
currents is given by [Ar92]. 


7.4.8 Lifting Voltage Graph Mappings 


In porting distributed algorithms between parallel architectures, a theoretical prob- 
lem that arises is the construction of a mapping between two large symmetric graphs 
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that would minimize the slowdown involved in emulating a computation designed for 
one computer architecture on a different architecture. 


DEFINITIONS 


D35: Let f:G— HA bea graph map. The guest is the domain, and the host is the 
codomain. (A computation designed for the guest is to be emulated by the host.) 


D36: Let f : G — H be a graph map. The load at a vertex v of the host is 
the cardinality |f~'(v)| of its preimage. (A processor at host vertex v is required to 
reproduce the computations of the processors at every guest vertex mapped to v. These 
must be done consecutively, so there is a delay at v proportional to the load.) The load 
of the mapping f : G > H is the maximum vertex load, taken over all vertices of the 
host. 


D37: Let f:G— H bea graph map. The congestion at an edge e of the host 
is the cardinality |f~+(e)| of its preimage. (A link at host edge e is required to carry 
the messages of the links represented by every guest edge mapped to e. These must be 
done consecutively, so there is a delay at v proportional to the load.) The congestion 
of the mapping f : G— H is the maximum edge congestion, taken over all edges of 
the host. 


FACTS 


F32: (GrCh96] A graph mapping from a guest G to a host H can sometimes be 
constructed by lifting a graph mapping from a voltage graph for G to a voltage graph 
for H. 


F33: = [ArGvSi97], [ArRiSiSk97] A mapping of imbedded graphs can sometimes be 
constructed by lifting a graph mapping from an imbedded voltage graph for G to an 
imbedded voltage graph for H. 


F34:  [GrCh96] A graph mapping from a guest G to a host H that minimizes load 
or congestion can sometimes be constructed by lifting a graph mapping from a voltage 
graph for G to a voltage graph for H. 


REMARK 


R12: Other references about lifting mappings include [MaNeSk00], [MiSa00a], and 
[Su90]. 


7.4.9 Applications of Voltage Graphs 


Beyond their initial uses in the construction of minimum imbeddings of graphs and 
maps with various forms of symmetry, voltage graphs have acquired several other uses. 
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REMARKS 


R13: The use of voltage graphs in the construction of graphs with special prescribed 
properties or in the calculation of invariants appears in [AnGa81], [ArKwLeSo00], 
[FeKw02], [KrPrTe97], [KwHoLeSo00], [KwLeSo99], [Le82], [MiSa97], [MiSa00b], 
[MiSa02], and [MiSa03]. 


R14: For an extensive survey of counting covering spaces of a graph, see [KwLe04]. 


R15: Among the many papers that apply voltage graphs to counting covering spaces of 
a graph are [EwHo93], [FeKwKiLe98], [Ho91a], [Ho91b], [Ho95a], [Ho95b], [HoKw93], 
[HoKwLe96], [HoKwLe99], [KikiLe99], [KwChLe98], [KwHoLeSo00], [KwLe92], 
[KwLe94], [KwLe98], and [KwKiLe96]. 


R16: For a study of random permutation voltages, see [AmLi02]. 

R17: For connections to the Vassiliev knot invariants, see [P101]. 

R18: For connections between voltage graphs and coding theory, see [LaPe81]. 
R19: For connections between voltage graphs and biological networks, see [MoRe02]. 


R20: For the applications of voltage graphs to the study of isomorphisms of graph 
coverings, see [Sa94] and [Sa01]. 


Applications of Imbedded Voltage Graphs and Topological Cur- 
rent Graphs 


Imbedded voltage graphs and topological current graphs have often been used in con- 
nection with investigations in algebra and geometry. 


REMARKS 


R21: For results on lifting imbeddings with special properties, including symmetries 
and the realization of minimum imbeddings for a graph, see [Mo85], [Mo86], [Mo87b], 
[Mogg], [NeSk97], [Pa80], [Pi80], [Pi82], and [St79]. 


R22: For the use of voltage graphs in counting graph imbeddings, see [KwChLe98}, 
[KwKiLe96], [LeKi01], [LeKi02], and [Mo87al. 


R23: Applications of voltage graphs or topological current graphs to the genus of a 
group (see §7.5) appear in [Gr74], [GrLo80], [JuWh80], [Pi92], [PiTu89], [Pr77], [Pr81], 
[PiTu92], [PiTuWi92], [Tu80], [Tu84], and [Tus3}. 


R24: Voltage graphs have been used in the construction of block designs by [A175], 
[BrHu87], [Ga79], and [Wh78], among others. 


R25: The use of voltage graphs in the representation of finite geometries (see §7.9) 
has been pioneered by A. T. White. See especially [Wh01]. 
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INTRODUCTION 


When a Cayley graph C(.A, X) for a finite group A is imbedded in a surface, the 
face boundaries, as cycles in the Cayley graph, give relations in the generating set X, 
that is, words in the generators and their inverses that represent the identity element of 
A. Thus, the possible imbeddings of Cayley graphs for the group A are closely related 
to the possible presentations for that group in terms of generators and relations. The 
smallest genus g such that some Cayley graph for the group A can be imbedded in 
the surface of genus g is called the genus of the group A. White [Wh72,Wh84] first 
introduced the term, although Burnside [Bull] considers a similar concept. Study of 
the genus of groups is closely related to questions about group actions on surfaces, 
regular branched coverings, and automorphisms of Riemann surfaces. For more about 
the genus of a group, see the survey chapter in [Tu09]. 


7.5.1 Symmetric Imbeddings of Cayley Graphs 


If the Cayley graph C(.A,X) is imbedded in an orientable surface S, one might 
ask whether the natural vertex-transitive symmetry of the Cayley graph is somehow 
reflected in the symmetry of its imbedding, especially for minimum genus imbeddings. 


DEFINITIONS 


D1: The Cayley graph C(A, X) for a group A with generating set X has the elements 
of A as vertices and has edges directed from a to ax for every a € A and x € X. We 
will assume that vertices are labeled by elements of A and that edges are labeled by 
elements of X. 


We notice that an involution x gives rise to a directed edge from a to ax and 
also one from az to a, for all a; sometimes we will choose to identify this pair of 
edges to a single undirected edge labeled «x. 
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D2: The genus of a group A, which is denoted (A), is the smallest genus g such 
that some Cayley graph for the group A can be imbedded in the orientable surface S, 
of genus g. 


D3: The natural action of the group A on the Cayley graph C(A, X) is the group 
of automorphisms corresponding to left multiplication of the vertices of a Cayley graph 
C(A, X) by an element b of the group A. (This respects the labeling and directing of 
the edges, since (ba)x = b(az).) 


D4: The action of an automorphism group on a graph is vertex-transitive if it takes 
any vertex to any other vertex. 


D5: An automorphism group acting on a graph is a free action if no element except 
the identity fixes any vertices of the graph. 


D6: The finite group A acts on the orientable surface S if A is isomorphic to a 
subgroup of the group of all homeomorphisms of 9. 


D7: The action of a group A on a surface S' preserves orientation if every element 
of the corresponding subgroup of homeomorphisms on S preserves the orientation of S. 


D8: An imbedding of a Cayley graph C(A, X) in the orientable surface S is symmetric 
if the natural action of A on C(A, X) extends to an action of A on S. 


D9: An imbedding of a Cayley graph C(A, X) in the orientable surface S is strongly 
symmetric if the natural action of A on C(.A, X) extends to an orientation-preserving 
action of A on S. 


D10: The symmetric genus (respectively, strong symmetric genus) of the group 
A, denoted o(A) (respectively, 7°(A)), is the smallest g such that some Cayley graph 
imbeds symmetrically (respectively, strongly symmetrically) in the surface of genus g. 


TERMINOLOGY: A strongly symmetric imbedding of a Cayley graph for the group A is 
also called a Cayley map for the group A [BiWh79, RSJTW]. 


FACTS 
F1: The definitions immediately imply that 7(A) < o(A) < 0°(A). 


F2: The natural action of the group A on the Cayley graph C(.A, X) is vertex-transitive 
and free. 


F3: [Sa58] A graph G is a Cayley graph for the group A if and only if there is a group 
of automorphisms isomorphic to A acting on G, such that the action is vertex-transitive 
and free. 
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F4: [GrTu87] Any orientation-preserving automorphism of a graph imbedding must 
respect rotations at vertices (the cyclic ordering of edges around each vertex given by 
the imbedding), and the natural action of a group on a Cayley graph respects labels. 
Thus, if an imbedding of a Cayley graph is strongly symmetric, then it must have the 
same cyclic ordering of generators and their inverses at every vertex, and conversely. 


F5: It is a corollary to Fact F4 that to specify a symmetric imbedding of the Cayley 
graph C(A, X), all we need do is to give a cyclic ordering of the elements of X and their 
inverses. 


F6: The derived graph of an imbedded voltage graph for a bouquet of circles, where 
the assigned voltage set X generates the voltage group A, gives a strongly symmetric 
imbedding of the Cayley graph C(.A, X). Every strong symmetric imbedding of a Cayley 
graph can be obtained this way. 


F7: [GrTu87] Any orientation-reversing automorphism of a graph imbedding reverses 
the rotations at vertices. If the action of the group A on the orientable surface S does 
not preserve orientation, then the set of elements of A that do preserve orientation 
form an index-two subgroup B of A. Thus if an imbedding of a Cayley graph for A is 
symmetric but not strongly symmetric, there is a subgroup 6 of index two in A such 
that all vertices in 6 have the same rotation and all vertices not in 6 have the opposite 
rotation, and conversely. 


F8: Viewing symmetric imbeddings that are not strongly symmetric as derived graphs 
of a small (one or two vertices) voltage graph tends to be complicated. One possibility 
is to begin with an imbedded voltage graph of a bouquet of circles in a non-orientable 
surface and hope that the derived surface is orientable. Another is to begin with a two- 
vertex graph imbedded in a symmetric surface having an orientation-reversing involution 
f that interchanges the two vertices. 


F9: [GrTu87] If the voltage group A has an index two subgroup B, such that all loops 
are assigned voltages in 6, such that all edges between the two vertices are assigned 
voltages not in B, and such that e and f(e) get the same voltage, then the derived graph 
will be a symmetric, but not strongly symmetric imbedding of a Cayley graph for A. 


F10: [Tu83] Any action of the finite group A on the orientable surface S comes 
from a symmetric imbedding of a Cayley graph C(A, X) in S. If the action preserves 
orientation, then the imbedding is strongly symmetric. Thus o(A), respectively o°(A), 
is the minimal g such that A acts, respectively, acts preserving orientation, on a surface 
of genus g. 


F11: If BG is a subgroup of A, then 
a(B)<oa(A) and o°(B) <o0°(A) 
since if A acts on a surface, then so does B. 


F12: [Ba77] If B is a subgroup of A, then any Cayley graph for A edge-contracts to a 
Cayley graph for B. In particular, y(B) < 7(A). 
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EXAMPLES 


E1: View the standard cube as having a top and bottom face with four vertical faces. 
Let y denote the rotation by 90 degrees about the centers of the top and bottom faces. 
Let x denote the reflection that interchanges the top and bottom faces but takes each 
vertical face to itself. It is not hard to see that the action generated by the symmetries x 
and y is vertex-transitive and free, that y preserves orientation and x does not, and that 
xy = yx. Thus, the vertices and edges of the cube can be labeled to give a symmetric, 
but not strongly symmetric imbedding of a Cayley graph for the abelian group 


(x,y: 22 =y* =1,2y = yr) = Zo x Za. 


E2: Let y be as in the previous example, but let z denote the rotation by 180 
degrees about the midpoint of a vertical edge; then z interchanges not only the top and 
bottom faces, but also the vertical faces in pairs as well. It is again not hard to see 
that the action generated by the symmetries z and y is vertex-transitive and free, that 
it preserves orientation, and that zyz = y~!. Thus, the vertices and edges of the cube 
can also be labeled to give a strongly symmetric imbedding of a Cayley graph for the 
group 
(z,yi2? =y* =1,2zyz=y7") 
which is the dihedral group of order 8. 


REMARKS 


R1: In both examples above, the vertical edges correspond to an involution, with 
the resulting pairs of edges identified. In the first example, the involution reversed 
orientation; it is not hard to check that the vertical edges could not be replaced by 
a pair of directed edges and still have the involution x respect the directions. In the 
second example, the vertical edges could be replaced by a pair of directed edges and 
still have the directions respected by the involution z. In general, pairs of edges in a 
symmetric imbedding of a Cayley graph corresponding to an involution that reverses 
orientation must be identified but need not be otherwise. 


R2:  Orientation-reversing homeomorphisms of finite order, such as involutions, can 
be nonintuitive and subtle. One tends to think in terms of euclidean isometries, where 
there are two types: reflections and glides. For example, one can imagine cutting a 
torus in half, forming two dividing circles, and interchanging the halves by a reflection 
that leaves fixed the dividing circles. But it is also possible to interchange the halves by 
an antipodal map that also interchanges the circles. It is even possible to interchange 
the halves, leaving one dividing circle fixed, but rotating the other dividing circle a half 
turn (like a glide along the circle). See [GrTu87] for more examples. 


7.5.2 Riemann—Hurwitz Equation; Hurwitz’s 
Theorem 


Given a voltage graph imbedded in a surface S,, the Euler characteristic for the sur- 
face of the derived imbedding is easily calculated, in terms of the order of the voltage 
group, the Euler characteristic of S$, and the order of the net voltages on the faces. As 
derived imbeddings of one-vertex imbedded voltage graphs, strongly symmetric imbed- 
dings of Cayley graphs can be handled the same way. 
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DEFINITIONS 


D11: Suppose that the Cayley graph C(A,X) has a strongly symmetric imbedding 
in the surface T as the derived graph for a voltage graph imbedding of the bouquet 
B in the surface S. Suppose that the non-identity net voltages on the faces of this 
imbedding are r1,--- ,7. Then the Euler characteristic y(T) can be computed by the 
Riemann—Hurwitz equation (where |.A| is the order of the group A): 


x(P) = |Ala(S) = (1-4). 


1 
D12: The quantity 1—— is sometimes called the deficiency of the branch point at 
7 


the center of a face (of an imbedded voltage graph) whose excess voltage has order r;. 
That is because there are only |A|/r; copies of that face in the derived imbedding, instead 


i 


and edges generate |.A| copies each, this explains the Rirmanntigeete equation.) 


of the “expected number” |.A| copies, which is a deficiency of |A| — —. (Since vertices 


D13: A similar equation holds for symmetric imbeddings that are not strongly sym- 
metric. If the associated imbedded voltage graph still has one vertex, then the surface S 
is non-orientable, but the equation holds exactly as before. If the associated imbedded 
voltage graph has two vertices, then the Riemann—Hurwitz equation becomes 


x2) = Bs) — 1 1/r9). 


D14: A triangle group is a group with presentation 


(2,7: a? = y? = (xy)” = 1) 
This is the group of isometries of the plane generated by the rotations at the vertices of 
a triangle with angles 7/p,7,q,7/r (the geometry of the plane is spherical, Euclidean 


or hyperbolic, depending on whether the angle sum is greater than, equal to, or less 
than 7). 


D15: Any quotient group A of the triangle group 
(x, r: a? = y? = (cy)" = 1) 
is said to be a (p,q,r)° group. 


D16: The full triangle group has presentation 


(t,y, 2:27 =y? = 2? = (ay)" = (yz)? = (wz)" = 1) 


and is generated by reflections in the sides of a t/p,7/q,7/r triangle. 


D17: Any quotient group A of the full triangle group 


(t,y,2:27 =y? = 2? = (ay)" = (yz)? = (wz)" = 1) 


is said to be a (p,q,r) group. 


D18: A (p,q,r) group A, with generators as above, is properly (p,q,1r) if the subgroup 
generated by xy and yz has index two in A. (The index is otherwise one.) 
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FACTS 


F13: If Aisa (2,q,r)° group, then by the Riemann—Hurwitz equation it has a strongly 
symmetric imbedding in a surface of Euler characteristic 


«= lap (1-3)-(-2)- 0-2) md 


and genus g = 1 — & satisfying 
1 1 1 


2/2 p q 


F14: Similarly, if A is properly (2,q,1r), then it has a symmetric imbedding in a surface 
of genus g satisfying 
ae |A| |; 1 7 


4 |2 p q 


F15: (Hurwitz’s Theorem [GrTu87], [Hu:1893], [Tu80]) If the group A has strong 
symmetric genus 0°(A) > 1, then 


|A] < 84(0°(A) — 1) 


with equality if and only if A is (2,3,7)°. If the symmetric genus o(A) > 1, then 
|A| < 168(0(.A) — 1), with equality if and only if A is properly (2, 3,7). 


F16: [Tu80] As a corollary to Fact F15, there are only finitely many groups of a given 
symmetric genus or strong symmetric genus greater than one. 


F17: [Tu80] (The Cayley-graph version of Hurwitz’s theorem) If y(A) > 1, then 
|A] < 168(7(A) — 1) 
with equality if and only if A is properly (2,3, 7). 


F18: [Tu80] As a corollary to Fact F17, there are only finitely many groups of a given 
genus greater than one. 


F19: [Ba91], [Th91] (Babai-Thomassen Theorem): There are only finitely many 
vertex-transitive graphs of a given genus g > 2. In particular, there are only finitely 
many Cayley graphs of a given genus g > 2. 


REMARKS 


R3: The Hurwitz theorems have been stated here as an upper bound on |.A], rather 
than as a lower bound on o(.A) or 7(A), since the traditional view was bounding the 
order of a group of conformal automorphisms on a Riemann surface of given genus. 
The proof of Hurwitz’s original theorem is a brief analysis of the possibilities for the 
Riemann—Hurwitz equation when y(T’) < 0. The same analysis can be refined to give 
detailed information about A, whenever |A| is large compared to o(A) or 0°(A). For 
example, if |A| > 80(0(A)—1), then A is (2,3, 7)°, properly (2,3, 7), or properly (2, 3, 8). 
For more on refinements of Hurwitz’s theorem see [GrTu87], [Tu83]. 
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R4: The Cayley graph version of Hurwitz’s theorem is unexpected. The formula for 
the Euler characteristic guarantees that both the valence and average face size must be 
small, when the number of vertices of an imbedded graph is large compared to the Euler 
characteristic of the imbedding surface. For a Cayley graph, this means there must be 
a small number of generators and many short relators. The proof is a long, exhaustive 
case-by-case analysis with lots of “relation chasing”. This analysis can be refined to give 
detailed information about A, whenever |A| is large, in a manner analogous to Hurwitz’s 
theorem, although again the proofs are much harder and longer (see [GrTu87], [Tu84b]). 


R5: We have stated the Riemann—Hurwitz equation in terms of a strongly symmet- 
ric imbedding of a Cayley graph for the group A, but since all orientation-preserving 
group actions come from strongly symmetric imbedding, we can also view the Riemann— 
Hurwitz equation as holding for any group acting on the surface of genus g, preserving 
orientation. In that context, one might ask on which surfaces a given group acts pre- 
serving orientation. The Riemann—Hurwitz equation governs the situation for almost 
all genus g in the following sense [Ku87]. Given the group A, there is a number P such 
that if A acts on the surface of genus g preserving orientation, then g = 1 mod(P). 
Moreover, there is such an action by A for all but finitely many such g. The quantity 
P is easily computed in the terms of the Sylow p-subgroups A, of A. In particular, it 
follows from Kulkarni’s theorem that the group A acts preserving orientation on all but 
finitely many surfaces if and only if A, is cyclic for all odd p and A does not contain 
the subgroup Z2 x Z4. 


7.5.3 Groups of Low Genus 


For low genus, minimum imbeddings tend to be highly symmetric. For example, by 
Whitney’s theorem that a 3-connected planar graph imbeds uniquely in the sphere, a 
Cayley graph imbedded in the sphere must be symmetrically imbedded. In addition, 
symmetries of the sphere and torus come from the natural geometry of the surfaces: 
spherical geometry for the sphere, and Euclidean geometry for the torus (viewed as the 
Euclidean plane rolled up by a pair of linearly independent translations). 


DEFINITION 


D19: A Euclidean space group or Euclidean crystallographic group is a group 
of isometries of the Euclidean plane that contains translations in independent directions 
and such that the orbit of any point under the group has no accumulation points (there 
is a minimum distance any point is moved by all the elements of the group not leaving 
the point fixed). 


NOTATION: The dihedral group of order 2n is denoted D,,. The symmetric group and 
alternating group on n symbols are denoted S, and A, respectively. 


FACTS 
Finding all groups of a given small genus has a long history. 


F20: There are exactly 17 Euclidean space groups, up to isomorphism, and presenta- 
tions for the groups are well-known [CxMo80]. 
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F21: (Planar groups [Ma:1896], [GrTu87]|) The groups of strong symmetric genus 0 are 
Zn, Dn, As, Sa, and As. The groups of symmetric genus 0 are these groups together 
with their direct products with Z2. In both cases, the associated group actions can be 
realized by automorphisms of prisms and the platonic solids. For all groups, 7(A) = 0 
if and only if o(A) = 0. 


F22: (Toroidal groups [Ba31], [Pr77], [Tu84a], [GrTu87]) Except for three groups, 
(A) = 1 if and only if o(A) = 1. Moreover, o(A) = 1 if and only if A is a finite 
quotient of one of the 17 Euclidean space groups; this yields partial presentations for 
all toroidal groups (see [CxMo80] or [GrTu87]). The three exceptional groups, with 
(A) = 1 but o(A) > 1 have orders 24, 48, and 48 and presentations 


(a) A=(2,y: 2° =y3 = 1, cy2 = yxy), 
(b) A= (a,y: 0° =y? = Layryry = yryryn), 
(AS ie ye =a = Le eyaye S21): 


F23: (Genus two [Tu84c]) There is exactly one group A such that y(A) = 2. It has 
order 96 and the (2,3, 8) presentation: 


A= (t,y,2:07 =y = 2 =1, (ay)? = (yz)? = (wz) = 1, (wy)*2 = 2(ay)*) 
There is a sculpture on display at the Technical Museum of Slovenia showing a sym- 
metric imbedding of C(A, {z, y, z} in the surface of genus two [GoMa07]. It appears on 
the cover of the journal Ars Combinatorea Mathematica. The generators are colored x 
red, y green, z yellow. 


F24: (Symmetric genus 2 [MaZi95]) There are 4 groups of symmetric genus 2: the 
group of genus 2 and the three exceptional groups of genus 1. 


F25: (Symmetric genus 3 [MaZi97]) There are 3 groups of symmetric genus 3: the 
proper (2,3,7) group PGL (2,7), its (2,3, 7)° subgroup PS'L(2,7) (also known as Klein’s 
simple group of order 168), and the proper (2,4,6) group Z2 x Z2 x S4. 


F26: (Strong symmetric low genus [MaZi00]) There are 6 groups of strong symmetric 
genus 2, 10 groups of strong symmetric genus 3, and 10 groups of strong symmetric 
genus 4. 


7.5.4 Genus for Families of Groups 


Abelian groups form an interesting case study for the genus of a group. Commutators 
in the generators provide many possible faces of size four (quadrilaterals), but it is not 
easy to see how to combine them to form an all-quadrilateral imbedding. In fact, a 
strongly symmetric imbedding of a Cayley graph of valence greater than 4 cannot have 
a face of the form ryx~ty~: the rotation would have to be zy~tx~!y with no room 
for any other generators. The canonical form for the abelian group plays a key role. 
The simplest case, when all the factors in the canonical form are even, was part of 
White’s original paper [Wh72] where he introduced the genus of a group. At the other 
extreme are groups whose minimal genus imbeddings are symmetric. For example, any 
proper (2,3,7) group A has genus 1+ |.A|/168, by the Cayley graph version of Hurwitz’s 
theorem. 


814 Chapter 7. Topological Graph Theory 


DEFINITIONS 
D20: Any abelian group A can be written uniquely in the canonical form 
Zm, X Bm, X-+++Zm, Where m;|\mj41 for 7 =1---r 


D21: The factors Z,, in the canonical form of an abelian group A are called the 
canonical factors of A, and the number r of factors is called the rank of the abelian 
group A. 


D22: A (2,3,7)° group is called a Hurwitz group. 


D23: A proper (2,3,7) group is called a proper Hurwitz group. 


FACTS 


F27: [JuWh80] Suppose that the abelian group A does not have Z3 as a canonical 
factor and that my, is even if the rank r = 3. Then y(A) = 1+ |A|(r — 2)/4, whenever 
the right side of this equation is an integer. 


F28: [BrSq88], [MoPiSkWh87] 7(Z3 x Z3 x Z3) = 7, and the minimal genus imbedding 
has very little symmetry, including faces of many different sizes. 


F29: [PiTu89] Let C be any finite group. If A is an abelian group of rank r at least 
twice that of C, then in most cases y(C x A) = 1+ |C||A|(r — 2)/4. 


F30: [Mc65], [MaZi93] The strong symmetric and symmetric genus are known for all 
abelian groups. 


F31: [Co80] For n > 167, the symmetric group S,, is a proper Hurwitz group. In 
particular, 7(S;,) = 1+ n!/168 for n > 167. 


F32: All the 26 sporadic simple groups are (2, p,q) for some p and q; and 12 of them 
are Hurwitz groups. All the alternating groups A,, for n > 167 and many of the simple 
groups of Lie type and of large enough dimension are Hurwitz groups [LuTa99]. 


F33: The symmetric genus of all the sporadic groups [CoWiWo92], of all the alternating 
and symmetric groups [Co85], and of many other simple groups is known. Notice that 
if the group A is simple, then a(.A) = 0°(.A), since A has no subgroups of index two. 


F34: [MaZi02] For every genus g, 0°(Zm x Dn) = g for some m,n. In fact, 0°(Z3 x 
D,,) =n if n is not divisible by 6. 


REMARK 


R6: Fact F34 is interesting, because it shows that the strong symmetric genus 0° is 
free of gaps. For o and 4, it is not known whether there are gaps. If c 4 8,14, there is 
a family of groups with o = g for every g = c mod 18 [CoTul1]. Moreover, there is also 
such a family for g = 8,14 mod 18 if the prime power factorization of g — 1 contains no 
factor p° = 5 mod 6. For 4, easily constructed quadrilateral imbeddings [Wh72] show 
that y(Z2 x Zos X Zasm) = ms? +1, so if g is a gap for y, then g — 1 is square-free. 
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7.5.5 Non-Orientable Surfaces 


It is possible, of course, to imbed Cayley graphs in non-orientable surfaces, and it 
is natural to ask about minimal non-orientable imbeddings. It is also interesting to 
compare minimal imbeddings in orientable and non-orientable surfaces. 


DEFINITIONS 


D24: If S is a non-orientable surface of Euler characteristic y(S), then 2 — y(S) is 
called the crosscap number or non-orientable genus of S. (It follows from the 
classification of closed surfaces that every surface of crosscap number c can be obtained 
from the sphere by attaching c crosscaps, that is, by removing c disks from the sphere 
and identifying the resulting c boundary components to the boundaries of c Mobius 
strips.) 


D25: For any closed surface S, the quantity 2— y(S) is called the Euler genus of S. 
Thus if S is orientable of genus g, then its Euler genus is 2g, and if S is non-orientable 
of crosscap number c, then its Euler genus is c. 


D26: The non-orientable genus or crosscap number of the group A, denoted 
4(A), is the smallest number c such that some Cayley graph for A imbeds in the non- 
orientable surface S of crosscap number c. 


D27: The Euler genus of the group A, denoted y°(A), is the smallest number e such 
that some Cayley graph for A imbeds in a surface of Euler genus e. 


D28: The symmetric Euler genus of the group A, denoted o°(A), is the smallest 
number e such that some Cayley graph for A imbeds symmetrically in a surface of Euler 
genus e. 


D29: An imbedding for a Cayley graph C(A,X) in a non-orientable surface S is 
symmetric if the natural action of A on C(A, X) extends to an action on S. 


D30: The symmetric crosscap number or symmetric non-orientable genus, 
denoted G(.A), is the smallest number c such that some Cayley graph for A imbeds 
symmetrically in a surface of non-orientable genus c. 


D31: An imbedding of a graph in a non-orientable surface can be described by assigning 
to each vertex a cyclic order or rotation to the set of edges incident to that vertex and 
assigning to each edge a type of 0 or 1, telling whether the edge is orientation-preserving 
or orientation-reversing. 


FACTS 


F35: An imbedding of a Cayley graph C(A, X) in a non-orientable surface is symmetric 
if and only if the rotation is the same at each vertex in terms of the directed edge labels 
and if every directed edge labeled by the same generator has the same type. 


F36: The definitions immediately imply the following: 
(i) ¥(A) < G(A) 
(ii) y°(A) < o*(A) 

(iii) y°(A) = min{27(A), 7(A)} 

(iv) o€(A) = min{20(A),7(A)} 
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F37: Any imbedding of a graph in an orientable surface can be turned into an imbed- 
ding in a non-orientable surface, decreasing the number of faces by at most 1, by chang- 
ing the type of a single edge. Thus ¥(.A) < 27(A) +1. 


F38: If B is a subgroup of A, then 47(B) < 4(.A), by Babai’s theorem [Ba77]. 


F39: [Tu83] The group A has a symmetric imbedding in the non-orientable surface S 
if and only if A acts on S. In particular, if B is a subgroup of A, then o(B) < a(A). 


F40: Hurwitz’s theorem and its Cayley graph version apply to non-orientable surfaces. 


F41: [Tu83] If the group A acts on the non-orientable surface S of Euler characteristic 
x, then Z x A acts on the orientable double covering of S', the surface of Euler character- 
stic 2y, with the A factor orientation-preserving. In particular, 0°(A)—1 < (a(A) — 2). 


F42: The two groups of Euler genus 1 are 23 x Z3 and its Z2-extension 
(z,y:a° =y? =1,[2, yey] = 1) 


There are no groups of symmetric Euler genus 1, since any group acting on the projective 
plane also acts on its orientable double covering, the sphere. There are no groups of 
Euler genus 3 and one group of symmetric Euler genus 3. For 7° = 2 or 7° = 4, all 
minimal imbeddings are orientable, that is, y°(A) = 2 if and only if y(A) = 1, and 
¥°(A) = 4 if and only if y(A) = 2. The only group A with o°(A) = 5 is the symmetric 
group Ss; it is also true that y°(S5) = 5 (see [Tu91], [MaZi01]). 


F43: The groups with 7 = 1 are the groups with y = 0 together with the two groups 
with 7° = 1. The groups with ¢ = 1 are the groups with a? = 0. It is conjectured 
that there are no groups with y = 2. Other cases of low crosscap number are studied 
in [Tu91], [MaZi01]. 


F44: The crosscap number is known for many abelian groups. 
F45: The crosscap number is known for all groups Z, x Dn [EMO8] . 


F46: If A is an improper Hurwitz group, then 7(A) = 1+ |A|/84. In particular, 
4(An) = 1+ |A|/84 for all n > 167 [Co85). 


R7: There are inequalities relating all the various genus parameters to each other 
and to a quotient group Q of a given group A [Tu09, Tu08]. Suppose that p and + 
are any of 7,4, 7°o,0°,¢,0° (possibly p = 7). Define d(p) = 1 for p = y,0,0° and 
6(p) = 2 otherwise. There is a number m(p,o) depending only on p and 7, such that if 
T(A) > d(7), then 


In all cases, m(p,T) < 168. Of particular interest is the case when p = T = 7¥ since it is 
not known whether 7(Q) < (A) for any quotient Q of A. Note that by the Riemann— 
Hurwitz equation for group actions, m(p,e) = 1 for p = o,0°,G, and o®. For more 
about comparisons of other genus-like parameters, see [Tu08]. 
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INTRODUCTION 


The theory of maps is likely to be the oldest topic in this volume, going back not just to 
the 4-color problem posed in 1852 and to the theory of automorphic functions developed 
in the late 1800’s, but to the Platonic solids dating from antiquity. Among the many 
contributors to the subject are Archimedes, Kepler, Euler, Poinsot, de Morgan, Hamil- 
ton, Dyck, Klein, Heawood, Hurwitz, Steinitz, Whitney, Tutte, Coxeter, and Grtinbaum. 
General references on maps include [BeiWi09], [BoLi95], [BrSc95], [CoMo84], [LaZv04], 
[GrTu87], [JoSi87], [MoTh01], and [Wh01]. A systematic combinatorial theory of maps 
appears in [JoSi78], [BrSi85], and [BoLi95]. 


7.6.1 Maps and Polyhedral Maps 


Basic notions are introduced: map and polyhedral map, duality, isomorphism, face- 
width. The existence and uniqueness of a map with a given underlying graph is ad- 
dressed, as well as the well-known Euler—Poincaré formula, which relates combinatorial 
and topological invariants of a map. 


DEFINITIONS 


D1: A surface in this section is a compact, connected, 2-dimensional manifold without 
boundary. A surface S is determined up to homeomorphism by two invariants, its 
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orientability and Euler characteristic. For each even integer y < 2, there is a unique 
orientable surface with Euler characteristic y; it is a sphere with g = (2 — y)/2 handles 
(or, equivalently, a connected sum of g tori), and is denoted S,. For each integer y < 1 
there is a unique nonorientable surface with Euler characteristic x; it is a sphere with 
g = 2— x crosscaps (or, equivalently, a connected sum of g projective planes) and is 
denoted N,. 


D2: The invariant g is called the genus of the surface S. The orientable surfaces 
of genus 0 and 1 are the sphere and torus, respectively, and the nonorientable surfaces 
of genus 1 and 2 are the projective plane and Klein bottle, respectively. 


D3: A continuous mapping f: S — S’ from a surface S to a surface S’ is called a 
branched covering if each point x € S has a neighborhood D such that the restriction 
of f to D is topologically equivalent to the complex mapping z > 2%. If all but 
finitely many points of S’ have precisely k preimages, the covering is said to be a k- 
fold covering. The exceptional points are called branch points of f. If f has no 
branch-points, it is called a smooth covering. 


D4: A map M ona surface S is a finite cell-complex whose underlying topological 
space is S. The supporting surface of a map M is denoted Sjy,. 


D5: The underlying graph of a map M is its 1-skeleton. It is denoted G = Gya,. 


D6: The vertices and edges of a map M are the vertices and edges, respectively, 
of Gy. The faces of a map M are the connected components of Sjy — Gy. The 
boundary of each face is a closed walk in Gy,. 


D7: The 0-, 1-, and 2-dimensional cells of a map M are its vertices, edges, and 
faces, respectively. 


D8: Maps M; and Mz are isomorphic maps, denoted M, ~ Mg, if there is a home- 
omorphism of the supporting surfaces that induces an isomorphism of the underlying 
graphs. 


D9: A map homomorphism f: M — N is a branched covering Sjy > Sy between 
the supporting surfaces that induces a graph epimorphism Gyy — Gy between the 
underlying graphs. 


D10: The dual map M* of a map M on a surface S is a map in the same surface S, 
whose vertex set V* consists of one point interior to each face of M, and whose edge 
set E* consists of, for each edge e of M, an edge e* that crosses edge e and joins the 
vertices of V* that correspond to the faces incident with e. 


D11: The Petrie dual of a map M is a map M? with the same underlying graph 
as M, whose face boundaries are closed walks in Gy, such that any two consecutive 
edges, but not three, belong to a face of M. 


D12: The medial map M™? of a map M is a map whose vertex-set is the edge-set 
of M, and whose edges join each pair of edges of M which are consecutive edges in a 
face-boundary walk of M. 


D13: A polyhedral map M is a map whose face boundaries are simple cycles, and 
such that any two distinct face boundaries are either disjoint or meet in either a single 
edge or a single vertex. 
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D14: A polyhedral map M is a triangulation if the boundary of each face is a 3-cycle. 


D15: The face-width of a map M, denoted fw(M), is the minimum number of 
points |7 M Gyz| over all noncontractible simple closed curves 7 on the surface. 


D16: The edge-width of a map M, denoted ew(M), is the length of a shortest cycle 
in Gy that is noncontractible on the supporting surface. 


D17: The operation of edge-contraction for a triangulation, and its inverse oper- 
ation vertex-splitting, are exhibited in Figure 7.6.1. After contracting an edge in a 
triangulation, the map may no longer be a triangulation, i.e., no longer polyhedral; this 
occurs if the edge is contained in a 3-cycle that is not a face boundary or if the map is 
the tetrahedral map. 


VN’ 
Figure 7.6.1: Edge-contraction and vertex-splitting in a triangulation. 


D18: A minimal triangulation of a surface S is a triangulation such that the 
contraction of any edge results in a map that is no longer polyhedral. 


D19: A k-minimal triangulation is a triangulation of a non-spherical surface with 
edge-width & such that each edge is contained in a noncontractible k-cycle. (Note that 
for non-spherical triangulations, minimal and 3-minimal are equivalent.) 


EXAMPLES 


E1: A map M in the torus, formed by a hexagonal imbedding of the 3-dimensional 
cube Qs, and its dual map M* appear in Figure 7.6.2. The torus is obtained by identify- 
ing like labeled edges on the boundary of the polygon. Neither M nor M* is polyhedral. 


(a) M (b) M* 


Figure 7.6.2: A toroidal map and its dual. 
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E2: Figure 7.6.3 shows two nonisomorphic maps in the sphere that have the same 


2-connected, but not 3-connected, underlying graph. The two maps are related by a 
Whitney flip. This example is relevant to Fact F8 below. 


Figure 7.6.3: Maps in the sphere with the same 2-connected underlying graph. 


E3: Figure 7.6.4 shows two polyhedral maps in the projective plane with isomorphic 
3-connected underlying graphs. The projective plane is depicted here as a disc with 
antipodal boundary points identified. This example shows that the analogy to the 
Whitney uniqueness theorem (Fact F8) fails for projective planar graphs. 


Figure 7.6.4: Maps in the projective plane with the same 3-connected underlying graph 


E4: Figure 7.6.5 shows the tetrahedral map M in the sphere and its Petrie dual. The 
map M? is an imbedding of the complete graph Ky into the projective plane known as 


the hemi-cube. Its antipodal double cover (see Remark R8) is the spherical imbedding 
of the cube Q3. 


one 


~---- 


(b) MP 
Figure 7.6.5: A map and its Petrie dual. 


(a) M 
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E5: There is one minimal triangulation of the sphere (i.e., the tetrahedral map), two 
minimal triangulations of the projective plane (see Figure 7.6.6), 21 of the torus, and 
25 of the Klein bottle. 


Figure 7.6.6: The two minimal triangulations of the projective plane. 


REMARKS 


R1: It is equivalent to regard a map as a 2-cell ambedding of a graph G ona surface S, 
i.e., an imbedding such that the connected components of S — G are 2-cells. 


R2: A map may have multiple edges, self-loops, and vertices of degree 1 or 2. A 
polyhedral map, however, can have none of these. Moveover, in a polyhedral map, the 
closure of each face is topologically a closed disc. 


R3: The boundary of a face may not be a simple cycle. Whether every (2-connected) 
graph can be imbedded on a surface in such a way that the face boundaries are (simple) 
cycles is not known (see the conjectures below). 


R4: Face-width, introduced in [RoSe88], is a measure of local planarity, or of how 
dense the graph is in the surface, or of how well the graph represents the surface. It is 
sometimes known as representativity. 


R5: A non-spherical map M is polyhedral if and only if fw(M) > 3 and its underlying 
graph is 3-connected; see [MoTh01, Proposition 5.5.12]. By Steinitz’s Theorem (see 
Fact F22) it follows that the polyhedrality of maps generalizes the notion of a convex 
polyhedron. 


R6: A map &M on the sphere S$ can be drawn in the plane via stereographic projection 
from any point of S — Gy. 


R7: The Petrie dual of a map MM is well-defined for every M. Its faces are so-called 
zig-zag polygons of the map M. The supporting surface of the Petrie dual of a map on 
an orientable surface is orientable if and only if M is bipartite. 


R8: For every map M on a nonorientable surface, there exists an orientable map M 
and a homomorphism M — M. Among such maps there exists a unique minimal map, 
the antipodal double cover of M. 


R9: In the class of cubic polyhedral maps, the operation dual to edge-contraction 
in a triangulation is the operation of edge-suppression. It consists of removal of an 
edge e incident with two faces and suppressing the end-vertices of e, thereby producing 
a smaller cubic map. 
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R10: The concept of map has been extended to cell-complexes whose underlying topo- 
logical space is a manifold of dimension greater than 2. This includes, in particular, the 
boundary complex of any polytope. The generalizations, though natural and interesting, 
are omitted here; for more information see, e.g., [Fe76], [Ga79], and[Vi83al]. 


FACTS 


F1: Euler—Poincaré formula: For any map M with ag vertices, a, edges, a2 faces, 
and Euler characteristic x(M), 

Ao - a, +aQq = x(M). 
F2: Every connected graph G is the underlying graph of a map. The rotation schemes 


introduced in Definitions D31 and D32 give a systematic method for obtaining all 2-cell 
imbeddings of G. 


F3: The underlying graph of a map M on a surface of Euler characteristic y has cycle 
rank at least 2—. A map whose underlying graph is a tree is necessarily spherical. 


F4: If M isa map, then (M*)* = M, (M?)P = M and (M*)™4 = med, 

F5: If M is a map, then fw(M*) = fw(M). In particular, the dual of a polyhedral 
map is polyhedral. 

F6: [MoTh01, Proposition 5.5.4] If M is a map, then fw(M) = ew(M™¢) /2. 

F7: The underlying graph of a polyhedral map is 3-connected. 


F8: Whitney Uniqueness Theorem [Whi32]: A 3-connected planar graph has a 
unique imbedding in the sphere. 


F9: [Ne83] All 6-connected toroidal graphs are uniquely imbeddable in the torus. On 
the other hand, there are infinitely many 5-connected toroidal triangulations of the torus 
that are not uniquely imbeddable in the torus; see [La87]. 


F10: [Ne85] A 5-connected projective planar graph distinct from K6, containing a 
subgraph homeomorphic to K¢, is uniquely imbeddable in the projective plane. 


F11: [Th90, Corollary 5.1.7] If a 3-connected graph G has an imbedding G > S$ 
whose edge-width is greater than the length of any face-boundary, then G is uniquely 
imbeddable in S. 


F12: [BaEd89] The set of minimal triangulations is finite for every fixed surface (see 
Example E5). In other words, for each surface, there is a finite set of triangulations 
from which any triangulation on that surface can be generated by vertex-splittings. In 
particular, every triangulation of the sphere can be obtained from the tetrahedral map 
by a sequence of vertex-splittings. 


F13: For any k > 3, the set of k-minimal triangulations on a fixed surface is finite 
[MaNe95]. ((MoTh01, Theorem 5.4.1] provides another proof.) 
CONJECTURES 


C1: The Strong Imbedding Conjecture: Every 2-connected graph can be imbed- 
ded on a surface so that each face is bounded by a simple cycle in the graph. 
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C2: The Cycle Double Cover Conjecture: Every 2-connected graph contains a 
collection C of cycles such that every edge is contained in exactly two cycles of C. The 
validity of the Strong Imbedding Conjecture implies validity for the Cycle Double Cover 
Conjecture. 


C3: Griinbaum’s Conjecture: If a cubic graph G has a polyhedral imbedding into 
an orientable surface, then G is 3-edge-colorable. 


REMARKS 


R11: According to Fact F2 above, every connected graph has a 2-cell imbedding into 
an orientable surface. Moreover, every connected graph with at least one cycle has a 
2-cell imbedding into a nonorientable surface. 


R12: For cubic graphs, Conjectures C1 and C2 are equivalent. It is known that the 
smallest counterexample to Conjecture C2 must be a non-3-edge-colorable cyclically 
4-edge-connected cubic graph of girth at least 12 (see §5.1.5). 


R13: An equivalent formulation of Griinbaum’s Conjecture states that the dual of a tri- 
angulation of an orientable surface has a 3-edge-colorable underlying graph. Still another 
formulation of the conjecture states that no snark (see §5.2) admits a polyhedral imbed- 
ding into an orientable surface. In contrast, it is known that every nonorientable surface 
supports a polyhedral embedding of some snark [LiCh12]. For the sphere, Griinbaum’s 
conjecture is equivalent to the Four Color Theorem. The conjecture was shown to be 
false for every orientable surface of genus at least 5 [Ko09], and remains open only for 
surfaces of genus 1, 2, 3, and 4. 


7.6.2 Existence and Realizations of Polyhedral Maps 


Elementary equalities hold among the basic parameters of a map. The two questions 
addressed in this section are, first, when are the necessary conditions also sufficient 
for the existence of a map with these parameters and, second, when can the map be 
imbedded in the Euclidean space E? or E* in such a way that the faces are plane convex 
polygons. The classical results for maps on the sphere are Eberhard’s theorem of 1891 
and Steinitz’s theorem of 1922. 


DEFINITIONS 


D20: A map is of map type {p,q} if each face has p edge incidences and each vertex 
has q edge incidences. (No global symmetry is implied; in fact, the automorphism group 
of such a map, as defined in 87.6.6, may be trivial.) 


D21: The cell-distribution vector (a-vector) of amap M is the 3-tuple (ao, a1, a2), 
where Qo, @1, @2 are the numbers of vertices, edges, and faces of MW, respectively. 


D22: The face-size sequence (p-sequence) of a polyhedral map M is the sequence 
{pi}i>3, Where p; is the number of i-gonal faces in M. The reduced face-size sequence 
is the sequence p* = {p;} 


i>3. 
iF6 
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D23: The verter-degree sequence (v-sequence) of a polyhedral map M is the 
sequence {v;};>3, where v; is the number of vertices of degree i in M. The reduced 
vertex-degree sequence is the sequence v* = {v;}i>4. 


D24: For a given triple (p*,v*, g), let Pg(p*, v*, g) be the set of all integers pg for which 
there is a polyhedral map of genus g > 0 with pg hexagonal faces, with the corresponding 
reduced p-sequence p*, and with the corresponding reduced v-sequence v*. 


D25: A geometric realization (realization) of a polyhedral map M is an imbedding 
of M into Euclidean space E% (without self intersection) such that each face is a plane 
convex polygon and that adjacent faces are not coplanar. 


EXAMPLES 


E6: The map MM in Figure 7.6.2(b) is of type {3,6}, with a-vector (4,12,8). Its dual 
M* is of type {6,3}, with a-vector (8,12,4). The maps in Figure 7.6.4 both have v- 
sequence (6,3), but the first has p-sequence (0,6,0,1), while the second has p-sequence 
(13,2); 


E7: Five polyhedral maps on the sphere and their corresponding 3-dimensional real- 
izations appear in Figure 7.6.7. 


Figure 7.6.7: The Platonic solids as 3-dimensional realizations of maps. 


FACTS 


F14: The a-vector, the p-sequence, and the v-sequence satisfy the following elementary 
equalities: 


Soi = a2, Sou = ao, So ini = 2a, = So iv; (7.6.1) 


i>l i>l i>l i> 


F15: For amap M on an orientable surface of genus g and k, A > 0 such that «+A = 1, 
Euler’s formula implies that 


So (ni —2)u; + SOOi- 2)pi = Ag — 1). (7.6.2) 
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For example, taking « = 1/3, and M polyhedral of genus g yields 


S°(6-i)pi +25 °(3- iu, = 12(1 - 9). (7.6.3) 


i>3 i>3 


In particular, if M is a cubic polyhedral map on the sphere, then 


S"(6-a)p; = 12. (7.6.4) 


i>3 


A pair of reduced sequences (p*,v*) is a g-admissible pair of map sequences if 
Conditions (7.6.1) and (7.6.3) are satisfied. 


F16: By Equation (7.6.1), for a polyhedral map on a surface with a given p-sequence 
and a given reduced v-sequence the following holds 


F17: [Eb1891] Eberhard’s Theorem: Condition (7.6.4) above is sufficient for the 
existence of a cubic spherical map, in the following sense: if a sequence {/p; |i > 3, 1 4 6} 
satisfies }°,46(6 — k)px = 12, then there exist values of pg such that {p; |i > 3} is the 
p-sequence of a simple polyhedral map on the sphere. For variations on Eberhard’s 
Theorem, see [Gru70, Je93a, Je93b]. 


F18: Eberhard’s theorem establishes that Pg(p*, {O};i>4,0) is non-empty for each px 
satisfying (7.4). The sets Pg(px,v*,0) and P¢(p*, vx, 1) are determined, up to a finite 
number of exceptions, for all admissible pairs (p*,v*); see [Je93a, Je93b]. There are 
infinitely many 0-admissible pairs (p*,v*) and exactly one 1-admissible pair (p*,v*) 
with P¢(p*,v*,0) = 0 = Pe(p*, v*, 1). 


F19: [Je93b, Theorem 2] For any g-admissible pair (p*,v*), g > 2, the set Pg(p*,v*, g) 
contains all but finitely many positive integers. 


F20: [Gri83] Equation 7.6.3 for the torus (with « = 1/3) becomes 


25° -3)u,+ 5-6) p; = 0, 


which leads to the following analogue of Eberhard’s theorem for the torus. Given a 
sequence {p;|i > 3, i 4 6} and a positive integer s, there is a realization in E? of some 
polyhedral map on the torus with p-sequence {p; |i > 3} and 5>(i—3)v; = s if and only 
if ),46(6 — k)px = 25 and s > 6. Related results appear in [BaGrHo9]1]. 


F21: [EdEwKu82] If S is a surface with Euler characteristic y, if ao,a1,a2,p,q are 
positive integers such that ag — a, + a2 = x, and if pag = 2a, = qao, then there exists 
a map of type {p,q} on S with a-vector (ag, a1,Q@2), except when S is the projective 
plane, {p,q} = {3,3}, ao = ag = 2, and a; = 3. 


F22: [Sti22] Steinitz’s Theorem: Every polyhedral map on the sphere is isomorphic 
to the boundary complex of a 3-dimensional polytope. Thus, any polyhedral map on 
the sphere has a realization in E?. 
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F23: [Al71, Gru67] A 3-valent polyhedral map M cannot be realized in Euclidean 
space of any dimension, unless the supporting surface of M is the sphere. 


F24: [BrSc95] Each triangulation on the torus or on the projective plane can be realized 
in E*. 


F25: [BrWi93] On any nonorientable surface N,, there exists a triangulation that 
cannot be realized in E?. (When g > 1, it is an open question whether each triangulation 
of orientable genus g can be realized in E®.) 


F26: [Sti06] The vector (a9,01,@2) is the a-vector of a realization in E? of some 
polyhedral map on the sphere if and only if ag — ay + ag = 2, 4 < ag < 2aq — 4, and 
4 < ag — 209 —A, 


F27: [Gri83] The vector (a9,Q1,Q2) is the a-vector of a realization in E°? of some 
polyhedral map on the torus if and only if ag — a, +a2 = 0, ag(11—az2)/2 < ag < 2a, 
ao(11 = ao) /2 < a2 < 220, 2a) = 3Q9 > 6, and Q1 x 19. 


REMARKS 


R14: Cubic spherical polyhedral maps with reduced p-sequence given by ps5 = 12 and 
p; = 0 for each j ¢ {5,6}, known as fullerenes, represent models of complex carbon 
molecules. They have been extensively investigated during the recent decades; see 
[FoMa07]. 


R15: Ifa map M with ap vertices and Euler characteristic y(S) is polyhedral, then 


aoe i V9 — a) | 


2 


and this lower bound is attained for all surfaces except $2, N2, and N3. By duality the 
same bound holds for ag. 


7.6.3. Paths and Cycles in Maps 


This section covers three topics involving paths and cycles: the Lipton—Tarjan separator 
theorem, the existence of nonrevisiting paths in polyhedral maps, and the decomposition 
of maps along cycles in the graph. The third topic is related to a result of Robertson 
and Seymour on minors. 


DEFINITIONS 


D26: A path p in the underlying graph of a map M is said to be a nonrevisiting 
path if the intersection of p with the boundary of F' is connected for each face F' of M. 


D27: A surface S has the nonrevisiting path property if, for every polyhedral map 
M on S, any two vertices of M are joined by a nonrevisiting path. 


D28: A map M’ isa minor of a map M if M' can be obtained from M by a sequence 
of edge contractions and deletions. The operations of edge deletion and edge contraction 
on a graph can be extended to a surface imbedding of the graph in an obvious way. 
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EXAMPLE 


E8: A polyhedral map on the surface S2 that fails to have the nonrevisiting path 
property appears in Figure 7.6.8 below. There is no nonrerevisiting path from x to y. 
(The map is obtained by gluing along like labeled edges.) 


Figure 7.6.8: A map on 5S» that does not satisfy 
the non-revisiting path property. 


FACTS 


F28: [LiTa79] Planar Separator Theorem: A planar graph with n vertices has a 
set of at most 2/2n vertices whose removal leaves no component with more than 2n/3 
vertices. 


F29: [AlSeTh94] Let M be a loopless map on the sphere with n vertices. Then 
there is a simple closed curve 7 on the surface of the sphere passing through at most 
k < 3\/2n/2 vertices (and no other points of the graph) such that each of the two open 
disks bounded by 7 contains at most 2n/3 — k/2 vertices. This result slightly improves 
the Lipton—Tarjan separator theorem. 


F30: [GiHuTa84] A map of genus g contains a set of at most O(,/gn) vertices whose 
removal leaves no component of the graph with more than 2n/3 vertices. This generalizes 
the Lipton—Tarjan theorem to maps on orientable surfaces of higher genus. 


F31: [PuVi98] For polyhedral maps, the nonrevisiting path property holds for the 
sphere, torus, projective plane, and the Klein bottle. It fails for all other surfaces 
except possibly the nonorientable surface of genus 3 (see [PuVi96] and Example E8). 


F32: The nonrevisiting path property holds for every polyhedral map with face-width 
at least 4. 
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F33: [RoSe88] Let Mo be any map on a surface S other than the sphere. There exists 
a constant k such that, for every map M on S with fw(M) > k, the map Mp is a map 
minor of M. The following two results provide bounds for the constant k when the 
given Mp contains certain sets of disjoint cycles. 


F34: [Sc93] A map M on the torus with face-width w contains |3w/4| disjoint non- 
contractible cycles. 


F35: [BrMoRi96] For general surfaces there exist |w/2]| pairwise disjoint contractible 
cycles in the graph of any map M, all containing a particular face, | (w—1)/2]| pairwise 
disjoint, pairwise homotopic, surface nonseparating cycles, and |(w—1)/8]|—1 pairwise 
disjoint, pairwise homotopic, surface separating, noncontractible cycles. (It is unknown 
whether any map of orientable genus g > 2 with face-width at least 3 must contain a 
noncontractible surface separating cycle.) 


F36: [Bar88] Every polyhedral map on the torus (projective plane, Klein bottle) is 
isomorphic to the complex obtained by identifying the boundaries of two faces of a 
3-polytope (cross identifying one face of a 3-polytope, cross identifying two faces of a 
3-polytope). 

F37: [Yu97] (see also [Th93]) If d is a positive integer and M is a map on S, of face- 
width at least 8(d+ 1)(29 — 1), then the underlying graph of M contains a collection of 


induced cycles C,C2,...,C, such that the distance between distinct cycles is at least 
d and cutting along the cycles results in a map on the sphere. 


F38: [Sc91] Schrijver proved necessary and sufficient conditions (conjectured by Lovasz 


and Seymour) for the existence of pairwise disjoint cycles C;,...,Cx in the underlying 
graph of a map M homotopic to given closed curves C),...,C on the surface. 
REMARK 


R16: The Lipton—Tarjan separator theorem has applications to divide-and-conquer 
algorithms. Nonrevisiting paths arise in complexity issues for edge following linear 
programming algorithms like the simplex method. 


7.6.4 Map Coloring 


The famous problems on map coloring, the Four Color Problem and the Heawood Map 
Coloring Problem, stimulated considerable research in the area, and the Heawood Prob- 
lem led to the birth of topological graph theory. The long-lasting effort of mathemati- 
cians to solve both problems significantly influenced development not only in graph 
theory and combinatorics, but also in algebra, geometry, computer science, and others. 
This subsection is aimed at presenting the solution of the Heawood Map Coloring Prob- 
lem along with other related results. 


DEFINITION 


D29: The chromatic number chr(S) of a surface S is the least number of colors 
sufficient to properly color the faces of any map on S. By duality, it is also the least 
number of colors sufficient to properly color the vertices of any map on S$. In this 
section, coloring will mean vertex coloring. 
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FACTS 
F39: [ApHa76] Four Color Theorem: chr(So) = 4. 
F40: [Fr34] chr(N2) = 6. 


F41: [RiYo68] Heawood Map Coloring Theorem: For every surface S$ except the 


Klein bottle No, 
1/49 — 24 
chr(S) = S am 8) 


The right-hand side of the equation is called the Heawood formula. 


F42: [Di52, AlHu79] If G is a graph imbedded into a surface S other than the sphere, 
then chr(G) < chr(S) unless G contains the complete graph of order chr(S') as a 
subgraph. 


F43: [FiMo94] There is a universal constant c such that every map M on a surface 
with Euler characteristic x < 2 such that ew(M) > clog(2 — x) is 6-colorable. 


F44:; [Th93] Any map M on S, with ew(M) > 214916 is 5-colorable. 
F45: [Gr59] Grétzsch Theorem: Every planar map of girth at least 4 is 3-colorable. 
F46: [Kel879, TsWel1] A planar triangulation is 3-colorable if and only if it is Eulerian. 


F47: [HuRiSe02] There is a constant f(g) such that every Eulerian triangulation 4 
on an orientable surface of genus g with ew(M) > f(g) is 4-colorable. 


F48: [Hu95] For every positive integer g there is a constant f(g) such that the following 
holds: If M is a map on S, such that ew(M) > f(g) and all boundary-walks are of even 
length, then M is 3-colorable. 


F49: [Th97] For a fixed surface S$, there is a polynomial time algorithm to decide 
whether a map on S can be 5-colored. 


F50: The problem of deciding whether a map can be 3-colored is NP-complete even 
for maps on the sphere [GaJo79]. 


F51: [RSST96] On the sphere, a 4-coloring can be found in O(n?) steps. 


REMARKS 


R17: The problem of determining the chromatic number of the sphere appeared in 
an 1852 letter from Augustus de Morgan to Sir William Hamilton, and was likely due 
to Francis Guthrie, the brother of a student of de Morgan. A computer dependent 
proof of Appel and Haken [ApHa76] that four colors suffice was considerably simplified 
[RSST97], but still remains computer dependent. 


R18: That the formula in the Heawood Map Coloring Theorem gives an upper bound 
on chr(S') was proved by Heawood [He1890]. That there exist graphs that actually 
require the number of colors given by that formula is a consequence of the formula for 
the genus of complete graphs due to Ringel and Youngs [RiYo68). 
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R19: For surfaces with Euler characteristic y < 0 the problem of determining the 
chromatic number of a surface is equivalent to the determination of the minimum genus 
of the complete graph K,, [RiYo68], [GrTu87]. 


R20: It follows from the Heawood formula that the chromatic number of surfaces 
increases with the genus. In contrast, by Facts F43 and F44 maps with large edge- 
width or face-width have chromatic number at most six or at most five, respectively. To 
prove stronger upper bounds on the chromatic number one needs additional conditions 
on the maps in question; see Facts F46, F47, and F48. 


R21: There is a quadratic-time algorithm to color a planar map with four colors 
[RSST97, p.27]. Since deciding the 3-colorability of a graph is NP-complete, it is 
widely thought to be unlikely that there exists a polynomial-time algorithm for de- 
ciding whether a map on an arbitrary surface can be 4-colored. 


PROBLEMS 


RP1: [A181] Let S be any surface. Does there exist a natural number q(5') such 
that any graph G on S contains a set A of at most q(S) vertices such that G — A is 
4-colorable? 


RP2: [GiTh97] Can the chromatic number of a triangle-free graph on a fixed surface 
be found in polynomial time? 


EXAMPLE 


E9: Figure 7.6.9(a) exhibits a triangular imbedding of Kg in the projective plane, and 
Figure 7.6.9(b) exhibits a triangular imbedding of K7 in the torus. This shows that 
x(Ni) > 6 and that ($1) > 7. In fact, y(Ni) = 6 and x(S1) = 7, in accordance with 
Facts F40 and F41. 


Figure 7.6.9: Triangular imbeddings of K6 into the projective plane and of K7 into the 
torus. 
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7.6.5 Combinatorial Schemes 


The definition of map in 87.6.1 as a cell complex is topological. A strictly combinato- 
rial description, although less intuitive, is often more convenient to apply. Four such 
schemes are described: rotation system, signed rotation system, crystallization, and 
graph encoding of a map. They have been discussed in [Ed60], [Ja68], [Co75], [Wa75], 
[Ri77], [Sta78], [JoSi78], [Tut79], [Sta80], [Li82] and [BrSi85]. 


DEFINITIONS 


D30: A graph is a triple G = (D;V,L), where D is a set of darts (or directed edges), 
V is a partition of D, the vertex-set of G, and LF is an involutory fixed-point-free 
permutation of D called the dart-reversing involution. The orbits of L are the edges 
of G and the incidence relation between vertices and edges of G is defined by non-empty 
intersection. 


D31: A rotation system is a pair (G; R), where G = (D;V,L) is a connected graph, 
and R is a permutation of D whose system of orbits coincides with V. The cycle of R 
permuting the darts of a vertex v € V is called the local rotation at v and is denoted 
R,. A rotation system (G;R) with G = (D;V,L) can also be represented as a triple 
(D; R, L) without making the graph G explicit. This scheme encodes any map with G 
imbedded on an orientable surface with a fixed orientation [Ed60, GrTu87, JoSi78]. 


D32: A signed rotation system is a triple (G;R,o), where (G; R) is a rotation 
system and o is a mapping that assigns to each edge of G an element of {1,—1}. 
This scheme encodes any map with G imbedded on any closed surface including the 
nonorientable surfaces [Ri77, Sta78]. 


D33: A switch of signed rotation system at a vertex v is an operation transform- 
ing a signed rotation system (G; R,o) into (G; R’,o’), where Ri, = R,' and Ri, = Ru 
for each vertex u € V — {v}; and o’(e) = —o(e) for every edge incident with v except 
for loops, and a‘(e) = o(e), otherwise. 


D34: The map determined by a rotation system (G; R) is a map on an orientable 
surface whose faces are bounded by the cycles of RL. Regarding each cycle of RL 
as the boundary of a polygonal 2-cell and gluing together 2-cells along paired darts 
x and Lz results in an orientable surface in which G is imbedded. Conversely, the 
rotation system of a map M on an orientable surface is a pair (G; R), where G is 
the underlying graph of M and R is the permutation induced by the chosen orientation 
of the supporting surface that cyclically permutes darts based at any vertex. 


D35: The map determined by a signed rotation system (G;R,c) is defined as 
follows. Let L and R be permutations of D x {1,—1} defined by R(a,i) = (R*(x), i) 
and L(a,i) = (La,io(x)). The boundary walks of the map are formed from the cycles 
of RL by ignoring the second coordinate of each pair (x,i) € D x {1,—1}. The cycles 
of RL occur in pairs giving the same face boundary twice, once in each direction. 
The supporting surface of the map is formed by taking a representative of each pair, 
spanning it with a 2-cell, and gluing the boundaries of faces along the corresponding 
darts. Conversely, a signed rotation system of a map M with underlying graph 
G is a triple (G; R,a), defined as follows. For each vertex v choose a local orientation 
around v. Let R, be the cyclic permutation of darts based at v induced by the chosen 
orientation, and set R = [[,, Ry. For each edge e define o(e) = 1 if and only if the local 
orientations at the end-vertices of e are consistent. 
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D36: The barycentric subdivision of a map M is a map M°*" whose vertices are 
the vertices of M, the centers of edges of M/, and the centers of faces of I. Two vertices 
of M°*" are joined by an edge on the supporting surface of M if the corresponding cells 
of M are incident. The barycentric subdivision of M is a triangulation; its triangles are 
called flags of M. Each edge of M touches exactly four flags. A vertex of valency k 
in M is incident with 2k flags and the center of a face of size m in M is incident with 
2m flags. Each vertex v of M*” is labeled with 0,1, or 2, according to the dimension 
of the cell in M that the vertex v represents. Every edge e of M*" is colored by the 
label missing at its end-vertices. 


D37: The graph encoding of a map M isa pair G = (H,¢), where H is the underlying 
graph of (M")*, the dual of the barycentric subdivision of M, and ¢ is a proper 3- 
edge-coloring of H with color i assigned to the dual edge e* if and only if the edge e of 
M**" is colored with i. If G = (H,¢) is a graph encoding of a map, then H is a cubic 
graph and each component of a subgraph H, formed by edges colored by 0 and 2 is a 
4-cycle [Tut63, Ga79, Li82, Vi83a, Tut84, BrSi85, BoLi95]. 


D38: The crystallization of a map M is a quadruple (F370, 71,72), where F is 
the set of flags of M (i.e., triangles of M°*") and, for i € {0,1,2}, 7; is a fixed-point- 
free involution of F' transposing two incident flags sharing an edge of M°®" colored i. 
The group (70,71, 72) is transitive on F’, (T2T 9)? = 1, and 7979 is fixed-point-free. This 
scheme encodes a map on any closed surface. 


D39: The map M defined by a crystallization (F;7,71,72) is constructed as 
follows. First construct a cubic 3-edge-colored graph H with vertex set F by joining 
vertices x and y with an edge colored i whenever y = 7;(). Form a surface S by gluing 
a 2-cell to every bicolored cycle of H, which defines an imbedding of H into S. The 
dual map of the imbedding of H is by definition the barycentric subdivision M°*” of the 
map M defined by the crystallization (F';7,71,72). To construct the map given by a 
graph encoding G = (H,¢) we first define the associated crystallization by identifying 
flags with the vertices of H and setting 7;(x) = y, for 7 € {0,1,2}, if and only if there 
is an edge zy in H colored by 7. Then we proceed as above. 


D40: The underlying graph G = (D;V,L) of the map defined by a crystallization 
(F; 7,71, 72) is defined by taking D to be the set of orbits of 72 with L sending a dart 
{x,T2(x)} to the dart {70(x), ToT2(x)}. Each vertex of G is formed by the union of darts 
constituting an orbit of (71,72) on F’. 


REMARKS 


R22: A rotation scheme (G; R) defines not just a map M on an orientable surface S 
but it also prescribes a global orientation of S$. The scheme (G; R~') determines the 
mirror image of the map M, denoted M~, which is topologically identical to M, 
up to orientation. Other schemes describe maps without regard to orientation. 


R23: Two rotation schemes (G;R,) and (G; R2) describe the same (isotopic) imbed- 
dings of G if and only if Ri = Re. Similarly, two crystallizations (F';70,71,72) and 
(F';7),7{,73) describe isotopic imbeddings of the underlying graph if and only if r/ = 7; 
for i € {0,1, 2}. 
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R24: Two signed rotation systems (G; R,o) and (G; R’,o’) describe the same imbed- 
ding of G if and only if there exists a sequence of switches that transforms (G; R,c) 
into (G; R’,a’). 


R25: The dual of a map defined by a rotation system (D; R,L) is a map determined 
by the system (D; RL, L). The dual of a map given by a crystallization (F; 70, 71, 72) 
is a map defined by the crystallization (F'72,71,70). If M is given as a graph encoded 
map G = (H,¢), the dual map M* is encoded by the same graph H with edge colors 0 
and 2 interchanged. 


R26: The Petrie dual of a map defined by a signed rotation system (G; R,c) is a map 
determined by the system (G; R, —c). The Petrie dual of a map given by a crystallization 
(F; 70,71, 72) is the map defined by the crystallization (F'; 79T2, 71, T2). 


R27: Generalizations of graph encoded maps to higher dimensions were introduced in 
a topological context by [Fe76], [Ga79], [Li82], [Vi83al]. 


R28: Lifting the restriction that (7972)? = 1 in the crystallization of a map, or that 
each component of H, in the graph encoding G = (H,¢) of the map must be a 4-cycle, 
results in the concept of a hypermap or hypergraph imbedding. 


R29: Properties of maps studied within the theory of maps are combinatorial or 
algebraic schemes. Therefore, a map M is usually identified with a scheme describing M. 


EXAMPLES 


E10: Figure 7.6.10 shows the tetrahedral map on the sphere. It has twelve darts 
{1,2,3,4,5,6} x {—1,1}; the dart-reversing involution L takes (x,i) > (x,—i). In the 
diagram, the dart (2,1) is encoded as x and is represented by a directed edge, based at 
its origin. The dart (z,—1) = x~! is represented with the same edge endowed with the 
opposite orientation. The rotation consistent with the counterclockwise orientation of 
the sphere is R = (1,2,3)(171,6,5~+)(271,5,4-1)(371, 4,6-+). 


ii 


Figure 7.6.10: Tetrahedral map. 


E11: Figure 7.6.11 describes an imbedding of Kk’, into the projective plane. The corre- 
sponding signed rotation system is (Ky; R,o), with R = (a, f-',d~')(c, f,b-+)(d, e, c+) 
(a~',b,e~) shown in the left part of the figure and the signature 0: E(K,4) > {—1,1} 
described in the right part of the figure. 
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Figure 7.6.11: Map on the projective plane with its signed rotation system. 


E12: The left part of Figure 7.6.12 represents a spherical map; the right part shows 
its graph encoding. 


Figure 7.6.12: Map and its graph encoding. 


FACTS 


F52: The supporting surface of a map defined by a signed rotation system (G; R,c) 
is nonorientable if and only if G contains a cycle with an odd number of negative 
edges. By Definition D35 the rotation system (D; R, L), where D = D x {1,-1}, 
R(a,i) = (Ri(x),i) and L(x, i) = (La, io(a)) determines the antipodal double cover of 
a nonorientable map given by the signed system (G; R,c). 


F53: The supporting surface of a map defined by a graph encoding G = (H,¢) is 
orientable if and only if the graph H is bipartite. The same criterion can be used to 
determine the orientability of a map defined by a crystallization. 


F54: If (F370,71,72) is a crystallization of a map on an orientable surface, then the 
same map (up to the choice of an orientation) is defined by the rotation system (D; R, L), 
where D is the partition set F'* in the bipartition of F, L = tot], and R = 772. 


F55: The antipodal double cover of a nonorientable map defined by a crystalliza- 
tion (F'; 7,71, 72) is, up to the choice of an orientation, the map with rotation system 
(F; R,L), where L = 7972 and R = 72. 


F56: The Euler characteristic of the supporting surface of a map given by a crystal- 
lization (F'; 7,71, 72) is X = Ao — Q1 + G2, where ao, a1, and az is the number of orbits 
of the groups (71,72), (7,72), and (70, 71), respectively. 
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7.6.6 Maps and Universal Tessellations 


Every map M has a universal cover by a uniform tiling of the sphere, Euclidean plane, 
or the hyperbolic plane. This fact and its consequences are the subject of this section. 
Representation of a map as a quotient of its universal cover is closely related to groups 
acting as homeomorphisms on surfaces, which we also discuss in this section. Deep 
connections between maps, universal tessellations, Galois groups, and Grothendieck’s 
dessins d’enfants are discussed in [JoSi87, JoSi96, LaZv04]. The tessellations are re- 
garded as infinite maps, even though, by our definition, a map is a finite cell complex. 


DEFINITIONS 


D41: The monodromy group of an oriented map M with rotation system 
(D;R,L) is the subgroup Mont(M) = (R,L) of the symmetric group Sym(D) of 
all permutations of D. 


D42: A homomorphism between oriented maps M, = (D;; Ri, 11) and Mz = 
(D2; Ro, Lz) is a mapping f: M; > M2 taking D; + Dz such that fR, = R2f and 
fly = Lof. 


D43: An automorphism of an oriented map M = (D;R,L) is a permutation f 
of D such that fR = Rf and fL = Lf. The automorphisms form a group Autt(M) 
under composition, called the automorphism group of an oriented map. 


D44: The monodromy group of a map M with crystallization (F'; 70,71, 72) is the 
subgroup Mon(M) = (70,71,72) of the symmetric group Sym(F) of all permutations 
of F. 


D45: A homomorphism f: M, > Mp between maps M = (Fi;70,71,72) and 
N = (F; po, p1, p2) is a mapping F, > F2 such that f7; = p;f for each i € {0, 1, 2}. 


D46: An automorphism of a map M = (F;70,71,72) is a permutation f of the set 
F of flags such that 7;f = f7; for each i € {0,1,2}. The automorphisms form a group 
Aut(M) under composition, called the automorphism group of a map. 


D47: The tessellation {p,q} is the unique tessellation of the sphere or plane into 
regular p-gons, g incident at each vertex. This is a tiling of the sphere if 1/p+1/q > 1/2, 
of the Euclidean plane if 1/p+1/q = 1/2, or of the hyperbolic plane if 1/p+1/q < 1/2. 


D48: The triangle group A*(p,q,2) is the group with presentation 
ay | Sar > ah). (7.6.5) 
The group At = A*(co, 00, 2) is the universal triangle group. 


D49: The extended triangle group A(p, q,2) is the group with presentation by three 
generators Xo, £1, £2 and the relations 


(x9, 21,22 | 24 = 27 = «2 = (aor1)” = (2122)! = (4220)? = 1). (7.6.6) 
The group A = A(ov, 00, 2) is the universal extended triangle group. 


D50: A Belyi function f: S — So is a meromorphic function from a (closed) Rie- 
mann surface S onto the Riemann sphere So with at most three singular values forming 
a subset of {0, 1, co}. 


D51: A Shabat polynomial is a complex polynomial with at most two critical values. 
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D52: A dessin d’enfant, or briefly dessin, is a bipartite map with a fixed bipartition 
(coloring) of its vertices. 


D53: The trivial dessin T is a map on the Riemann sphere whose vertices are 0 and 
1 and the unique edge is formed by the unit interval [0,1]. By definition, 0 is colored 
black and 1 white. The point oo is the center of the single face of Z. 


D54: The universal dessin is a dessin J on the extended complex upper half plane 
U ={z€C| Im(z) > 0} UQU {oo} whose vertex-set consists of all rational numbers, 
with a/b is joined to c/d by an edge drawn as a hyperbolic geodesic (half-circle) if and 
only if ad — bc = +1. 


EXAMPLES 


E13: Both maps M and M* on the torus shown in Figure 7.6.2 are coverings of the 
tetrahedral map in Figure 7.6.7. The covering by M is branched at face centers and 
the covering by M™* is branched at vertices. Both are 2-fold coverings, that is, each 
non-singular point of the sphere is covered by two points of the torus. 


E14: Figure 7.6.13 shows all the (hyperbolic) mirrors of reflection symmetries of the 
tessellation {6,4} (or {4,6}). These lines form a Coxeter complex, a subdivision of 
the hyperbolic plane into triangles — flags of the infinite map. 


Figure 7.6.13: Reflection symmetries of the hyperbolic tessellation {6, 4}. 


E15: Figure 7.6.14 shows three bicolored plane trees whose Belyi functions are Shabat 
polynomials of the form P(2) = «3(x — 1)?(a2 — a), where a is a root of the equation 
25a® — 12a? — 24a — 16 = 0. The trees form an orbit under the action of the universal 
Galois group ['(Q/Q). For more details see [LaZv04, Subsection 2.2.2.3]. 
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Figure 7.6.14: Conjugate trees over a cubic field. 
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REMARKS 


R30: It is usual to identify the automorphism group of a map M with the automor- 
phism group H of the corresponding crystallization, as done in Definition D46. To be 
more precise, the topological automorphism group [ contains a normal infinite subgroup 
N <iT fixing each flag of M, for which H = T/N. Another equivalent definition of H 
for maps with valency of each vertex at least 3 is as follows: H is isomorphic to the 
group of all graph automorphisms that extend to map automorphisms. In what follows 
the group H will be denoted Aut(M). 


R31: Given a map on an orientable surface, we identify its group of orientation preserv- 
ing automorphisms [t+ with the automorphism group H* of the corresponding rotation 
system. Equivalently, H+ consists of the automorphisms of the underlying graph which 
extend to orientation preserving map automorphisms. In what follows the group H* 
will be denoted Aut*(M). 


R32: The subgroup (7021, 7122) < A(p,q, 2) of index two is isomorphic to A*(p, q, 2) 
for all parameters p and q. 


R33: The supporting surface of a homomorphic image of an oriented map is a closed 
orientable surface. Let f: S, — S);, be an orientation-preserving covering between 
orientable surfaces. If M is a (topological) map on S,, then f(J/) is a map without 
semiedges if and only if f(x) 4 f(Lx) for every dart of M. It is sometimes convenient 
to consider maps whose underlying graphs have semiedges; see [MaNeSk02, NeSk97]. In 
terms of rotation systems this means allowing the dart-reversing involution L to have 
fixed points. 


R34: The supporting surface of a homomorphic image of a map may have a non-empty 
boundary. For instance, a quotient of a cycle C' imbedded in the sphere by the group 
of order two generated by a reflection fixing the cycle point-wise is an imbedding of C 
into the disk. It is sometimes useful to have the set of maps closed under homomorphic 
images. In terms of crystallizations this means allowing the involutions 79, 7, T2, and 
ToT2 to have fixed points; for more details see [BrSi85]. 


FACTS 


F57: By connectivity, the monodromy group of a map is transitive on the set of flags. 
Similarly, the oriented monodromy group is transitive on the set of darts of an orientable 
map. 


F58: By Fact F57, every homomorphism between maps, or between oriented maps, is 
surjective. 


F59: The automorphism group Aut(M) of a map M is the centralizer of the mon- 
odromy group Mon(M) of M. Similarly, the group of orientation preserving automor- 
phisms Aut*(M) of an oriented map M is the centralizer of the oriented monodromy 
group Mont (M) of M. 


F60: By Fact F59, the group Aut(M) acts freely on the flags of M, and if M is 
orientable, then Aut*(M) acts freely on the darts of M. It follows, in turn, that 
|Aut(M)| < |F'| = 4e, where e is the number of edges of M. Similarly, if M is an 
oriented map, then | Aut*(M)| < |D| = 2e. 
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F61: The automorphism group of the tessellation {p,q} is isomorphic to the extended 
triangle group A(p,q,2). The generators xo, 21, and x (and their conjugates) cor- 
respond to reflections in the three sides of a flag, as described in Example E14; the 
products #122, %oX2, and xox, (and their conjugates) correspond to rotations about 
vertices, midpoints of edges, and face centers, respectively. 


F62: Every map M has a covering by a tessellation {p,q} for some p and gq. In other 
words, every map M is a quotient of a tessellation {p,q} by a subgroup Hy, of the 
group A(p,q,2). The subgroup Hy is determined by M up to conjugation. 


F63: [Vi83a] The automorphism group Aut(M) of any map M is isomorphic to the 
quotient Na(Hy)/Hm, where Na denotes the normalizer and where Hy, is a subgroup 
of A of index 4e from Fact F62. A similar statement holds for Aut*(M) and At. 


F64: Conjugacy classes of torsion-free subgroups of finite index 4e of A are in a one-to- 
one correspondence with the isomorphism classes of maps with e edges. Furthermore, 
conjugacy classes of torsion-free subgroups of finite index 4e of A(p, q,2) are in a one- 
to-one correspondence with the isomorphism classes of maps of type {p, q} with e edges 
[JoSi78, BrSi85]. 
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Figure 7.6.15: The regular self-dual map {5,5}3 and its universal cover {5,5}. 


F65: Every map M of type {p,q} admits a smooth covering by the tessellation {p, q}. 
For example, the map on Ns of type {5,5} in Figure 7.6.15 is covered by the tessellation 
{5,5} of the hyperbolic plane. (The map is obtained by identifying like labeled edges 
in the figure.) 


F66: [Bi72] The automorphism group of an orientable map of genus g > 1 can be 
faithfully represented in the group of 2g x 2g symplectic matrices with integral entries. 
From this fact it can be proved, for example, that if a prime p divides |Aut(M)|, then 
the genus of the map M is either 1, 2, or at least (p— 1)/2. 


F67: [Hul892] Hurwitz bound: If a finite group IT acts on a surface S of Euler 
characteristic x(S) < 0 as a group of its automorphisms, then |[]| < —84y(S). In 
contrast, surfaces of Euler characteristic y > 0 admit actions of groups of arbitrarily 
large order. The bound is satisfied with equality if and only if I is the automorphism 
group of a regular map of type {3,7} or {7,3} (see §7.6.7 for the definition of a regular 
map). This is a direct consequence of the Hurwitz bound. 
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F68: The Hurwitz bound is achieved for infinitely many surfaces; on the other hand, 
there are infinitely many surfaces where the bound is not achieved [Co90, Col0]; see 
also Fact F87 and Remark R40. 


F69: The automorphism group of the universal dessin is the free group of rank two. 
F70: Every dessin covers the trivial dessin and is covered by the universal dessin. 


F71: If M is a map on an orientable surface, then there is an associated dessin D(M) 
whose black vertices are the vertices of M and white vertices are the centers of edges 
of M. 


F72: For every map M on an orientable surface S$ it is possible to introduce the 


structure of a Riemann surface on § such that the map coverings T 4 D(M) Bots 5 
are meromorphic functions with singular values only at vertices and face centers; in 
particular, 6 is a Belyi function [LaZv04]. 


F73: A dessin is tree-like if and only if its Belyi function is a Shabat polynomial 
[LaZv04, Theorem 2.2.9.]. 


REMARK 


R35: Fact F72 implies that the supporting surface of any map M can be assumed 
to carry the structure of a Riemann surface such that Aut(M) acts as a group of 
conformal homeomorphisms [JoSi87, LaZv04]. The edges of Gy are geodesics of equal 
length with respect to a Riemannian metric of constant curvature (defined everywhere 
except perhaps at finitely many singular points located at vertices and face centers) and 
the angles formed by successive edges incident with a vertex are equal. 


7.6.7 Highly Symmetrical Maps 


Regular maps, those enjoying the greatest symmetry, are analogues of the Platonic 
solids on surfaces of higher genera. Cayley maps and vertex-transitive maps are less 
restrictive, but still very interesting classes of maps. Both regular and Cayley maps can 
be considered as visualizations of discrete actions of groups on surfaces, and therefore 
naturally appear in the investigation of symmetries of surfaces and related objects. 


DEFINITIONS 


D55: An oriented map M with rotation system (D;R,L) is ortentably regular if 
Autt (M) acts transitively on D. 


D56: An oriented map M = (D;R,L) isomorphic with its mirror image M7! = 
(D; R~',L) is called a reflexible map; otherwise M is a chiral map. 


D57: A map M with crystallization (F;70,7,72) is a regular map if Aut(M) acts 
transitively on the set of flags F’. 
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D58: Let I be a finite group with generating set X such that 1 ¢ X and X = X~}, 
and let p be a cyclic permutation of X. A Cayley map CM(T,X,p) for a group 
I is an oriented map (D;R,L) with D=T x X, and with R(g,x) = (g,p(x)) and 
L(g, x) = (gx,x~*) for every dart (g,x) € D. 


D59: The fiber transformation group FT(f) of a map homomorphism (covering) 
f: M — N is the group formed by all map automorphisms g € Aut(M) such that 
fg =f. A homomorphism (covering) f: M— N is a regular covering if FT(f) acts 
transitively on the fiber f~'(y) for some flag y of N. 


D60: A skew-morphism f of a group H is a bijection H > H satisfying the following 
conditions: 

1. fQ) =1, 

2. flay) = f(x) f™™(y), 
where 7: H > Z is an integer-valued function and f™*) is the m(2)-th power of f in 
the symmetric group Sym(H). 


EXAMPLES 


E16: Every orientably regular map on the sphere is regular. The spherical regular 
maps are precisely the 2-skeletons of the five Platonic solids (see Figure 7.6.7), which 
coincide with the tessellations {3,3}, {3,4}, {4,3}, {3,5}, {5,3}, and two infinite fam- 
ilies of non-polyhedral maps {p, 2} and {2,p}, with p > 0, formed by imbedded cycles 
and their duals. 


E17: Since every map on the projective plane has a smooth 2-fold covering by a map 
on the sphere (Remark R8), it follows from Example E16 that there are four regular 
maps on the projective plane of types {3, 4}, {4, 3}, {3, 5}, {5,3} and infinite families of 
types {p, 2} and {2,p} with p > 0. 


E18: Regular and orientably regular maps on the torus are finite quotients of the Eu- 
clidean tessellations {4,4}, {6,3} and {3,6}. Their classification appears in [CoMo84]. 
There are infinitely many maps in each of these types. These include the maps {3, 6}4 
and {6,3}, depicted in Figure 7.6.2 and defined in Fact F88. 


E19: The Kepler—Poinsot regular star polyhedra, shown in Figure 7.6.16, are self- 
intersecting realizations of regular maps. These maps are {5,5|3} (12 pentagons on a 
surface of genus 4 — great dodecahedron and small stellated dodecahedron), {5,3}10 
(12 pentagons on the torus — great stellated dodecahedron), and {3,5}19 (20 triangles 
on the torus — great icosahedron). 


Figure 7.6.16: Star polyhedra. 
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E20: [ScWi85, ScWi86] From the history of automorphic functions come two regular 
maps of genus 3, the 1879 Klein map {7,3}s composed of 24 heptagons with auto- 
morphism group PGL(2,7), and the 1880 Dyck map {8,3}¢ composed of 12 octagons 
(shown in dual form in Figure 7.6.17). The Coxeter regular skew polyhedra in 
E* also provide examples of regular maps; they are {4,6|3}, {6,4|3}, {4,8]3}, and 
{8,4|3}. The Klein, Dyck, and Coxeter maps all have realizations in E°. 


Figure 7.6.17: The dual of Dyck’s map {8, 3}¢. 


FACTS 


F74: For an orientably regular map M with e edges |Aut*+(M)| < |2e| < |Mon*t(M)|. 
A map M is orientably regular if and only if |Autt(M)| = |2e| = |Mont(M)|. In an 
orientably regular map M, both the action of Autt(M) and the action of Mont(M) 
are regular. 


F75: For a map M with e edges, |Aut(M)| < |4e| < |Mon(M)| and M is regular if 
and only if |Aut(M)| = |4e| = |Mon(M)|. In particular, in a regular map M both the 
action of Aut(M) and the action of Mon(M) are regular on the flags of M. 


F76: An orientably regular map is regular if and only if it is reflexible. 


F77: [Vi83b, Wi78] A nonorientable map is regular if and only if its antipodal double 
cover is regular. 


F78: Regular maps with e edges are in a one-to-one correspondence with normal 
torsion-free subgroups of A of finite index 4e. Regular maps of type {p,q} with e edges 
are in a one-to-one correspondence with normal torsion-free subgroups of A(p, q,2) of 
finite index 4e. 


F79: Orientably regular maps with e edges are in a one-to-one correspondence with 
normal torsion-free subgroups of A* of finite index 2e. Orientably regular maps of 
type {p,q} with e edges are in a one-to-one correspondence with normal torsion-free 
subgroups of A*(p, q, 2) of finite index 2e. 
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F80: If M is a regular map, then Aut(M) ~ A/N for some normal subgroup N <A. 
If M is a regular map of type {p,q}, then Aut(M) ~ A(p,q,2)/N for some normal 
subgroup N < A(p,q,2). Similar statements hold for the class of orientably regular 
maps and subgroups of At and At (p,q, 2). 


F81: By Fact F80, the automorphism group of every regular map can be generated by 
three involutions, two of which commute. The automorphism group of every orientably 
regular map can be generated by two elements, one of which is an involution. 


F82: On each orientable surface S, there is a regular map. It is provided, e.g., by 
imbedding the bouquet of g circles into S, with a single face. However, if the underlying 
graphs are required to be simple, or maps to be chiral, then there are infinitely many 
gaps in the genus spectra in both cases [CoSiTu10]. 


F83: Not every nonorientable surface supports a regular map; for example, there are 
no regular maps on the nonorientable surfaces of genus 2 and 3. There are infinitely 
many nonorientable surfaces supporting no regular map [BrNeSi05, CoPoSi10]. 


F84: For any surface with Euler characteristic x < 0, there are at most finitely many 
regular maps. This follows from the Hurwitz formula stated in Fact F67 and from 
Fact F60. 


F85: For surfaces with Euler characteristic x > —600 all regular and orientably regular 
maps have been classified [Col2a, Col2b, CoDo01]. 


F86: A classification of regular maps on surfaces with Euler characteristic (5) = —p, 
—p*, and —3p, and of orientably regular maps with Euler characteristic —2p, where p 
is a prime, was accomplished in [BrNeSi05, CoPoSil0, CoSiTu10, CoNeSi12). 


F87: There are infinitely many regular maps of type {p,q} for each pair (p,q) such 
that 1/p + 1/q < 1/2. In fact, they may be chosen to have arbitrarily large face-width 
[Vi83b, NeSkO1]. 


F88: Two special cases have received particular attention, the regular maps {p, q},. 
where the single relation (797172)" has been added and the regular maps {p, q| m} where 
the single relation (7)717271)™ has been added to the presentation of A(p, q, 2). Coxeter 
and Moser [CoMo84] have provided partial tables of parameters p,q,r and p,q,m for 
which a finite regular map with those parameters exists. Figure 7.6.15 shows the regular 
map {5,5}3. It is now known exactly which of the maps {p,q}, are finite and which are 
infinite, except for one case, (p,q,r) = (3,7, 19); see [EdJu08, HaHol0]. The question 
of which maps {p, q|m} are finite was fully answered in [EdTh97]. 


F89: A connected graph is the underlying graph G of some orientably regular map if 
and only if Aut(G) contains a subgroup acting regularly on the dart-set such that the 
stabilizer of each vertex is cyclic [GaNeSiSk99]. 


F90: If a connected graph G underlies a regular map, then the map automorphism 
group Aut(G) contains a subgroup T° acting transitively on the dart-set of G such that 
the vertex-stabilizer is dihedral and the edge-stabilizer is isomorphic to the Klein four- 
group Z2 x Zz. The existence of a subgroup [ < Aut(G) satisfying the above conditions 
is sufficient for G to be the underlying graph of some regular map [GaNeSiSk99]. 
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F91: The complete graph K, underlies an orientably regular map if and only if n is a 
prime power. It underlies a regular map if and only if n = 2, 3, 4, or 6 [Ja83], [Wi89]. For 
every n = p* there are exactly ¢(p* —1)/k non-isomorphic orientably regular imbeddings 
of K,, where ¢ is the Euler function [Bi71, JaJo85]. 


F92: Regular and orientably regular imbeddings were classified for complete bipartite 
graphs [Jo10], n-dimensional cubes [CaCoDuKwNeWi11], and for some other families of 
graphs. In particular, for each n there exists at least one (orientably) regular imbedding 
of K,,, and @,, and for n = p a prime number, there exists exactly one regular 
imbedding of Kp,p. 


F93: Every Cayley map of a group [ is vertex-transitive, with [ acting as a group of 
map automorphisms by left multiplication. Numerous other useful results concerning 
Cayley maps can be found in [RiSiJaTuWa05]. 


F94: [Tu83] Ifa group T acts on an orientable surface S as a group of self-homeomorph- 
isms, then some Cayley graph G of TI’ imbeds in S,, and the natural action of T on G (by 
left multiplication) extends to an action of T on S. In other words, there is a Cayley 
map CM(T, X,p) with the supporting surface S. 


F95: Biggs [BiWh79, Theorem 5.3.71] proved that if a group T has an automorphism 
w whose restriction on a set X of generators of T is p, then the Cayley map CM(T, X, p) 
is orientably regular. In such a case the identity p(a~') = (p(x))~' holds. Following 
[SkSi92], a Cayley map satisfying this condition is called a balanced Cayley map. 
Skoviera and Sirai [SkSi92] proved that a balanced Cayley map CM (I, X, p) is regular 
if and only if p extends to a group automorphism of I. 


F96: Jajcay and Sirdi [JaSi02] proved that a Cayley map CM(I, X,p) is orientably 
regular if and only if there is a skew-morphism w of whose restriction to X is p. 


F97: Conder and Tucker [CoTu12] classified all orientably regular Cayley maps arising 
from cyclic groups. 


F98: [Th91, Bab91] For each g > 3, there exist only finitely many simple vertex- 
transitive graphs of orientable genus g, while there are infinitely many of genus 0, 1, 
and 2. 


F99: The double torus S2 has the interesting property that there are only finitely many 
finite groups that act on 5, as self-homeomorphisms groups, but there are infinitely 
many vertex-transitive graphs (in fact, Cayley graphs) with genus 2. 


EXAMPLES 


E21: Figure 7.6.18 represents a chiral orientably regular imbedding of the complete 
graph Ks into the torus. This example belongs to a family of balanced regular Cayley 
maps whose underlying graph is the complete graph K,, where q is a prime power. Let 
F = F(q) be the additive group of the Galois field of order g, and let F* denote the 
multiplicative group of the field (which is cyclic). Let 4 be a primitive element of F, 
and let p be the cyclic permutation of F* defined by the multiplication by pw. Then 
CM (F, F*,p) is a balanced orientably regular Cayley map with underlying graph the 
complete graph K,. 
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Figure 7.6.18: The complete graph Ks imbedded as a chiral regular Cayley map of Zs 
on the torus. 


E22: Coxeter and others noticed that (orientably) regular maps frequently occur as 
coverings of smaller (orientably) regular maps on other surfaces. For example, the 
regular toroidal maps {3,6}4 and {6,3}, in Figure 7.6.2 are 2-fold coverings of the 
tetrahedral map {3,3} on the sphere. Constructions of families of (orientably) regular 
maps using coverings appear in [JoSu00], [Si01], [Vi84], [MaNeSk02], [NeSk97a], and 
[Wi78], among others. 


E23: There are various group theoretical constructions of regular maps. In fact, every 
finite 2-generator group [ = (x,y), where y? = 1, gives rise to an orientably regular 
map with Autt(M) =T. Such a map can be described by the rotation system (D; R, L) 
where D=T, R(g) = xg, and L(g) = yg for each dart g € T (compare with Facts F79 
and F80). 


E24: For every finite group T = (xo, 21,22 | 22 = x? = x3 = (xoxo)? = 1,...) there 
is a regular map M with Aut(M) =T-. It is defined by the crystallization (T; 70, 71, T2) 
where 7;(g) = x;g for each flag g € T (compare with Facts F78 and F80). 


E25: The vertex-transitive maps on the sphere with simple underlying graphs, clas- 
sified by [Fllm79], consist of the regular spherical maps and the boundary complexes 
of the Archimedean solids (semi-regular polyhedra), of the prisms, and the antiprisms. 
[Bab91] gave a classification of the vertex-transitive maps on the Klein bottle with 
simple underlying graphs. 


REMARKS 


R36: The automorphism group of a regular or orientably regular map acts transitively 
on the set of vertices, on the set of edges, and on the set of faces. 


R37: Ifa map M is given in terms of a graph encoding G = (H,¢), then the flags of 
M are in a bijective correspondence with the vertices of H. Therefore M is regular if 
and only if H is vertex-transitive. In this case, H is a cubic Cayley graph of Aut(M) 
with generator set {1,71, 72} where each 7; is an involution corresponding to the perfect 
matching of H consisting of the edges with color 2. 


R38: In 1994 Malle et al. [MaSaWe94] proved that every nonabelian finite simple 
group can be generated by two elements, one of which has order two. A stronger result 
due to Stein [St98] implies that the element of order two can be chosen arbitrarily. From 
Example E23 it follows that every nonabelian finite simple group is the automorphism 
group of some orientably regular map. For some classes of finite simple groups, all 
orientably regular maps with automorphism group in the class have been classified, for 
example, for the projective linear groups PSL(2,q) [Sah69], for Ree groups [Jo94], or 
for the Suzuki groups [JoSi93]. 
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R39: There are infinitely many nonabelian finite simple groups that do not occur as 
automorphism groups of regular maps. These are Ag, A7, Ag, $4(3) = U4(2), Mis, 
M22, Mo3, McL, further PSL(3,q) and U3(q) for all prime powers gq, and PS'L(4,q) 
and U,(q) for even prime powers q. 


R40: Orientably regular maps of type {3,7} and {7,3} are known as Hurwitz maps 
because of their relationship to the Hurwitz bound (Facts F67 and F68); their automor- 
phism groups are called Hurwitz groups. Many finite simple groups are known to be 
Hurwitz groups while others are known not to be Hurwitz groups. For example, every 
sufficiently large alternating group is a Hurwitz group. Of the 26 sporadic groups 12 
are Hurwitz groups; for details, see [Co90, Co10]. 


R41: Every group automorphism of H is a skew-morphism with constant power func- 
tion m(a) = 1 for every x € H. Classifying all skew-morphisms of a given group is a 
difficult problem in general, and is open even for cyclic groups. 


R42: Let M be a Cayley orientably regular map CM(T, X,p). Then Autt(M) is 
isomorphic to a product TZ, of T with the cyclic group Z, of order k where k = |X| is 
the valency of M. Every generator of Z, gives rise to a skew-morphism of [’. For more 
information on skew-morphisms of groups see [CoJaTu07] and [KoNe11]. 


7.6.8 Enumeration of Maps 


W. T. Tutte [Tut63] pioneered map enumeration in the 1960’s. He developed a ma- 
chinery for deriving generating functions for several classes of rooted spherical planar 
maps, and as an application, derived a closed formula for the number of rooted spherical 
maps with given number of edges. At present, there are hundreds of results about map 
enumeration. In the following text we highlight results about enumeration of maps on 
a fixed surface by the number of edges, thus extending the classical result of Tutte for 
the sphere. These results can also be viewed as results about enumeration of subgroups 
of a given index in the universal triangle groups At and A, which shows their relation 
to the classical results of M. Hall [Ha49] and others who enumerated subgroups of a 
given index in free groups. Connections between map enumeration, matrix integrals, 
and 2-dimensional quantum gravity are explained in [Zv97, BoFrGu02). 


DEFINITIONS 


D61: <A rooted map is a map in which a flag has been distinguished. A rooted 
oriented map is an oriented map in which a dart has been distinguished. There is 
a one-to-one correspondence between rooted maps on orientable surfaces and rooted 
oriented maps; in what follows we therefore use the term rooted maps. 


EXAMPLES 


E26: For the sphere, the 2-connected rooted maps with four edges are shown in the 
first row of Figure 7.6.19. The first four of these comprise all 2-connected rooted maps 
with three vertices and three faces. The roots are in boldface. 


E27: In the second row of Figure 7.6.19 there are the rooted near triangulations with 
four inner faces and a root face with two edges. 
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Figure 7.6.19: Counting maps on the sphere. 


FACTS 


F100: [Tut63] The number of rooted maps on the sphere with n > 0 edges is 


2.3"(2n)! 
mo(n) = Cea 


F101: [Tut63] The number of 2-connected rooted maps on the sphere with n > 1 edges 
is 

2(3n — 3)! 

ni(2n — 1)!" 


F102: [N. Wormald] (see [GoJa83]) The number of 2-edge-connected rooted maps on 
the sphere with n > 0 edges is 


2(4n + 1)! 
(n+ 1)\(3n + 2)! 


F103: [BrTu64] The number of 2-connected rooted maps on the sphere with n > 1 
vertices and k > 2 faces is 


(Qn+k—5)!\(2k+n-—5)! 
(n — 1)'(k — 1)!(2n — 3)'(2k — 3) 


F104: [Tut63] A generating function for the number of 3-connected rooted planar maps 
with n > 3 edges was derived by Tutte. 


F105: A generating function for the number of rooted maps on a surface of genus 
g > 1 and with a given number of edges is known up to the coefficients of a polynomial 
of degree at most 6g — 6 [ArGi99, WaGil3]. In particular, the formula is known for 
g < 11; see [Ar87] for g = 1, [BeCa91] for g = 2 and 3, [MeGil1] for g = 4, [WaGil13] 
for g = 5 and 6, and [WaGiMel2] for 7 < g < 11. 
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F106: A generating function that counts rooted maps of genus g by the number v 
of vertices and by the number f of faces is the following formal power series in two 
variables u and w: 


M,(w,u) = °a mg(v, fyw’ul. 


v,f21 


In [Ar85] the function Mo(w, u) is determined by the following system of equations: 
Mo(w,u) = pq(1 — 2p — 2a), 


w = p(1 — p— 29), 
u = q(1— 2p — q). 


F107: [Li81, Li85) The number of isomorphism classes of oriented spherical maps with 
n edges is given by 


Lo(n) = = ma(n) + ote/a)(“F ” mote) (AS +2(7] n) mo ([5-1]). 


d|n 
d<n 


where mo(n) is the number of rooted spherical maps and ¢(d) is the Euler function. 


F108: A formula for the number of isomorphism classes of oriented maps of genus g 
with a given number of edges is derived in [MeNe06]. The formula is applicable provided 
that the numbers m,(n) of rooted maps with n edges of genus y < g are known. 


F109: Rooted maps and isomorphism classes of maps and of oriented maps with a 
given number of edges regardless of genus are enumerated in [BrMeNe10]. 


F110: In [BeCa86] it is proved that m,(n) ~ tgn°9—-))/212", where ty is a constant 
computable via non-linear recursions. The first two values are to = we and t; = oe In 
[Ga93] an asymptotic pattern for the number of rooted maps with n edges is determined 
for numerous classes of maps, extending the previously known results. 


F111: In [RiWo95] it is proved that between n-edged maps on a fixed surface those 
admitting a non-trivial symmetry form an exponentially small part. It follows that 
almost all maps on a fixed surface are rigid. 


F112: In [DrNe11] formulas for maps regardless of genus are analyzed. In particular, it 
is proved that log A(n) ~ § log U(n) ~ (n/2) logn, where A(n) and U(n) is, respectively, 
the number of reflexible and the number of all maps with n edges. It follows that for 
sufficiently large n, the number of reflexible maps is about the square root of the number 
all n-edged maps. 


REMARKS 


R43: By Whitney’s theorem (see Fact F8) enumeration of planar 3-connected graphs 
and that of planar 3-connected maps are equivalent problems. 


R44: The numbers of rooted oriented maps, and of rooted maps, with n edges are the 
same as the numbers of torsion-free subgroups of the group universal triangle group At 
of index 2n, and of the universal extended triangle group A of index 4n, respectively. 
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R45: The numbers of isomorphisms classes of rooted oriented maps, and of maps, with 
n edges are the same as the numbers of conjugacy classes of torsion-free subgroups of 
the group At of index 2n, and of the group A of index 4n, respectively. 


R46: Similar statements as in Remarks 44 and 45 hold when the genus of the underlying 
surface is fixed. 


R47: Rooted maps were also enumerated with respect to the number of vertices and 
faces; see for instance [WaGil3, WaGiMel12]. In particular, the generating function for 
maps by number of edges is M,(z,z), where M,(w,u) is the generating function for 
maps by number of vertices and faces from Fact F106. 


R48: The “Quotient Map Method” elaborated for deriving the formula for the number 
of isomorphism classes of oriented maps (see Fact F107) was applied to deriving similar 
formulae counting nonisomorphic planar maps of several other natural classes including 
2-connected maps [LiWa83], Eulerian maps, and loopless maps [LiWa04]. For more 
information see [Li04] and references included therein. 


R49: With every map one can associate a triangulation, or a quadrangulation, for 
example by employing the barycentric subdivision, or the dual of the medial. Enumer- 
ation of triangulations and quadrangulations is therefore of particular interest; for a 
sample of results, see, for instance, [GaWo02, ViPel10]. 


R50: Asymptotic behavior of maps on a fixed surface was investigated in [BeCa86, 
BeCaRi93, Ga93, RiWo95, Li99]. 


PROBLEM 


RP3: Derive a generating function for the number of isomorphism classes of spheri- 
cal maps with a given number of edges (a map and its mirror image are regarded as 
isomorphic). 
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INTRODUCTION 


Consider the graph C99 < C99, imbedded on the torus as a grid so that all faces are 
quadrilaterals. For any vertex v, the graph induced by all vertices of distance at most 
49 from v is imbedded exactly as if it were in the plane. In other words, it is locally 
planar, so it may share some properties of planar graphs. Representativity measures 
the extent of local planarity of an imbedded graph. Alternatively, an imbedded graph 
with large representativity may reveal properties of its surface. 

The theory of representativity was first introduced by Robertson and Seymour 
[RoSe88] in their work on graph minors, although hints occur in earlier works. See 
[GrTu87] and [MoTh01] for some of the proofs and for general reference in topological 
graph theory and in representativity. 


7.7.1. Basic Concepts 


There are several different, but related, ways to measure the local planarity of a graph 
imbedded on a non-spherical surface. We present these measures and their relations. 


DEFINITIONS 


D1: An open face (sometimes “open” is omitted) of a graph imbedding G > S isa 
connected component of S — G. The set of faces is denoted F(G — S), or sometimes, 
simply F. 
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D2: The union of a face f and its boundary walk is denoted f and is called a closed 
face. 


D3: A cellular imbedding is one where each face is homeomorphic to 2-dimensional 
Euclidean space. 


D4: Ina circular imbedding, the boundary of every face is a simple cycle. 


D5: An imbedding is a polyhedral imbedding if it is circular and the intersection 
of any two closed faces is a path. (Equivalently, we observe that each vertex has a 
wheel-neighborhood — with a possibly subdivided rim.) 


D6: A cycle C on a surface S is contractible if it separates the surface, and if one 
side of the separation is a disk. 


D7: If S is not a sphere, then the interior of a contractible cycle C, which is 
denoted int(C), is the side that is a disk. 


Ds: A k-nest of disjoint contractible cycles is a sequence C1,...,Cx such that C; C 
int(Ci41). 


D9: The edge-width ew(G), of a graph imbedding G > S is the (graph-theoretic) 
length of the shortest cycle in the graph that is non-contractible on the surface. 


D10: The face-width fw(G) of a graph imbedding G > S (alternatively the rep- 
resentativity, denoted p(G)) is the minimum cardinality |C 7 G|, taken over all non- 
contractible cycles C in S. That is, it is the smallest number k such that there exists 
faces f,,..., fg with a non-contractible cycle C contained in f,U---U f,. (The cycle C 
may intersect vertices, not just edges.) Informally, width is a synonym. 


D11: An imbedded graph is a dense imbedding on the surface is it has large face- 
width, with “large” to be interpreted in context. 


D12: The medial graph M(G) of a graph imbedding G > S is the imbedded graph 
whose vertex set is E(G), and whose edge set joins each pair of vertices representing 
consecutive edges in a face boundary of G > S. 


D13: The radial graph R(G) is the imbedded dual of the medial graph. Equivalently, 
the radial graph has vertex set V(G)U F(G > S), with edges joining incident elements. 
The radial graph is also called the vertex-face graph, or the vertex-face incidence 
graph. 


D14: The Euler genus ¥(S) of a surface S is twice the number of handles if the 
surface is orientable, and is the number of crosscaps if the surface is non-orientable. 
(This definition reflects the old maxim “a handle is worth two crosscaps”.) 
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EXAMPLES 


E1: The graph in Figure 7.7.1 is drawn in the torus, where the top edge is identified 
with the bottom edge and the left with the right. It has edge-width 4, as shown by 
the cycle on the bold edges. It has face-width 2, as shown by the dotted line. All line 
segments in the boundary of the square are edges in the graph; the dotted lines are not. 


Figure 7.7.1: A graph with edge-width 4 and face-width 2. 


E2: The graph in the left of Figure 7.7.2 shows K,4 imbedded in the plane. The dotted 
lines show its medial graph, the octahedron. On the right side is K4 and its radial 
graph, the cube. 


Figure 7.7.2: The medial graph and radial graph of K4. 


FACTS 


F1: In any cellular imbedding, any curve in the surface is ambient isotopic (or freely 
homotopic, i.e., can be continuously transformed) to a closed walk in the graph. More- 
over, any walk W in the graph is ambient isotopic to a curve C' in the surface that meets 
the graph only at its vertices. The number of points in CNG will not exceed the length 
of W. This gives the following two facts. 


F2: For any imbedding fw(G) < ew(G). 

F3: For triangulations fw(G) = ew(G). 

F4: An imbedded graph M is the medial graph of some graph G if and only if M is 
4-regular and the faces can be properly 2-colored. 


F5: An imbedded graph R is a radial graph of some graph G if and only if R is bipartite 
and every face is a quadrilateral. 


F6: The medial graph M of an imbedded graph G is identical to the medial graph of 
the dual G*. The imbedded graph and its dual are the only two graphs whose medial 
and radial graphs are M and R, respectively. 


F7: The face-width of an imbedded graph G is equal to half the edge-width of its radial 
graph, i.e., fw(G) = ew(R(G)) /2. 
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F8: The face-width of an imbedded graph G is equal to the face-width of its dual G*, 
ie., fw(G) = fw(G*). 


F9: An imbedding of a graph G is cellular if and only if G is connected and fw(G) > 1. 


F10: The following are equivalent: 1) an imbedding of a graph G is circular, 2) G is 
2-connected and fw(G) > 2, and 3) the radial graph R(G) has no multiple edges. 


F11: The following are equivalent: 1) an imbedding of a graph G is polyhedral, 2) G 
is 3-connected and fw(G) > 3, and 3) the radial graph R(G) has no multi-edges and 
every 4-cycle bounds a face. 


F12: Ina polyhedral imbedding of a 3-connected graph, the face boundaries are simple 
chordless cycles. 


F13: [RoSe88] 1) Let v be a vertex of an imbedded graph G and let k = |(fw(G)—1)/2]. 
Then there exists a k-nest with v € int(C1). 2) Let f be an open face of an imbedded 
graph G and let k = | fw(G)/2| — 1. Then there exists a k-nest with f € int(C)). 


See [Mo97] for additional results on face-width. 


7.7.2 Coloring Densely Imbeddable Graphs 


All planar graphs are vertex-4-colorable. Nonplanar graphs may have arbitrarily 
high chromatic number; for instance, the chromatic number of the complete graph K,, 
is n. However, all imbeddings of complete graphs have many non-contractible trian- 
gles, which implies that they have edge-width 3, so complete graphs are not densely 
imbeddable. As an illustration of the relationship between high representativity and 
planarity, we consider upper bounds on the chromatic numbers of graphs with imbed- 
dings of sufficiently high representativity. 


Coloring with Few Colors 


We first consider two 5-color theorems for graphs with imbeddings of high edge-width. 
If we allow six colors, then the required edge-width drops significantly. 


FACTS 


F14: [AlSt82] (combined with [Th94a]) If a graph G has a toroidal imbedding such 
that ew(G) > 4, then G is 5-colorable. 


F15: [Th93] If a graph G admits an imbedding G > S, such that ew(G) > 29*°, 
then G is 5-colorable. 


F16: [Fi78] Suppose that a graph G has exactly two vertices of odd degree, that these 
two vertices are adjacent, and it is imbeddable so that every face is a triangle. Then G 
is not 4-colorable. 


F17: [FiMo94] There is a constant c such that every graph G imbeddable on a surface 
of Euler genus g > 0 with ew(G) > clog g is 6-colorable. 


F18: [Th97] For each fixed surface S there is a polynomial-time algorithm that decides 
if a given graph imbedded on that surface is 5-colorable. 
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REMARK 


R1: It is not hard to construct imbeddings on any surface with exactly two vertices of 
odd degree that are adjacent, as in Fact F16, so the 5-color theorem of Fact F15 is best 
possible. 


Coloring Graphs that Quadrangulate 


We next examine the chromatic number of imbedded graphs with no face a triangle, 
or more specifically, where every face is of even length. These generalize the classical 
theorem of Grotzsch, which says that every planar graph of girth at least 4 is 3-colorable. 


DEFINITION 


D15: A quadrangulation is a graph imbedding such that each face is a quadrilateral. 


FACTS 


F19: If every face of a planar graph G is bounded by a cycle of even length, then G is 
bipartite (and hence has chromatic number 2). 


F20: The complete graph Ks is a non-bipartite graph that can quadrangulate the 
torus $;. By iteratively applying cut-and-paste techniques to this imbedding, we can 
construct quadrangulations of all of the higher surfaces S,, by non-bipartite graphs, 
hence, of chromatic number at least 3. Similarly, any non-orientable surface has a 
non-bipartite quadrangulation. 


F21: [Hu95] There is a function f(g) such that every graph G imbeddable on an 
orientable surface of genus g with every face of even size and with ew(G) > f(g) has 
chromatic number at most 3. 


F22: [FiMo94] There is a function f(g) such that every graph G of girth at least 4 
imbeddable on a non-orientable surface with g crosscaps and with ew(G) > f(g) has 
chromatic number at most 4. 


F23: [Yo96] Let G be a graph that has a simple quadrangulation of the projective 
plane. Then G has either chromatic number 2 or 4. 


F24: [Yo96] There exist quadrangulations of the projective plane of arbitarily large 
edge-width and chromatic number 4. 


F25: ({[ArHuNaNeOt01] and [MoSe99]) For any non-orientable surface there exist 
quadrangulations of arbitrarily large edge-width and chromatic number 4. 


REMARK 


R2: The fact there is no quadrangulation of the projective plane with chromatic 
number exactly 3 is surprising. So is the contrast between orientable surfaces, where a 
large width implies 3-chromatic, and non-orientable surfaces, where 4-chromatic is the 
best possible. 
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Coloring Graphs That Triangulate 


DEFINITIONS 
D16: A triangulation is a graph imbedding with every face a triangle. 


D17: An Eulerian triangulation is a triangulation of a surface such that the skeleton 
is Eulerian. 


FACTS 
F26: A graph that triangulates the plane is 3-colorable if and only if it is Eulerian. 


F27: [HuRiSe02] There is a function f(g) such that every Eulerian triangulation G of 
an orientable surface of genus g with ew(G) > f(g) has chromatic number at most 4. 


F28: [Na09] There is a function f(g) such that every Eulerian triangulation G of a 
non-orientable surface of genus g with ew(G) > f(g) has chromatic number at most 5. 


F29: [ArHuNaNeOt01] For every non-orientable surface S and every k there exists an 
Eulerian triangulation G with ew(G) > k and with chromatic number at least 5. 


RESEARCH PROBLEMS 


RP1: Thomassen [Th97]: Is there a surface S with chromatically 5-critical triangula- 
tions of arbitrarily large edge-width? 


RP2: Albertson [A181]: For each fixed surface S, does there exist a constant c(.S) such 
that there is a proper 4-coloring of all but c(S) vertices of any imbedded graph? 


RP3: For each fixed surface S, is there a polynomial-time algorithm that decides 
whether a graph imbedded on S is 4-colorable? 


RP4: (N. Robertson) Does there exist a constant k such that each cubic graph imbed- 
ded with face-width at least k is 3-edge-colorable? Griinbaum [Gr69] conjectured that 
k, = 3 suffices, but this has been disproved by Kochol [Ko09]. 


7.7.3. Finding Cycles, Walks, and Spanning Trees 


A fundamental result by Tutte [Tu56] says that 4-connected planar graphs are Hamil- 
tonian. We look for analogous theorems for locally planar graphs. 


DEFINITIONS 
D18: A spanning walk is a walk that visits every vertex. 


D19: A k-walk is a spanning walk that visits no vertex more than & times. 


FACTS 
F30: A graph has a Hamiltonian path if and only if it contains a spanning 1-walk. 


F31: Ifa graph G contains a k-walk, then G contains a spanning tree of maximum 
degree at most k + 1. 
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F32: [Ba66] Every 3-connected planar graph contains a spanning tree of maximum 
degree 3. 


F33: [BrElGaMeRi95] Every 3-connected planar graph imbeddedable on the torus or 
on the Klein bottle contains a spanning tree of maximum degree 3. 


F34: Results analogous to Fact F33 do not hold for surfaces of Euler genus three or 
more. In particular, K3,, quadrangulates a surface with Euler genus (n — 2)/2 when n 
is even. If n is at least 8, then any spanning tree of K3,, contains at least one vertex of 
degree at least 4. 


F35: [Yu97] Let G be a 3-connected graph imbedded on a surface of Euler genus g. If 
fw(G) > 48(29 — 1), then G contains a spanning 3-walk (and hence a spanning tree of 
maximum degree 4). 


F36: [Yu97] Let G be a 4-connected graph imbedded on a surface of Euler genus g. If 
fw(G) > 48(29 — 1), then G contains a spanning 2-walk (and hence a spanning tree of 
maximum degree 3; see also [ElGa94]). 


F37: [Yu97] Let G be a 5-connected triangulation on a surface of Euler genus g. If 
fw(G) > 96(29 — 1), then G contains a Hamiltonian cycle (and hence a spanning 1-walk 
and a spanning tree of maximum degree 2). 


F38: [ShYu02] There is a constant c and a function f(g) such that if G is a 3-connected 
graph on n vertices imbedded in an orientable surface of genus g with fw(G) > f(g), 
then G has a cycle of length at least cnl°8s(), 


CONJECTURE 


C1: There exists a function f(S) such that every 5-connected imbedding of a graph G 
on a surface S with fw(G) > f(S) is Hamiltonian. 


7.7.4 Re-Imbedding Properties 


Whitney [Wh33] proved that every 3-connected graph has an essentially unique 
imbedding in the plane, and he proved a similar theorem about imbeddings of graphs 
of connectivity 2 (see Fact F44). Do locally planar graphs have similar properties? 


LEW-Imbeddings 


DEFINITIONS 


D20: A large edge-width imbedding or LEW-imbedding is an imbedding G > S$ 
whose edge-width is strictly larger than the length of the longest boundary walk. 


D21: A rooted imbedding of a graph is an imbedding with a distinguished vertex v, 
an edge e incident with vertex v, and a face f incident with edge e. 
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FACTS 


F39: [Th90] A graph has an LEW-imbedding in at most one (homeomorphism type of) 
surface, and if such an imbedding exists, then it is a minimum Euler genus imbedding 
for that graph. 


F40: [Th90] In an LEW-imbedding of a 3-connected graph G, the face boundaries are 
the chordless cycles whose length is strictly less than ew(G). 


F41: [Th90] There is a polynomial-time algorithm that given a 3-connected graph G, 
either constructs an LEW-imbedding of G or concludes that no such imbedding exists. 


F42: [BeGaRi94] For each surface S there is a constant cg such that almost all rooted 
imbeddings G in S have ew(G) > cg log(|E(G)|). (Thus, for a fixed surface, we can 
expect the edge-width of a random imbedding to be reasonably large.) 


Imbeddings and Connectivity 


DEFINITIONS 


D22: Let G be a graph of connectivity two, and let C,,C2 be two subgraphs, each 
with a cycle, that partition the edge set of G and have only two vertices in common. 
Suppose that G is imbedded on a surface. Then we can replace the induced imbedding 
of one component, say C2, with its mirror image, as shown in Figure 7.7.3. This is 
called a Whitney 2-flip, or more succinctly, a 2-flip. 


Figure 7.7.3: A Whitney 2-flip. 


D23: Two imbeddings are Whitney-similar if one can be obtained from the other 
by a sequence of Whitney 2-flips. 


FACTS 


F43: Graphs with a cutpoint do not have unique imbeddings in the plane. If v is a 
cut-vertex of G incident with blocks B, and Bo, then By can be placed in any face of 
an imbedding of B, incident with v. 


F44:; [Wh33] Any two imbeddings of a 2-connected graph in the plane are Whitney- 
similar. 


F45: [Th90] If a 2-connected graph G has an LEW-imbedding on a surface S, then 
any other imbedding of G on S is Whitney-similar to it. 


F46: [Th90] Let G be a subdivision of a 3-connected graph with an LEW-imbedding 
in a surface S. Then G is uniquely imbeddable in S$, up to homeomorphism of pairs. 


868 Chapter 7. Topological Graph Theory 


Imbeddings and Genus 


We ask when an imbedding of sufficiently large face-width is a minimum genus imbed- 
ding and when it is unique. 


FACTS 


F47: [SeTh96] Let G be a graph imbedded on a surface of Euler genus g. If fw(G) > 
100 log g/ log log g, then G' does not imbed on any surface of smaller Euler genus. 


F48: [Ar92] For each integer & there is a graph G;, that has two imbeddings on two 
different surfaces each of face-width at least k. (Hence in Fact F47 the bound on the 
face-width cannot be replaced by a constant.) 


Re-Imbedding Results 


DEFINITION 


D24: Let G be a graph imbedded on the Klein bottle. The following planarizing 
orders measure how difficult it is to obtain a planar graph by cutting along the (image 
of the) graph. We define 

ord2(G) = min||C|/4| 


taken over all two-sided non-contractible cycles C' in the Klein bottle, where |C| denotes 
the length of C. We define 


ord, (G) = min||Ci|/4] + [|C2|/4] 


taken over all pairs of one-sided non-contractible non-homotopic cycles C1, C2. 


FACTS 


F49: [FiHuRiRo95] Let the graph G be imbeddedable in the projective plane with 
fw(G) # 2. Then the (orientable) genus of G is | fw(G)/2]. 


F50: ({RoTh91], see also [MoSc03]) Let the graph G be imbeddedable in the Klein 
bottle such that 
min{ord,(G), ord2(G)} 


is at least four. Then this minimum is the orientable genus of G. 


F51: ([RoVi90] and [Th90]) Any non-planar imbedding of a planar graph G has face- 
width less than or equal to 2. 


F52: [MoRo98] There is a function f(g) such that every 3-connected graph has at most 
f(g) imbeddings of face-width at least 3 in a surface of Euler genus g. 


F53: No such function f(g) exists if we consider 2-connected graphs, or if we consider 
imbeddings of face-width at least 2. 


For other results on re-imbedding planar graphs see [MoRo96]. 


CONJECTURE 


RP5: [FiHuRiRo95]: For each fixed non-orientable surface S', the (orientable) genus 
of graphs that imbed in S can be computed in polynomial-time. 
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7.7.5 Minors of Imbedded Graphs 


The concept of surface minors and representativity is very important in the sequence 
of papers by Robertson and Seymour culminating in the proof of Wagner’s Conjecture. 
A nice background is given in [Dil0]. A survey of results in this area is given in [Mo01]. 


Surface Minors 


DEFINITIONS 


D25: A minor of a graph G is a graph H that can be formed by a sequence of edge 
deletions and edge contractions in G. 


D26: A graph property P is hereditary under minors if whenever G has that 
property, then so does every minor of G. Alternatively, one says that the class of graphs 
with property P is closed under the taking of minors. 


D27: A surface minor of a graph imbedding G — S is an imbedding H > S$ 
constructed by a sequence of edge deletions and edge contractions in G. (If edge e is not 
a loop, then the edge-contracted graph G/e is constructed by contracting e in S. For 
the purposes of surface minors, we consider the contraction of a loop to be equivalent 
to its deletion.) 


FACTS 


F54: For each fixed surface S the property “G imbeds on S” is hereditary under 
minors. (We do not require that these imbeddings are cellular.) 


F55: For each fixed surface S the property “fw(G) < k” is hereditary under surface 
minors. 


The following results are fundamental in the theory of surface minors. 


F56: [RoSe86] For any plane imbedding of a graph G, there is a number & such that 
G is a surface minor of the natural plane imbedding of the Cartesian product Py x Py. 
See [DiEaTaTo94] for additional references. 


F57: [RoSe88] Let G > S be a graph imbedding on a surface other than the sphere. 
Then there is a number k such that every graph imbedding on S with face-width at 
least & has the imbedding G —> Sas a surface minor. 


Finding Imbedded Cycles 


Fact F57 ensures the existence of any imbedded minor, provided that the width of the 
given imbedding is large enough. However, the proof is existential and does not provide 
explicit bounds. Such bounds are known for certain types of graphs. 


DEFINITION 


D28: A set of cycles C),...,C, in a graph G imbedded on a surface S is a planarizing 
collection of cycles if cutting along all cycles C; simultaneously yields a connected 
graph imbedded in the plane. 
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FACTS 


F58: [Sc93] Every graph imbedded on the torus with face-width & contains [3k/4| 
disjoint non-contractible cycles. (Note that any two disjoint non-contractible cycles in 
the torus are homotopic.) 


F59: [BrMoRi96] Every graph imbedded on a surface with face-width k contains 
|(kK — 1)/2]| disjoint non-contractible homotopic cycles. 


F60: [Th93] Let S be an orientable surface of genus g and let G be a triangulation with 
ew(G) > 8(d+ 1)(29 — 1). Then G has a planarizing collection C1,...,C, of chordless 
cycles such that any two of these cycles are of distance at least d. 


F61: [Yu97] Let S be a surface with Euler genus g and let G be an imbedded graph 
with fw(G) > 8(d+1)(29—1). Then for some k (with g/2 < k < g), G has a planarizing 
collection C;,...,C of chordless cycles such that any two of these cycles are of distance 
at least d. 


7.7.6 Minor-Minimal Maps 


Fact F54 asserts that for each fixed surface S', the property “fw(G) < k” is hered- 
itary under surface minors. We look for the minor-minimal imbedded graphs with the 
property fw(G) > k, that is, imbedded graphs with face-width k but such that the 
deletion or surface contraction of any edge lowers the face-width. Any graph imbedded 
on S with face-width at least k must contain one of these minor-minimal imbeddings 
as a surface minor. The concept of minor-minimal graphs is often useful in inductive 
proofs. 


DEFINITIONS 


D29: A minor-minimal imbedded graph is an imbedded graph such that the 
deletion or surface contraction of any edge lowers the face-width. 


D30: In a graph G, let v be a vertex adjacent to exactly three other vertices a,b,c. A 
YA-transformation deletes v and its three incident edges and adds three new edges 
ab, bc, ca. 


D31: A AY -transformation is the inverse of a Y A-transformation. 


D32: Two graphs are YAY -equivalent if there is a sequence of YA- and AY- 
transformations changing one into the other. 


FACT 


F62: For every surface S and every k > 1 the number of minor-minimal maps on S 
with face-width & is finite. 


F63: Let Gy be a graph and let Ga be formed from Gy by a Y A-transformation. If 
Gy imbeds on a surface S', then Ga also imbeds on S. The converse is not necessarily 
true, but it is true if the 3-cycle being deleted is a face boundary. (When considering 
imbedded graphs, the AY-transformation is usually restricted to 3-cycles that bound a 
face.) 
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F64: If an imbedded G is YAY-equivalent to an imbedded G’, then fw(G) = fw(G’). 
Moreover if G is minor-minimal with face-width k, then so is G’. 


F65: [Ra97] Any two graphs in the projective-plane that are minimal with face-width 
k are Y AY-equivalent. In particular, they have 2k? — k edges. 


F66: [Ba87] There are exactly two graphs in the projective plane that are minimal 
with face-width 2. They are kK, and its geometric dual. 


F67: ({Ba91] and [Vi92]) There are exactly 7 minor-minimal maps in the projective 
plane with face-width 3. 


F68: [Ba87] There are exactly 7 minor-minimal maps on the torus with face-width 2. 


F69: [Hi96] There are exactly 56 minor-minimal maps on the torus with face-width 3. 
These fall into 7 classes under Y AY-equivalence. 


Similarity Classes on the Torus 


DEFINITION 


D33: Two imbeddings are similar tmbeddings if they are related by a sequence 
of operations, each a Y A-transformation, a AY-transformation, or the taking of the 
geometric dual. 


REMARK 


R3: In general, the geometric dual of an imbedded graph is not necessarily YAY- 
equivalent to the primal imbedding. Hence the number of similarity classes might be 
smaller than the number of Y AY-equivalence classes. 


FACT 


F70: [Sc94] For odd k there are exactly (k? + 5k)/6 similarity classes of maps on the 
torus with face-width k. For even k there are exactly (k? + 8k)/6 similarity classes. 


Kernels 


A “kernel” for a surface is an imbedding such that deleting or contracting any edge 
lowers the face-width in some direction, in the sense that a free homotopy class of 
closed curves may be regarded as a direction. 


DEFINITIONS 


D34: Let G— S be a graph imbedding, and let C' be a curve in the surface S. We 
define the p-function ju(G,C) as 
min{|W| /2} 


where this minimum ranges over all closed walks W in the radial graph R(G) that are 
freely homotopic to C. This is similar to the face-width of G, min{|C’ M G]}, except 
that we now restrict the minimum to those curves C’ freely homotopic to C. 


D35: An imbedded graph G is a kernel if for every proper imbedded minor H, there 
is a curve C such that u(H,C) < w(G,C). 
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FACTS 

F71: p(G,C) = u(G*,C) where G* is the geometric dual. 

F72: u(G,C) is invariant under Y A-exchanges. 

F73: If H is a surface minor of G, u(H,C) < w(G,C) for any curve C. 


F74: ([Sc92] and [Gr94]) Suppose that G and G’ are kernels on the same surface such 
that u(G,C) = u(G’,C) for all curves C. Then G and G’ are similar. 
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INTRODUCTION 


Triangles can be used as elementary pieces to build up a surface. Such construction 
of a surface generalizes to a simplicial complexin combinatorial topology. In topological 
graph theory, we regard the skeleton of a triangulation as a graph dividing a surface 
into triangles. 


7.8.1 Basic Concepts 


Although we are primarily interested in closed surfaces, it is worth noting that the 
basic concepts for triangulations on closed surfaces also work for surfaces-with-boundary, 
after suitable modifications. Imbeddings are implicitly taken to be cellular. 


DEFINITIONS 


D1: A triangulation of a closed surface S is an imbedding p: G > S of a simple 
graph G, such that 


e each face is bounded by a 3-cycle, and 
e any two faces share at most one edge. 


The latter condition excludes K3 on the sphere. 


D2: A graph imbedding G > S is triangular if every face is 3-sided. (Some triangular 
imbeddings are not triangulations: e.g., perhaps the skeleton is not simple, or perhaps 
a face-boundary has a repeated edge.) 
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D3: The skeleton of an imbedding p: G — S is the image p(G) of the imbedded 
graph. Informally we write G for p(G). 


D4: A graph is said to triangulate a surface if it can be imbedded on the surface as 
a triangulation. 


D5: A triangulation G > S of a surface-with-boundary S is subject to the 
additional requirement that each boundary component of the surface is the image of a 
cycle of the skeleton. 


D6: A Catalan triangulation is a triangulation G > S of a surface-with-boundary 
such that every vertex of G lies in a boundary component of S. (E. Catalan counted 
the number of such triangulations when the surface S' is a disk.) 


D7: The link in a triangulation p : G — S of a vertex v is the cycle through 
the neighbors of v whose edges lie on triangles incident on v. It is usually denoted by 
Ik(v, 9: G > S), or simply by Ik(v). 


D8: The star neighborhood in a triangulation p:G — S of a vertex v is the 
wheel subgraph in G obtained by joining v to each vertex in its link. It is denoted by 


st(v). 


D9: A clean triangulation is a triangulation G — S such that every 3-cycle in G 
bounds a face. 


D10: Two triangulations p} : G — S and pg: G > S with the same vertex set are 
combinatorially equivalent triangulations if they have the same set of face bound- 
ary cycles. (Precisely speaking, combinatorial equivalence is for labeled triangulations. ) 


D11: Two triangulations G; — S and Gz > S are isomorphic triangulations if 
there is a homeomorphism h: S — S such that h(G,) = Go. 


D12: Two triangulations G; — S and Gz > S are isotopic triangulations if there is 
a homeomorphism h: S > S$ with h(G,) = G2 that is isotopic to the identity mapping 
on S. (Roughly speaking, this means that one can be transformed continuously on the 
surface into the other.) 


D13: An imbedding is said to be k-representative if it has face-width at least k. (See 
87.7.) 


NOTATIONS 


NOTATION: As usual in topological graph theory, S, denotes the orientable closed surface 
of genus g, and N;, denotes the non-orientable closed surface of crosscap number k. The 
Euler characteristic of a surface S is denoted by y(S). 


NOTATION: F'(G — S) denotes the set of faces of a triangulation G + S. However, we 
usually let F(G) or F' denote the set of faces when only one imbedding of the graph G 
is under consideration. 


NOTATION: Each face of a triangulation G — S' can be specified by listing the three 
vertices u,v, w at its corners. Thus, it is often identified with the triple {u,v,w}, and 
hence one may write F(G) C (VO). In other contexts, a face may be denoted by its 
boundary cycle ww. 
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EXAMPLES 


E1: The 1-skeletons of the tetrahedron, the octahedron and the icosahedron all trian- 
gulate the sphere. 


E2: The unique imbedding of K7 on the torus, as shown in Figure 7.8.1, is a triangu- 
lation. (Its dual graph is the Heawood graph.) 


6 3 
1 1 


4 Re 
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6 3 


Figure 7.8.1: The complete graph K7 on the torus. 


E3: The unique imbedding of Kg on the projective plane, as shown in Figure 7.8.2, is 
a triangulation. (Its dual is isomorphic to the Petersen graph.) 


2 ANI 


Figure 7.8.2: The complete graph Kg on the projective plane. 


E4: There exists only one 7-vertex triangulation on the torus, up to isomorphism, but 
there exist infinitely many up to isotopy. The skeleton is isomorphic to K7 as a graph. 
Twisting it along a simple closed curve yields an infinite series of those. There exist 120 
toroidal triangulations over the vertex set {1,...,7} up to combinatorial equivalence. 


E5: There exists only one 6-vertex triangulation on the projective plane, up to iso- 
morphism, and also only one up to isotopy, since any auto-homeomorphism on the 
projective plane is isotopic to the identity mapping. The skeleton is isomorphic to K¢ 
as a graph. There exist 12 projective-planar triangulations over the vertex set {1,...,6} 
up to combinatorial equivalence. 
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FACTS 


Fi: A graph with at least four vertices can triangulate the sphere if and only if it is 
maximal planar. 


F2: A graph is isomorphic to a Catalan triangulation of the disk if and only if it is 
maximal outer-planar. 


F3: A graph cellularly imbedded on the sphere is a triangulation if and only if its dual 
is 3-regular and 3-edge-connected. 


F4: A graph cellularly imbedded on a closed surface other than the sphere is a trian- 
gulation if and only if its dual is 3-regular, 3-edge-connected, and 3-representative. 


F5: The link of a vertex in the interior of the surface is a cycle, and the link of a vertex 
at the boundary is a path. 


F6: Every triangulation of any closed surface has a 3-connected skeleton. More gener- 
ally, a graph is 3-connected if for each vertex v all the neighbors of v lie on a cycle. 


F7: The skeleton G of a triangulation on a closed surface is n-connected (n = 4 or 5) 
if and only if every cycle in G that separates the surface into two pieces, each of which 
includes at least one vertex, has length at least n. 


F8: Every triangulation on any closed surface except the sphere is 3-representative. 


F9: A triangulation on a closed surface except the sphere is 4-connected, and it is 
4-representative if and only if it is clean. 


F10: A triangulation with n vertices on a closed surface S with Euler characteristic 
x(S) has exactly 3(n — y(S)) edges and 2(n — x(S)) faces. 


F11: Let G be a triangulation on a closed surface S with Euler characteristic (S$) 
and let V; denote the number of vertices of degree 7 in G. Then we have: 


>2(6 - Vi = 6x(S) 

i>3 
F12: The equivalence up to isomorphism can be rephrased combinatorially as fol- 
lows; two triangulations G'; and G2 are isomorphic if there is a graph isomorphism ¢ : 
V(G1) — V(G2) which induces a bijection y* : F(G1) > F(G2) with y*({u,v, w}) = 
{e(u), P(v), e(w)f- 


F13: Let G; — S; and Gz > So be triangulations, and let f; and fo be triangles in G; 
and Go, respectively. Discard the interiors of those two faces, and paste the boundary 
of fi to the boundary of fo, thereby producing a connected surface with an imbedded 
graph. That resulting imbedding is a triangulation. 


EXAMPLE 


E6: Let K > S and H — S” be two 6-connected triangulations on different closed 
surfaces. Let v be a vertex of degree d > 6 in K with link v,v2---vg, and choose 
two faces of H — S’ sufficiently apart from each other. Identify these with two faces 
VU v2 and vu4us of the imbedding K — S. Then the resulting triangulation has a 5-cut 
{v, v1, v2, v4, Us}, but it contains no separating cycle of length less than 6. Therefore, 
Fact F7 does not hold for n = 6. 
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7.8.2 Constructing Triangulations 


What is the minimum number of triangles needed to build up a given surface? This 
question must have been asked frequently, but it is difficult to answer it precisely. 


Triangulations with Complete Graphs 


The solution of the “Map Color Theorem” gave us the precise formulas of the genus 
and the nonorientable genus of K,,, namely, 


(Kn) = [(n — 3)(n— 4)/12] and 7(Kn) = [(n — 3)(n — 4)/6] 


The complete graph K,, triangulates a suitable surface exactly when the inside of each 
ceiling function becomes an integer. The constructions give us triangulations on many 
closed surfaces and also a hint to answer the minimum triangulation question. 


DEFINITION 


D14: A triangulation is said to be tight if 
e the skeleton G is a complete graph, and 


e for any partition of V(G) into three nonempty subsets Vi, V2 and V3, there is a 
face v1 V2v3 € F(G) with v; € Vj. 


It is unteght otherwise. 


FACTS 


F14: [Ri74] The complete graph K,, over n vertices triangulates an orientable closed 
surface if and only if n = 0,3,4 or 7 (mod 12). The genus of such a surface is equal to 
(n — 3)(n — 4)/12. 


F15: [Ri74] The complete graph K,, over n vertices triangulates a nonorientable closed 
surface if and only if n = 0,1,3 or 4 (mod 6) and n #4 7. The genus of such a surface is 
equal to (n — 3)(n — 4)/6. 


F16: [Fr34] No complete graph triangulates the Klein bottle. 
F17: [BrSt01] The minimum order of a complete graph that admits nonisomorphic tri- 
angulations on a nonorientable closed surface is 9; for the orientable case, the minimum 


is 12. 


F18: [BoGrGrSi00] The complete graph K,, triangulates an orientable closed surface 
with bipartite duals in at least gn” /54—O(n) ways if n = 7 or 19 (mod 36) and in at least 
22n*/81-O(n) ways if n = 19 or 55 (mod 108). 


EXAMPLES 


E7: [LaNeWh94] The complete graph Ky triangulates the orientable closed surface 
Sq in at least three ways. 
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E8: [BrSt01] The complete graph Kyo triangulates the nonorientable closed surface N7 
in at least 14 ways. 


E9: [ArBrNe95] The complete graph K39 triangulates the nonorientable closed surface 
Ni17 in at least 2 ways; they are tight and untight. 


REMARK 


R1: A tight triangulation is necessarily isomorphic to a complete graph as a graph. As 
a natural generalization of the tightness which works for general triangulations, a notion 
called the “looseness”, has been introduced in [NeMi96] so that a tight triangulation has 
looseness 0. One graph may be imbedded as many triangulations on a closed surface 
having different loosenesses. It has been shown in [Ne05] that there is an upper bound 
for the difference between their maximum and minimum values, depending only on the 
surface. 


Minimum Triangulations 


Here we shall show the answer to our question on the minimum number of triangles 
to build up a surface. The corresponding formula is expressed below in terms of the 
number of vertices. 


DEFINITION 

D15: A minimum triangulation of a surface is a triangulation on the surface that 
has the fewest vertices (or equivalently, the fewest faces). 

FACTS 


F19: [JuRi80, Ri55] Let Vinin(S') denote the order of minimum triangulations of a 
closed surface S. If S 4 S2, No, N3, then: 


P4749 = 2S) 
D 


Vinin(S) — 


For the three exceptions, we have: 
Vinin(S2) = 10, Vanin (2) = 8, Vinin (V3) =9 


F20: Ifthe complete graph K,, triangulates a closed surface, then the skeleton of any 
minimum triangulation is isomorphic to Ky. 


F21: [HaRi91] The minimum number of faces in a clean triangulation of S2 is 24. 
F22: {HaRi91] The minimum number of faces in a clean triangulation of S, is asymp- 
totically equal to 4g as g — oo. 

EXAMPLES 


E10: The only minimum triangulations of the sphere, the projective plane and the 
torus are the unique imbeddings of Ky, Kg, and K7, respectively. 


E11: There exist precisely six minimum triangulations of the Klein bottle, up to 
isomorphism. 
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Covering Constructions 


Shortly after the solution of the Map Color Theorem, the theory of voltage graphs (see 
87.4) provided a unified topological analysis of that solution, as a branched covering, 
and an extensive generalization of its constructive method. In fact, many triangular 
imbeddings of complete graphs constructed for the Map Color Theorem can now be 
obtained as coverings of small graphs that triangulate suitable surfaces. There are also 
other ways to build triangulations from triangulations. 


DEFINITIONS 


D16: [Gr74] A voltage graph (G = (V, E), a) is a directed graph G with an assignment 
a: E — B of elements of a group 6 to its arcs. The group B is called the voltage 
group. 


D17: The net voltage on a walk in a graph is the product (or sum, if the voltage 
group is abelian) of the voltages along that walk. 


D18: The Kirchhoff voltage law (abbr. KVL) holds for an imbedded voltage graph 
if on every face boundary walk, the net voltage equals the identity of the voltage group. 


D19: The composition G[H] of a graph G with a graph A is the graph with vertex 
set V(G) x V(H) such that (ui, v1) is adjacent to (w2, v2) whenever either uz is adjacent 
to uz, or Vv, is adjacent to v2 with uj = ug. In particular, we denote G[K »] simply by 
Gm), where Ky» is the graph over m vertices with no edge. 


D20: A natural projection p: G(m) — G is called a covering with folds. This is an 
m-to-1 surjective homomorphism mapping (u,v) to u for each vertex u € V(G). (There 
will be a more general definition of a covering with folds in other contexts.) 


FACTS 


F23: [Gr74] An imbedded voltage graph (G — S,a) lifts to an imbedding G° > S® 
of a covering graph of G into a branched covering of the surface S, such that branch 
points occur only in the interiors of the faces, with at most one branch point per face. 


F24: [Gr74] Let G—> S bea triangular imbedding with voltage assignment a such that 
the Kirchhoff voltage law holds. Then the resulting graph imbedding in the covering 
surface of S is also a triangular imbedding. (If the covering graph G® is simple and not 
K3, then the resulting imbedding is a triangulation.) 


F25: [Bo82a] Let G be a triangulation on a closed surface S. If a positive integer m is 
not divisible by 2, 3 or 5, then G(,,) triangulates another closed surface with the same 
orientability as S. 


F26: [Bo82b] If a triangulation G on a closed surface S is eulerian, then G(,,) trian- 
gulates another closed surface with the same orientability as S. 


F27: [Ar92] If the complete graph K,, triangulates a closed surface S' and if each prime 
factor of m is at least n — 1 except the case of n = 4, m = 3, then Ky(m) triangulates 
another closed surface, where Ky, (m) stands for the n-partite graph Km,....m with partite 
sets of size m. 


peeeg 
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EXAMPLE 


E12: We observe in Figure 7.8.3 that the imbedding Bz; > Sj is a KVL triangular 
imbedding. Moreover, for any cyclic group Z,,, and that for n > 7, the covering graph 
is simple. Accordingly, Fact F24 implies that the covering imbedding is a triangulation. 
For sufficiently large n, the face-width of the covering imbedding is arbitrarily large. 


Figure 7.8.3: A KVL imbedding Bs > S; with voltages in arbitrary Zp. 


7.8.3 Irreducible Triangulations 


Look at one edge in a triangulation. There are two triangles incident to the edge 
from both sides and two wheels cover them. With such a local picture around an edge, 
one will guess that shrinking this edge yields another triangulation smaller than the 
original. Moreover, he might consider that this fact can be used for some proofs with 
induction on the number of vertices or edges. What is the first step of such induction? 
That is, what can we get, repeating this deformation as far as possible? “Irreducible 
triangulations” are exactly the answer. There is a strong connection between studies 
on irreducible triangulations and graph minor theory. 


Edge Contraction 


DEFINITIONS 


D21: Let acb and acd be the two faces sharing the edge ac of a triangulation G - S. 
Contraction of the triangulation on the edge ac is to shrink the adjacent triangles 
acb and acd to a path bad = bcd, as in Figure 7.8.4. We do not contract an edge of a 
triangulation unless it results in another triangulation on the surface S. 


Ka. XA 
AV a, 


Figure 7.8.4: Edge contraction. 


D22: A vertex splitting is the inverse operation of an edge contraction. 


D23: A contractible edge in a triangulation G — S on a closed surface is an edge 
whose contraction would yield another triangulation on S. 


D24: A triangulation G — S is said to be contractible to another triangulation 
H + S if it can be obtained from G — S by a sequence of edge contractions. 
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FACTS 


F28: An edge in a triangulation G on a closed surface, except K4 on the sphere, is 
contractible if and only if it is contained in exactly two cycles of length 3, which are the 
boundary cycles of two faces sharing the edge. 


F29: [Ne94] A triangulation G —> S is contractible to a triangulation H — S of the 
same surface if and only if the skeleton H of the latter triangulation is a minor of the 
graph G. 


REMARK 


R2: An edge contraction in a triangulation is different from that in graph minor theory. 
The former always decreases the number of edges by 3 at a time, while the latter does 
so by 1 at a time. 


Classification and Finiteness in Number 


DEFINITION 

D25: An irreducible triangulation on a closed surface is one that has no contractible 
edge. 

FACTS 


F30: Every irreducible triangulation on any closed surface, except the sphere, has 
minimum degree at least 4. 


F31: Every triangulation is contractible to an irreducible triangulation. Equivalently, 
it can be obtained from an irreducible triangulation by a sequence of vertex splittings. 


F32: [StRa34] The only irreducible triangulation on the sphere is the tetrahedron, 
whose skelton is isomorphic to K4. 


F33: [Ba82] There are precisely two irreducible triangulations on the projective plane 
up to isomorphism, shown in Figure 7.8.3. Their skeletons are isomorphic to Ke and 
K4+ K3. 


1 1 


Figure 7.8.5: The two irreducible triangulations on the projective plane. 
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F34: [La87] There are precisely 21 irreducible triangulations on the torus up to iso- 
morphism, which are given in Figure 7.8.6 below, where each pair of parallel sides of 
the rectangles should be identified. They are denoted by T1 to T21. The skeleton of 
T1 is isomorphic to K7. 


Figure 7.8.6: The 21 irreducible triangulations on the torus. 


F35: [LaNe97, Sul06] There are precisely 25 irreducible triangulations on the Klein 
bottle up to isomorphism.They are classified into two classes, namely, handle types 
Kh1 to Kh25 in Figure 7.8.7 below and crosscap types Kcl to Kce4 in Figure 7.8.8 
below. Identify each horizontal pair of sides in parallel and each vertical pair in anti- 
parallel to recover the handle types and identify each antipodal pair of vertices lying on 
the hexagons for crosscap types. 
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Kh6 Kh7 Kh8 Kh9 
Kh14 


Kh10 


Kh16 Kh17 kh18 Kh19 Kh20 
Kh21 Kh22 Kh23 Kh24 Kh25 


Figure 7.8.7: The 25 irreducible triangulations on No, handle types. 


F36: [BaEd89] There are only finitely many irreducible triangulations on any closed 
surface, up to isomorphism. 


F37: [JoWol0] Any irreducible triangulation of a closed surface S with (S$) < 0 has 
at most 13(2 — x(S')) — 4 vertices. 


EXAMPLES 


E13: All minimum triangulations on a closed surface are irreducible, but there are 
irreducible triangulations that are not minimum, in general. 


E14: Triangulations of closed surfaces by Ky, and Kym) are irreducible. 
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Ke3 Ke4 


Figure 7.8.8: The 4 irreducible triangulations on N2, crosscap types. 


E15: Let G; > S; and Gz — Sp» be two irreducible triangulations on closed surfaces. 
Form the connected sum of these triangulations by discarding the interiors of a face in 
each and pasting on the boundary cycles. The resulting triangulation on the surface 
S15» is irreducible. In particular, all four crosscap types of irreducible triangulations 
on the Klein bottle are obtained from two of those on the projective plane in this way. 


REMARKS 


R3: No pair of skeletons for distinct irreducible triangulations on the torus are isomor- 
phic as graphs. 


R4: The only pair of isomorphic skeletons for distinct irreducible triangulations on the 
Klein bottle is (Kh2, Kh5). 


R5: The only pair of isomorphic skeletons for distinct irreducible triangulations on the 
torus and the Klein bottle is (T3, Kh1). 


Other Irreducibility 


There are some studies on triangulations with prescribed properties which are irreducible 
or minimal with respect to edge contractions — those of higher representativity or with 
large minimum degree, for example. 


DEFINITIONS 


D26: An essential cycle in a graph imbedding on a closed surface is a cycle that 
bounds no 2-cell region on S. 


D27: A k-irreducible triangulation on a closed surface is a triangulation such that 
each edge is contained in an essential cycle of length at least k. 


D28: A clean triangulation is minimal if no edge contraction results in a clean 
triangulation. (This is the same as 4-irreducibility.) 
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FACTS 


F38: On any closed surface S' except the sphere, every triangulation containing no es- 
sential cycle of length less than k& can be obtained from some k-irreducible triangulation 
by a sequence of vertex splittings. 


F39: [MaNe95] There are only finitely many k-irreducible triangulations on any closed 
surface, up to isomorphism. 


F40: [MaMo92] There are only finitely many minimal clean triangulations on any 
closed surface, up to isomorphism. 


F41: [FiMoNe94] There are exactly five minimal clean triangulations on the projective 
plane. 


F42: [NaNe02] On any closed surface S except the sphere, every triangulation with 
minimum degree at least 4 is obtainable from some irreducible triangulation by splitting 
vertices, preserving their degrees to be at least 4, and adding octahedra to faces. 


F43: [NaNe02] Every triangulation on the sphere with minimum degree at least 4 can 
be obtained from the octahedron in the same way as in Fact F42. 


REMARKS 


R6: On any closed surface S' except the sphere, every irreducible triangulation is 
3-irreducible. 


R7: The finiteness of the number of minor-minimal k-representative imbeddings follows 
from Fact F39 ([MaNe95], where a k-irreducible triangulation is called a k-minimal 
triangulation). A simple proof of the same result can be found in [GaRiSe96]. 


7.8.4 Diagonal Flips 


An edge in a triangulation can be regarded as a diagonal in a quadrilateral, and 
switched to the other diagonal, which is called “a diagonal flip”. All sufficiently large 
triangulations of the same size differ from each other only by diagonal flips ([Ne94]). 


DEFINITIONS 


D29: A diagonal flip of an edge ac shared by triangles acb and acd in a triangulation 
G means to replace ac with the other diagonal bd in the quadrilateral abcd, as shown in 
Figure 7.8.9. We do not perform a diagonal flip unless the resulting skeleton would be 
a simple graph. 


D30: Two triangulations on a surface are equivalent under diagonal flips if one 
can be transformed into the other by a finite sequence of diagonal flips. 


D31: A frozen triangulation is a triangulation such that no edge can be flipped. 
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A. We 
VV VAY 


Figure 7.8.9: Diagonal flip. 


D32: The standard triangulation on the sphere with n vertices is shown in Figure 
7.8.10, and is denoted by A,_3. The skeleton is isomorphic to P,-2 + Ko. 


U W 


Figure 7.8.10: The standard triangulation A, on the sphere. 


NOTATION: The result of adding A,, to an arbitrarily chosen face of a triangulation 
G — S on a closed surface is denoted by G+ A,, > S. (All possible triangulations 
G+Am — S are equivalent under diagonal flips. See [Ne94].) 


FACTS 


F44: [Wa36] Any two triangulations on the sphere with the same number of vertices 
are equivalent under diagonal flips, up to isotopy. 


F45: [NeWa90] Any two triangulations on the projective plane with the same number 
of vertices are equivalent under diagonal flips, up to isotopy. 


F46: [De73] Any two triangulations on the torus with the same number of vertices are 
equivalent under diagonal flips, up to isomorphism. 


F47: [NeWa90] Any two triangulations on the Klein bottle with the same number of 
vertices are equivalent under diagonal flips, up to isomorphism. 


F48: [Ne94] For every closed surface S, there exists a natural number N(S) such that 
two triangulations G; and G2 with |V(Gi)| = |V(G2)| > N(S) are equivalent under 
diagonal flips, up to isomorphism. 


F49: [NaOt97] For every closed surface S, there exists a natural number N(S) such 
that two triangulations G; and G2 with |V(G1)| = |V(G2)| => N(S) are equivalent 
under diagonal flips, up to isotopy. 


890 Chapter 7. Topological Graph Theory 


F50: [Ne01] Let G; and G2 be two triangulations on a closed surface S' with the same 
number n of vertices, and let m > 18(n—x(S)). Then any two subdivided triangulations 
of the form G; + A,, and Gz + A,, are equivalent under diagonal flips, up to isotopy. 


F51: [Ne94] A frozen triangulation is irreducible. 


EXAMPLES 


E16: For the sphere Sp, the projective plane N;, the torus S; and the Klein bottle 
N2, we have the following numbers: 


N(So) =4, N(Ni)=6, N(Si:)=7, N(N2)=8 


E17: The irreducible triangulations on the torus can be partitioned into five equivalence 
classes, as follows, under sequences of diagonal flips: 


CTS. ATOATS Pa, Tale ATT. ATO yo To0l E21) 


E18: If K,, or Kj (m) triangulates a closed surface, then it is a frozen triangulation. 


REMARKS 


R8: The problem of equivalence would be nearly trivial, and the lower bound N(S) 
for the order of triangulations would be meaningless, if we allowed diagonal flips that 
resulted in non-simple skeletons. If the requirement of simpleness is removed, then there 
is a greedy algorithm to transform one of two triangulations into the other. See [Ne01] 
for details. 


R9: Within the theory of diagonal flips in topological graph theory, the positions 
of vertices may be moved on surfaces, up to homeomorphism or isotopy. However, in 
computational geometry, there are studies of diagonal flips in triangulations in which 
the vertices have fixed positions in the plane. 


R10: The bound N(S) in Fact F48 is actually necessary, because there exist frozen 
triangulations on infinitely many surfaces, as discussed in [Ne99b]. The arguments 
needed to prove the theorem also work for labeled triangulations and for triangulations 
with boundary, with suitable modifications. See [Ne99a]. 


R11: The bound N(S) in Fact F49 is large and unknown, even when S$ is the torus or 
the Klein bottle. 


Estimating Bounds 


We consider how many diagonal flips are necessary to transform one triangulation into 
another. 


DEFINITION 


D33: A pseudo-minimal triangulation is a triangulation such that no sequence of 
diagonal flips transforms it into one having a vertex of degree 3. 
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FACTS 
F52: [Ne94] A pseudo-minimal triangulation is irreducible. 


F53: [Ne94] Let {T;} be the set of the pseudo-minimal triangulations of a closed 
surface S. The precise value of N(S') is equal to the minimum number WN such that all 
the subdivisions T; + Ay_|v7,)| can be transformed into one another by diagonal flips, 
up to isomorphism. 


F54: [Ne01] Let Vpse(S) denote the maximum order taken over all the pseudo-minimal 
triangulations of a closed surface S with Euler characteristic x(S). Then we have: 


N(S) < 19 Vose(S) — 18 x(S) 


F55: [Ne98] Given a closed surface S, there are two constants a; and ao, depending 
only on S$, such that any two triangulations G; > S$ and Gg > S with n > N(S) 
vertices can be transformed into each other by at most 2n? + ayn + ag diagonal flips, 
up to isomorphism. 


F56: [MoNaOt03] Any two triangulations with n vertices on the sphere can be trans- 
formed into each other, up to isotopy, by at most 6n — 30 diagonal flips if n > 5. 


F57: [MoNa03] Any two triangulations with n vertices on the projective plane can be 
transformed into each other, up to isotopy, by at most 8n — 26 diagonal flips. 


F58: [GaUrWa01] Any two labeled triangulations with n vertices on the sphere can 
be transformed into each other, up to isotopy, by O(n log n) diagonal flips. 


EXAMPLE 


E19: All pseudo-minimal triangulations on the sphere, the projective plane, the torus 
and the Klein bottle are minimum triangulations on these surfaces. 


REMARK 


R12: Since there is a linear upper bound for the order of irreducible triangulations 
with respect to the genus of S, the upper bound for N(S) given in Fact F54 also is 
linear. 


Catalan Triangulations 


There are some studies on diagonal flips in Catalan triangulations with the same frame- 
work as above, although the lack of interior vertices is an obstacle to the general argu- 
ments in [Ne94]. Furthermore, there has been shown an amazing method for Catalan 
triangulations of polygons in [SITaTh88], in harmony with combinatorics, hyperbolic 
geometry and computer science. 


DEFINITION 


D34: A punctured surface is a closed surface with one hole, that is, a surface with 
connected boundary. 
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FACTS 


F59: [SlTaTh88] Any two Catalan triangulations of an n-gonal disk with n > 13 can be 
transformed into each other by at most 2n — 10 diagonal flips. There exists an example 
attaining this bound. 


F60: [EdRe97] Any two Catalan triangulations of the Mobius band with the same 
number of vertices are equivalent under diagonal flips, up to isomorphism. 


F61: [CoNa00a] Any two Catalan triangulations of the punctured torus with the same 
number of vertices are equivalent under diagonal flips, up to isomorphism. 


F62: [CoNa00b] Any two Catalan triangulations of the punctured Klein bottle with 
the same number of vertices are equivalent under diagonal flips, up to isomorphism. 


F63: [CoGlMaNa02] Given a puctured surface S, there exists a natural number M(S) 
such that two Catalan triangulations G and G2 of S are equivalent under diagonal 
flips, up to isomorphism, if |V(G1)| = |V(G2)| > M(S). 


Preserving Properties 


Any diagonal flip preserves the order of triangulations while an edge contraction de- 
creases it by one. Nevertheless, the former is closely related to the latter, as Facts 51 
and 52 suggest. This makes a connection of the theory of diagonal flips to graph minor 
theory and leads us to more general or formal arguments on conditional generating of 
triangulations. 


DEFINITIONS 


D35: A class P of triangulations on S is said to be splitting-closed if it is closed 
under vertex splittings. 


D36: Let P be a class of triangulations. A triangulation is called a P-triangulation 
(or a triangulation with property P) if it belongs to P. 


D37: A P-diagonal flip in a P-triangulation G is a diagonal flip such that the 
resulting graph is also a P-triangulation. 


D38: Two P-triangulations G; and G2 are said to be P-equivalent under diagonal 
flips if they can be transformed into each other by a finite sequence of P-diagonal flips. 


D39: A class P of triangulations on a closed surface S is said to be closed under 
homeomorphism if h(G) € P for any member G € P and for any homeomorphism 
hA:S->S., 


FACTS 


F64: [BrNaNe96] For any closed surface S and for any splitting-closed class P of 
triangulations on S, there exists a natural number Np(S) such that if G; and G2 are two 
P-triangulations with |V(G1)| = |V(G2)| > Np(S), then G; and G2 are P-equivalent 
under diagonal flips, up to isomorphism. 
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F65: [BrNaNe96] For any closed surface S and for any splitting-closed class P of 
triangulations on S' which is closed under homeomorphism, there exists a natural number 
Np(S) such that if G, and G, are two P-triangulations with |V(G,)| = |V(G2)| > 
Np(S), then G, and G2 are P-equivalent under diagonal flips, up to isotopy. 


F66: [KoNaNe99] For any closed surface S except the sphere, there exists a natural 
number N4(S) such that two triangulations G, and Gz on S with minimum degree at 
least 4 can be transformed into each other by a finite sequence of diagonal flips, up to 
isomorphism, through those triangulations if |V(Gi)| = |V(G2)| > Na(S). 


F67: [KoNaNe99] Two triangulations on the sphere, except the double wheels, with 
minimum degree at least 4 can be transformed into each other, up to isotopy, by a finite 


sequence of diagonal flips through those triangulations if they have the same number of 
vertices. 


EXAMPLE 


E20: The following properties are splitting-closed and closed under homeomorphism. 
(i) Being k-representative. 
(ii) Intersecting any non-separating simple closed curve in at least k points. 

(iii) Containing at least k disjoint homotopic cycles. 

(iv) Containing at least & disjoint cycles. 


(v) Containing k distinct spanning trees. 


REMARK 


R13: The class consisting of triangulations on a closed surface with minimum degree 
at least 4 is not splitting-closed, and hence the meta-theorems in [BrNaNe96] cannot be 
used to prove the theorems in [KoNaNe99]. 


7.8.5 Rigidity and Flexibility 


A triangulation may seem quite rigid. So one might guess that it is hardly possible 
for a graph that triangulates a closed surface to have another imbedding on that same 
surface, which is actually true for the sphere. However, the complete graph triangulates 
a closed surface in numerous ways. Here we shall consider many facts on the rigidity 
and flexibility of triangulations. 
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Equivalence over Imbeddings 


To analyze many imbeddings of a graph, it is often useful to deal with an imbedding as 
a map rather than a drawing on a surface. That is, an imbedding of a graph G into a 
surface S' is an injective continuous map f : G — S from a 1-dimensional topological 
space G to S. 


DEFINITIONS 


D40: Two imbeddings f,, fo : G — S of a graph into a surface are equivalent 
imbeddings if there exists a homeomorphism h: S > S with hf, = fo. 


D41: Two imbeddings fi, fo : G — S of a graph into a surface are congruent 
imbeddings if there exists a homeomorphism h : S > S$ and a graph automorphism 
0:G—>G with hf; = foo. 


D42: An automorphism o € Aut(G) is called a symmetry of an imbedding f : G3 S 
if there is a homeomorphism h: S > S with hf = fo. 


D43: The symmetry group of an imbedding f : G — S is the subgroup Sym(f) 
in Aut(G) consisting of the symmetries of the imbedding. 
FACTS 


F68: Ifa simple graph has a triangular imbedding on a closed surface, then all of its 
imbeddings on that surface are triangular. 


F69: Equivalent triangular imbeddings of a graph G have the same set of face boundary 
cycles over V(G). 


F70: Congruent triangular imbeddings of a graph G correspond to isomorphic trian- 
gulations. 


F71: An imbedding f :G— S is equivalent to fo for any symmetry o € Sym(f). 


F72: An imbedding f : G > S is congruent but is not equivalent to fo for any 
automorphism a € Aut(G) — Sym(f). 


F73: The number of inequivalent imbeddings of a graph G congruent to a fixed imbed- 
ding f : G— S is equal to |Aut(G)|/|Sym(f)|. 


Uniqueness of Imbeddings 


It has been known that the skeleton of an imbedding of sufficiently large representativity 
is rigid, that is, it has a unique imbedding on the surface that contains it. However, 
relatively simple conditions force any skeleton of a triangulation to be rigid. 


DEFINITIONS 


D44: A graph is said to be uniquely imbeddable on a surface S if all of its imbeddings 
into S' are equivalent. 
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D45: A skew vertex in a triangulation G on a closed surface S is a vertex v such 
that there are at least two cycles each of which contains all the neighbors of v. 


FACTS 


F74: The skeleton of every triangulation on the sphere is uniquely imbeddable on the 
sphere, up to equivalence. This is an easy consequence of the well-known fact that every 
3-connected planar graph is uniquely imbeddable on the sphere. 


F75: [NeNaTa97] A graph that triangulates a closed surface is uniquely imbeddable on 
that surface, up to equivalence, if any face has at most two skew vertices as its corners. 


F76: [Ne83] The skeleton of a 4-representative triangulation on a closed surface is 
uniquely imbeddable on that surface, up to equivalence. 


F77: [Ne83] The skeleton of a 6-connected toroidal triangulation is uniquely imbeddable 
on the torus, up to congruence, and also up to equivalence, with three exceptions, shown 
in Figure 7.8.11. 


0 2 
0 2 7 1 

6 1 6 0 

5 0 5 7 

4 6 4 6 

3 5 3 5 0 0 
2 4 2 4 2 2 
1 ar, 4 3 1 1 
0 2 0 2 0 0 
T(7,2,1) T(8,2,1) T(3,0,3) 


Figure 7.8.11: The three exceptional 6-connected toroidal triangulations. 


F78: [Ne84a] The skeleton of a 5-connected projective-planar triangulation is uniquely 
imbeddable on the projective plane up to equivalence, unless it is isomorphic to Ke. 
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F79: [Ne84b] The skeleton of a 6-connected Klein-bottle triangulation is uniquely 
imbeddable on the Klein bottle up to congruence, and likewise up to equivalence, with 
one exception, illustrated in Figure 7.8.12. 


0 0 

2 1 

1 2 

0 0 
Kh(3,3) 


Figure 7.8.12: The exceptional 6-connected Klein-bottle triangulation. 


TERMINOLOGY 


In Negami’s early papers [Ne84a] etc., the uniqueness means that up to congruence, and 
an imbedding f : G > S is said to be a faithful imbedding if Sym(f) = Aut(G). By 
his terminology, a graph can be said to be uniquely and faithfully imbeddable if it 
is uniquely imbeddable up to equivalence in our sense. In some papers, two imbeddings 
are said to be weakly equivalent if they are congruent. 


Re-Imbedding Structures 


It is hardly possible to classify all the mechanisms that generate inequivalent imbeddings 
of a graph. However, there is a theory to describe the flexibility of triangulations. 


DEFINITIONS 


D46: A face of a triangulation G — S is a panel if its boundary cycle bounds a face 
in every imbedding of G in S. 


D47: The panel structure of an imbedding G > S is a pair (G > S,P) in which P 
is the set of all the panels. 


D48: The panel structures of two triangulations G; + S and G2 — S are said to be 
equivalent panel structures if the 2-simplicial complexes obtained from the skeletons 
G and G2 by inserting all of their panels are homeomorphic. 

FACTS 

F80: [LaNe99] Two faces incident to a contractible edge of a triangulation are panels. 


F81: [NeNaTa97] A face that has at most two skew vertices at its corners is a panel. 


F82: [NeNaTa97] Two triangulations on a closed surface having equivalent panel struc- 
tures admit the same number of inequivalent imbeddings on the surface. 


F83: The number of inequivalent imbeddings of a triangulation G > S (with S' closed) 
does not exceed that of an irreducible triangulation to which G — S is contractible. 
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F84: [NeNaTa97] There exist only finitely many panel structures on each closed surface, 
up to equivalence. 


F85: [La92] Every projective-planar triangulation admits exactly 1, 2, 3, 4, 6 or 12 
inequivalent imbeddings on the projective plane. 


F86: [Sa03] Every toroidal triangulation admits exactly 1, 2, 3, 4, 5, 6, 8, 10, 12, 14, 
16, 24, 48 or 120 inequivalent imbeddings on the torus. 


EXAMPLE 


E21: Let Q(S) denote the maximum number of inequivalent imbeddings taken over 
all graphs that triangulate a given closed surface S. For the sphere So, the projective 
plane Nj, the torus S; and the Klein bottle Nz, we have: 


Q(So) =1, Q(N1) = 12, Q(S1) = 120, Q(N2) = 36 


The first three are attained by Ky, Kg, and K7 on these surfaces in order while the last 
one is attained by the triangulation obtained from two copies of Kg on the projective 
plane by pasting them along one pair of faces. 


REMARK 


R14: To determine the maximum number of inequivalent imbeddings taken over all 
triangulations on a closed surface, it suffices to investigate irreducible triangulations, 
by Fact F83. On the other hand, the classification of panel structures exhibits all “re- 
imbedding structures” and enables us to decide all possible values that appear as the 
number of inequivalent imbeddings of triangulations. See [NeNaTa97] for the theory of 
panel structures. 


Imbeddings into Other Surfaces 


What happens when we imbed the skeleton of a triangulation into other surfaces? 
DEFINITION 


D49: A graph G is said to quadrangulate a surface S if G can be imbedded on S' so 
that each face is a 4-cycle. 


FACTS 


F87: {HoGl77] The skeleton of a triangulation on an orientable closed surface is an 
upper imbeddable graph. That is, it can be cellularly imbedded on a suitable orientable 
closed surface with one or two faces. 


F88: [LaNe99] A graph triangulates both the torus and the Klein bottle if and only if 
it has the structure shown in Figure 7.8.13, where each triangle with C) may be divided 
into many triangles. 
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Figure 7.8.13: Triangulating both the torus and the Klein bottle. 
F89: [NaNeOtSi03] For any closed surface S, there is a triangulation of the sphere 
whose skeleton quadrangulates the surface S. 


F90: [NaNeOtSi03] No 5-connected graph triangulates the sphere and quadrangulates 
another orientable closed surface. 


F91: [NeSu00] There is a 5-connected graph that triangulates the sphere which quad- 
rangulates the nonorientable closed surface Nz if and only if k = 10 or k > 12. Such a 
triangulation for k = 10 is unique up to isomorphism. 


F92: [Su03] If two closed surfaces S$; and S2 satisfy the relation 2y(S 1) — x(S2) > 4, 
then there is a graph that triangulates S; and quadrangulates S2. 


EXAMPLES 
E22: The octahedron K2 9 triangulates the sphere and quadrangulates the torus. 


E23: The complete graph Ky, triangulates the sphere and quadrangulates the projective 
plane. 


E24: The only irreducible triangulation on the torus whose skeleton also triangulates 
the Klein bottle is T3, and the triangulation on the Klein bottle is isomorphic to Kh1. 


E25: [HaRi89] The complete graph K,, triangulates and quadrangulates two different 
closed surfaces if and only if n = 0,1,4 or 9 (mod 12). This is exactly what Euler’s 
formula requires. 
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INTRODUCTION 


To every finite geometry, there are associated two distinct graphs, the Levi graph 
and the Menger graph. Imbeddings of these graphs into closed 2-manifolds can lead to 
models of the geometries. The emphasis is on imbeddings that are optimal with respect 
to euler characteristic, or that utilize important collineations of the geometry as map 
automorphisms. 

The graphs in this section are simple. 


7.9.1 Finite Geometries 


A finite geometry might be of intrinsic interest; it might produce a useful block 
design; or, it might have an aesthetically pleasing model. 


DEFINITIONS 
D1: A geometry (P,L) consists of a non-empty set P called points, together with a 


non-empty collection L of subsets of P called lines. 
D2: A finite geometry is a geometry (P, L) whose point-set P is finite. 


D3: A (v,b,r,k,)-balanced incomplete block design (abbr. BIBD) is a finite 
geometry (P,L) with v = |P| and b = |L| that satisfies the following three axioms of 
uniformity: 

e Every point is in exactly r lines. 

e Every line consists of exactly k points. 


e Every pair of points belong to exactly A common lines. 
D4: The axiom of uniqueness for a (v,b,7r, k, A)-BIBD is \ = 1. 
D5: A Steiner triple system is a (v,b,r,k, X)-BIBD with k = 3 and \ = 1. 
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D6: A (v,b,7, k; A1, A2)-partially balanced incomplete block design is just like a 
BIBD, except that the points can be regarded as the vertices of a fixed strongly regular 
graph, in which two points are non-adjacent (or adjacent) if they belong to exactly A; 
(or Ag) common lines (abbr. PBIBD). 


D7: An (r,k)-configuration is a finite geometry (P,L) that satisfies the first two 
axioms of uniformity, but replaces the third axiom with the following: 


e Every pair of points belong to at most one common line. 
D8: A 3-configuration is an (r,k)-configuration such that k = 3. 


D9: A (v,b,7r, k; 0, 1)-block design (abbr. BD) is an (r, k)-configuration (P, L), where 
v = |P| and b = |L|, such that no or some pairs of distinct points are not collinear 
(Ai = 0), and the other pairs of distinct points are uniquely collinear (Az = 1). 


D10: A symmetric configuration (v), is a (v,b,r,k;0,1) block design such that 
r =k, which implies v = b, by Fact F1 below. 
D11: For each natural number n > 1, the n-point geometry has n points, and all 
the 2-subsets of those points as its lines. 
D12: A finite affine plane of order n is a finite geometry (P, L) that satisfies these 
axioms: 

e Two distinct points are in a unique common line. 


e For a given point not in a given line, there is a unique parallel (non-intersecting) 
line containing that point. 


e There exist four distinct points, no three collinear. 

e There exists a line having exactly n points. 
D13: A finite projective plane of order n (abbr. I(n)) is a finite geometry (P, L) 
that satisfies these axioms: 

e Two distinct points are in a unique common line. 

e Two distinct lines contain a unique common point. 

e There exist four distinct points, no three collinear. 


e There exists a line having exactly n + 1 points. 


FACTS 


F1: In any (v,b,7r, k, A)-BIBD, (v, 6,7, k;0,1)-PBIBD or (v, b,r,&;0,1)-BD, the axioms 
of uniformity imply that vr = bk. 


F2: A 3-configuration (P,L) exists if and only if ur = 3b and v > 2r +1. Thus, a 
symmetric configuration (v)3 exists if and only if v > 7. 


F3: For a prime power n > 9 that is not prime, there are at least two non-isomorphic 
projective planes of order n. There are exactly four, for n = 9. 


F4: For n = 2,3,4,5,7, and 8, I(n) is uniquely PG(2,n). 
F5: Neither a II(6) nor a II(10) exists. 


F6: There is no known finite projective plane II(n) where n is not a prime power. 
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F7: [BrRy49] Ifn = 1, 2(mod 4), and if n is not a sum of two squares, then no projective 
plane II(n) exists. 


F8: The affine plane AG(2,q) is a resolvable (q?,q? +. q,q¢+1,q,1)-BIBD. Such planes 
exist for every prime power g. The resolvable feature is that the q? + q lines partition 
into g+1 parallel classes of q lines each, each class partitioning the point set. 


F9: Every triangle in the Euclidean plane has the following four triples of concurrent 
lines: 


e The perpendicular bisectors of the three sides meet in the ctrcumcenter. 
e The altitudes meet in the orthocenter. 
e The internal angle bisectors meet in the tncenter. 


e The medians meet in the centroid. 


If the triangle is equilateral, then all four points of concurrency coincide. This fact is 
used as background for Example E10. 


EXAMPLES 


El: The Euclidean plane is an (infinite) affine plane. 
: ; n(n — 1) 
E2: The n-point geometry is an [ n, =p 1,2,1)-BIBD. 


E3: An r-regular graph is representable as an (r, 2)-configuration. 
E4: An r-regular, k-uniform hypergraph is representable as an (r, k)-configuration. 


E5: The Theorem of Pappus in Euclidean geometry states that if A, B,C are distinct 
points on line L and A’, B’, C’ are three different distinct points on line L’ 4 L, then the 
three points D = AB’N A’B, E = AC'N A’C, F = BC'N B'C are collinear. This gives 
a 3-configuration on the nine points {A, B,C, A’, B’,C’, D, E, F} called the geometry 
of Pappus, which is a (9,9,3,3;0,1)-PBIBD. (See [Wh01], for example, for a diagram 
of the geometry of Pappus.) 


E6: The Theorem of Desargues in Euclidean geometry states that if triangles ABC 
and A’B’C’ are in perspective from point P, then the three points D = ABN A’ B’, FE = 
AC 0 A’C’,F = BCN B'C’ are collinear. This gives a 3-configuration on the ten 
points {P, A, B,C, A’, B’,C’, D, E, F} called the geometry of Desargues, which is a 
(10, 10, 3,3;0,1)-PBIBD. (See [Wh01], for example, for a diagram of the geometry of 
Desargues.) 


E7: For each prime power gq, there is a classical finite affine plane AG(2,q), which is 
a (¢7,¢7+4¢,¢+1,¢,1)-BIBD with \ = 1. The affine plane AG(2,¢q) has as its points 
the 1-dimensional affine subspaces of the 3-dimensional vector space over GF'(q), and 
as its lines the 2-dimensional affine subspaces. In particular, AG(2,2) is the 4-point 
geometry. 


E8: For each prime power q, there is a classical finite projective plane II(q) = PG(2, q), 
which is a (qg7+q+1,q?+¢+1,¢+1,q+1,1)-BIBD with \ = 1. The projective plane 
PG(2, q) has as its points the 1-dimensional vector subspaces of the 3-dimensional vector 
space over GF'(q), and as its lines the 2-dimensional vector subspaces. 
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E9: To every projective plane II there corresponds an affine plane II’ (obtained by 
deleting one edge and all points on that edge from IT), and conversely. The affine plane 
obtained from PG(2,q) is AG(2,q). 


E10: The Fano plane, a familiar 3-configuration that models PG(2, 2), is shown in 
Figure 7.9.1. It has seven points. The three sides, the three medians, and the incircle 
form its seven lines. This (7,7,3,3,1)-BIBD is the smallest non-trivial Steiner triple 
system. 


Figure 7.9.1: The Fano plane. 


The line {0, 1,3} is called a perfect difference set, since each non-identity element of 
Z7 appears uniquely as a difference of two elements in the set. It generates the other 
six lines, using translations by Z7. But note that this model has several defects (to be 
remedied in §7.9.3): 
e The circular line {1,2,4} is differently depicted. Yet Z7 acts transitively on the 
line set L, as a subgroup of the full collineation group (of order 168). 


e The point 0 is distinguished by its central position, yet Z7 acts transitively on the 
point set P as well. 


The point 2 (for example) seems to be between points 3 and 5. Yet there is no 
concept of betweenness in this geometry. 


e One cannot discern that r = 3 by looking at small neighborhoods of points 0, 1, 2, 
and 4. 


e There are three extraneous intersections of lines (i.e., of the three cevians with 
the circle) that have no meaning in the geometry. 


E11: The projective plane PG(2,4) can be cyclically generated, using Z2), from the 
perfect difference set {0,1,6,8,18} = Lo. Then line L; = Lo +7, for 0 <i < 20. Let 
Lo = {0,6, 18}, with Li, = £5 + 3k, for 0 <k <6. Then using the Z7 subgroup of Za), 
we find the Fano plane PG(2,2) contained within PG(2, 4). 


E12: The Theorem of Desargues applies to the full projective plane PG(2,4); for 
example, the triangles 3 5 7 and 10 13 9 are in perspective from point 2, producing 
the three collinear points 19,6,20 and the geometry of Desargues, contained within 
PG(2,4). Next, deleting line Lo and its five points from PG(2,4) yields AG(2, 4), with 
its five parallel classes, as shown below. 


2 7 919 2 3 10 20 3 4 9 11 5 1113 2 1011 16 7 
4 5 10 12 7 13 15 4 138 141910 7 12 14 3 12 13 20 9 
14 15 20 11 9 14 16 5 15 16 2 12 «9 10 15 17 «+17 2 «4 «(#214 
16 17 3 13 #11 12 17 19 20 5 7 17 #19 20 4 16 19 3 5 15 


Notice that the bold (sub)lines give the three parallel classes of a Pappus configuration 
within AG(2,4) and hence within PG(2, 4). 
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7.9.2 Associated Graphs 


DEFINITIONS 


D14: The Menger graph of a geometry (P,L) has the point set P as its vertex set 
and all edges of the form {p1,p2}, for p1,p2 € P, where p; and pp are collinear; i.e., 
Pp1,p2 € l, for some 1 € L. 


D15: The Levi graph of a geometry (P,L) is the bipartite graph having PU L as its 
vertex set and all edges of the form {p,1}, for p € P,l € L, and p € 1. (Similarly, Levi 
graphs can encode objects and blocks of a block design, or vertices and hyperedges of a 
hypergraph.) 


D16: A (d,g)-cage is a graph of minimum order among all d-regular graphs having 
girth g. 


FACTS 
F10: In general, every BIBD of order v has K, as its Menger graph. 


F11: A Levi graph represents its geometry uniquely, whereas two or more different 
geometries might have the same Menger graph. 


F12: The Levi graph of an (r,k)-configuration has girth g > 6. The Levi graph of a 
symmetric configuration (v);, is k-regular. 


F13: [Si66] There is an (n + 1,6)-cage of order 2(n? +n +1) if and only if there exists 
a finite projective plane II(n). 


F14: The Levi graph of PG(2,n) is an (n + 1,6)-cage of order 2(n? +n+1), forna 
prime power. 


F15: Ifthe Menger graph of a (v, b, 7, k;0,1)-BD is strongly regular, then the geometry 
is a partially balanced incomplete block design. 


EXAMPLES 


E13: In general, the Menger graph of n-point geometry is K,, while the Levi graph of 
that geometry is homeomorphic to K,, (each edge of K,, is replaced by a path of length 
two). 


E14: The projective plane PG(2,2) and the 7-point geometry are clearly different 
geometries, having 7 and 21 lines, respectively, yet both have Menger graph K7. 


E15: The Levi graph of PG(2,2) is the Heawood graph, the unique (3,6)-cage. The 
Levi graph of 7-point geometry is obtained from K7 by performing an elementary sub- 
division on each edge, that is, by replacing each edge with a path of length two. 


E16: The Menger graph of a 3-configuration (v,b,7,3;0,1) is 2r-regular of order v, 
and its edge set decomposes into 6 disjoint 3-cycles. For instance, the strongly-regular 
octahedral graph is the Menger graph for a (6,4,2,3;0,1)-PBIBD called the “Pasch 
configuration”. The shading in Figure 7.9.2 below depicts the C3-decomposition. 
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Figure 7.9.2: The Pasch configuration. 


E17: The Menger graph for the geometry of Pappus is 3,3), the complement of the 
graph 3K3. The Menger graph for the geometry of Desargues is the complement of 
the Petersen graph. Since both Menger graphs are strongly regular, the corresponding 
incomplete block designs are both partially balanced. 


7.9.3. Surface Models 


DEFINITIONS 


D17: A model for an axiom system ™& for a finite geometry (P, L) is an inter- 
pretation of the points and lines of © such that each interpreted axiom in © is a true 
statement about (P, L). 


D18: A model of a geometry (P, L) is abstract if it specifies P as an abstract set of 
points and FL as a collection of subsets of P. It is concrete if it represents P as a finite 
set of points of R”, for some positive integer n, and LE as a collection of locally one- 
dimensional subsets of R” (often a geometric realization of a graph or of a hypergraph). 


D19: An axiom system © is consistent if no contradictions can be derived from it. 


D20: An axiom system % is independent if no axiom in ¥ can be derived from the 
other axioms in ©. 


D21: An axiom system » is complete if every statement in the undefined and defined 
terms of ©} can either be proven true or proven false, using ¥. 


D22: The genus of a finite geometry is the genus of its Levi graph. 
FACTS 
The first three facts below indicate the importance of models in the formal study of 


geometry. Informally, models have heuristic and pedagogical functions, and might even 
provide aesthetic pleasure. 


F16: An axiom system © is consistent if there is a model for . 


F17: An axiom system © is independent if, for each axiom o € %, a model can be 
found satisfying (U — 0) A(~ 0). 
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F18: An axiom system © is complete, if there is a unique (up to isomorphism) model 
for &. 


F19: The genus of n-point geometry (n > 2) is [(n — 3)(n — 4)/12]. In consideration 
of Example 13, this follows from [RiYo68]. 


F20: [FiWh00] The geometry of Pappus has genus 1. That of Desargues has genus 2. 
The Desargues model has a 3-fold rotational symmetry that fixes the point of perspec- 
tivity, the line of perspectivity, the two triangles of perspectivity, and nothing else. 


F21: [Wh95] Surface models for PG(2,q), g a prime power, depend upon the residue 
of q(mod 3): 


(a) If g = 2(mod 3), then PG(2,q) has genus 1 + (q — 2)(q?7 + q¢ + 1)/3; all the 
hyperregions are triangular. 


(b) If g= 1(mod 3), then PG(2,q) can be modeled on the surface of genus 1 + (gq — 
1)(q7 ++q4+1)/3, with q?+q+1 hyperregions pentagonal and all others triangular. 


(c) If g = 0(mod 3), then PG(2,q) is conveniently modeled on an orientable pseu- 
dosurface of characteristic (3 — 2¢)(q? + q+ 1)/3, with q? +q+1 hyperregions 
quadrilateral and all others triangular. 


In each case, the group Z,24 41 acts regularly, as a group of map automorphisms, on 
the point set, the line set, and on each orbit of the region set for the modified Levi 
graph imbedding. 


F22: Topological models for AG(2, q) are obtained by deletions from the above models 
for PG(2,q). 


F23: There is a toroidal symmetric 3-configuration (v)3 on v points for all vu > 7. 


F24: [Wh02] There is a 3-configuration (3n,n?,n,3;0,1)-PBIBD with Menger graph 
Knnn and genus (n—1)(n—2)/2 for alln > 1. This generalizes the Pasch configuration. 


F25: An imbedding of the Levi graph for a finite geometry on a closed orientable 
2-manifold (a surface) can be readily modified to model the geometry (on the same 
surface) by an imbedded graph G having bichromatic dual: vertices model points, and 
boundaries of regions of one fixed color model lines. (The regions of the other color are 
hyperregions.) If the geometry is a 3-configuration, then G is its Menger graph. The 
process reverses, so that the Levi graph can be obtained from such an imbedding of G. 


F26: Familiar classes of strongly regular graphs include the regular complete m-partite 
graphs Ky) (where m,n > 1) and the line graphs L(K,,) and L(Kn,n). 


F27: L(Knn) = Kn x Kn, the cartesian product. 
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EXAMPLES 


E18: An axiom system for the n-point geometry is as follows: 
(i) There are exactly n points. 
(ii) Two distinct points belong to a unique common line. 
(iii) Each line consists of exactly two points. 
This axiom system is consistent, since the complete graph K,, is a model. For n = 4, 


we have the following abstract model: 


P = {1,2,3,4} L = {{1, 2}, {1, 3}, {1, 4}, {2, 3}, {2, 4}, {3, 4}} 


The graph in Figure 7.9.3 is a concrete model (which could be imbedded in R? or, 
reversing the Riemann stereographic projection, imbedded on the sphere in R°). 


Figure 7.9.3: A concrete model for the 4-point geometry. 


This axiom system is independent, for n > 3, since: 


e The complete graph K,,+1 shows that axiom (i) does not follow from axioms (ii) 
and (iii). 


e The cycle graph C,, shows that axiom (ii) does not follow from axioms (i) and 
(iii). 

e The path graph P, (a one-line geometry) shows that axiom (iii) does not follow 
from axioms (i) and (ii). 


This axiom system is complete, since all models are isomorphic to K,. (The abstract 
and concrete models above for n = 4, for example, are isomorphic under the identity 
function on the point sets, inducing the identity function on the line sets.) 


E19: Medial graph constructions serve to model geometries having Menger graphs 
L(kK,,). For G = K4 as in Figure 7.9.3, we have M(G) = L(K4) = K3,2), as in Figure 
7.9.2. For n > 4, the models will be on pseudosurfaces. 


E20: Surgical constructions can be employed to model geometries having Menger 
graphs L(Ky,,), using Fact F27 and appropriate imbeddings of K,, for n = 1 or 3 
mod 6; these imbeddings will have bichromatic dual with one color class consisting of 
all triangles. These models will be on generalized pseudosurfaces. 


E21: The imbedded Menger graph of Example E16 of §7.9.2 can be readily modified 
to obtain a spherical imbedding of the corresponding Levi graph. Thus the Pasch 
configuration has genus 0. 
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E22: Conversely, the Levi graph for the Fano plane is the Heawood graph, the unique 
(3, 6)-cage, which has genus 1. The modification of a toroidal imbedding of the Heawood 
graph (see Figure 7.9.4(a) below) gives a bichromatic-dual imbedding of K7, the Menger 
graph for the Fano plane (see Figure 7.9.4(b) below). The seven triangular regions of 
either color class model the lines of this geometry. Thus the Fano plane has genus one. 
All the defects of the traditional model for this geometry, displayed in §7.9.1 (where the 
lines correspond to the unshaded regions in Figure 7.9.4(b)) are now remedied. 


(b) 


Figure 7.9.4: Levi graphs for (a) the Fano plane and (b) the Menger graph. 


E23: Voltage graph constructions are useful in forming surface models of geometries 
having Menger graphs K,,(,). The imbedded Menger graph of Example E17 covers the 
voltage graph imbedding of Figure 7.9.5, which uses the voltage group B = Z3 x Zs. 


(1,0) 
(1,1) 
(0,1) 


Figure 7.9.5: An imbedded bouquet with voltages in Z3 x Z3; the covering graph is the 
Menger graph for the Pappus geometry. 


References 


BrRy49] R. H. Bruck and H. J. Ryser, The nonexistence of certain finite projective 
planes, Canad. J. Math. 1 (1949), 88-93. 


FiWh00] R. M. Figueroa-Centeno and A. T. White, Topological models for classical 
configurations, J. of Statistical Planning and Inference 86 (2000), 421-434. 


RiYo68] G. Ringel and J. W. T. Youngs, Solution of the Heawood map-coloring prob- 
lem, Proc. Nat. Acad. Sci. U.S.A. 60 (1968), 438-445. 


Section 7.9. Graphs and Finite Geometries 911 


Si66] R.R. Singleton, On minimal graphs of maximum even girth, J. Combin. Theory 
1 (1966), 306-332. 


Wh95] A. T. White, Efficient imbeddings of finite projective planes, Proc. London 
Math. Soc. (3) 70 (1995), 33-55. 


Wh01]) A. T. White, Graphs of Groups on Surfaces: Interactions and Models, North- 
Holland Mathematical Studies 188, 2001. 


Wh02] A. T. White, Modelling finite geometries on surfaces, Discrete Math. 244 (2002), 
479-493. 


912 Chapter 7. Topological Graph Theory 


Section 7.10 
Crossing Numbers 


R. Bruce Richter, University of Waterloo 
Gelasio Salazar, Universidad Auténoma de San Luis Potosi 


7.10.1 Drawings of Graphs and Crossing Numbers ...................4. 912 
7.10.2 General Techniques and Bounds ................. 0. . cece eee eee 914 
7.10.3 Crossing Numbers of Some Families of Graphs .................. 916 
7.10.4 Crossing-Critical Graphs 1.1.2.0... ceee cece eae 920 
7.10.5 Algorithmical Aspects ....... 0... cece cece eee eee ete e ene e enn enes 921 
7.10.6 Other Definitions of Crossing Number ......................00005 922 
7.10.7 Crossing Sequences ........... cece eee eee cence eee e ene teen eeeees 924 
7.10.8 Applications of Crossing Numbers ...............00:0eceeee eee ee 925 
7.10.9 Suggestions for Further Reading ................ 0. cece eee ee eee 926 
Rererencés csi ates diet nes J Bate aan nals «Mae ees ta eens as aa 926 


INTRODUCTION 


When a graph cannot be drawn in the plane without edge crossings, it is natural 
to seek plane representations of it in which the number of crossings is minimized. The 
crossing number of a graph G is the minimum number of crossings in a drawing of G 
in the plane. Before the turn of the century, most results in the field revolved around 
the estimation or calculation of the crossing number of a graph or of a family of graphs. 
In the last fifteen years or so we have witnessed a remarkable increase in the number 
of deep, general results of a structural character, giving rise to a flourishing theory of 
crossing numbers. 


7.10.1 Drawings of Graphs and Crossing Numbers 


For both theoretical and practical reasons, it is often needed or desired to generate 
a representation of a graph in some topological space. In an embedding of a graph, the 
most natural injective properties are satisfied, whereas in a drawing, some injectivity 
properties are relaxed. 
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Embeddings 


DEFINITIONS 


D1: An embedding of a graph G in a topological space X is a mapping that associates 
to the vertices of G distinct elements of X, and to each edge of G a homeomorphic image 
of [0,1], so that the endpoints of the image of an edge are precisely the elements of X 
representing the endvertices of the edge, and if the images of any two edges have a point 
in common, then this point represents a common vertex. 


EXAMPLES 


El: Every finite graph that does not contain as a subgraph a subdivision of K33 or 
Ks can be embedded into R? (Kuratowski’s Theorem). 


E2: Every countable graph can be embedded into R®, by placing the vertices on a line 
L and then, for each edge e, using a different plane P. containing L to draw e. 


Drawings 


The preceding example justifies why most research on graph representations is focused 
on 2-dimensional spaces, such as the plane or compact surfaces. More often than not, 
an input graph cannot be embedded in a given surface. Thus we relax one condition of 
an embedding, and arrive at a natural concept of a drawing. 


DEFINITIONS 


D2: A drawing of a graph G in a surface S is a mapping that assigns to the vertices 
of G distinct elements of S, and assigns to each edge a homeomorphic image of [0, 1], 
disjoint from the vertex points except that the endpoints of the image of an edge are 
precisely the elements of S representing the endvertices of the edge. 


D3: A drawing of a graph G in a surface S is a normal drawing if 
(i) the interiors of the images of any two edges have at most one intersection point 
and, moreover, if such an intersection point exists then it is a crossing (rather 
than tangential); 
(ii) no point in S belongs to the interiors of the images of three distinct edges (loosely 
speaking, no three edges cross at a common point). 


REMARKS 


R1: It is an easy exercise to show that if D is any drawing of a graph G in a surface S, 
then there is a normal drawing D’ of G in S with no more crossings than D. 


R2: We follow the common practice to make no distinction between the vertices and 
edges of a graph and the corresponding objects (points or closed arcs) in the host topo- 
logical space in an embedding or a drawing of the graph. Thus we allow ourselves, for 
instance, to use expressions such as “crossings between edges”, instead of the unneces- 
sarily precise “crossings between the closed arcs representing edges” . 
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Crossing Numbers 


More often than not, it is impossible to draw a graph on a surface without crossings of 
edges, and our attention then turns to the matter of minimizing the number of crossings 
over all drawings of the graph upon consideration. 


DEFINITION 


D4: The crossing number of a graph G in a surface S, denoted crg(G), is the 
minimum number of crossings of edges in a drawing of G in S. 


REMARKS 


R3: We are mostly concerned with the crossing number of graphs in the plane. Thus, 
for simplicity, whenever the host surface S'is the plane, we will omit the explicit reference 
to S and simply write cr(G). 


RA: The definition of crossing number given above is the most common one, but many 
variants have been considered in the literature. Although we shall be mostly concerned 
with results around this notion of crossing number, we will also state a few results on 
some of these variants (see §7.10.6). 


EXAMPLES 


E3: The complete bipartite graph K3 3 has crossing number 1. It is not difficult to 
show, using the Jordan Curve Theorem, that K33 cannot be drawn in the plane without 
crossings, and so cr(3,3) > 1. On the other hand, A3,3 can be drawn in the plane with 
exactly one crossing (see Figure 7.10.1), and so the reverse inequality cr(K3,3) < 1 also 
holds. 


Figure 7.10.1: A drawing of K3,3 with one crossing, showing cr(/3,3) < 1. 


7.10.2 General Techniques and Bounds 


There are few general results relating the crossing number of a graph to its number 
of vertices and edges. Some elementary bounds can be obtained by a trivial application 
of Euler’s Formula, although these tend to be of little use or interest. By far the most 
powerful general tool known is the Crossing Lemma, which is not only of theoretical 
interest, but also has a surprising number of connections and applications to other 
branches of mathematics (see §7.10.8). 
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The Crossing Lemma 


The following statement is perhaps the most fundamental result in the field of crossing 
numbers. It was conjectured by Erdés and Guy in [EG73], and proved by Ajtai, Chvatal, 
Newborn, and Szemerédi [ACNS82], and independently by Leighton [Le83]. Below we 
state the Crossing Lemma and some of its most interesting variants and refinements. 


FACTS 


F1: (The Crossing Lemma) There exists a positive constant c such that, for every 
graph G with n vertices and m > 4n edges, er(G) > c-m3/n?. 


The current best upper and lower bounds for the crossing constant c (roughly 0.09 
and 1/33.75) were obtained in [PT97]. 


F2: [PST00] Let g be a positive integer. There is a positive constant c := c(g) such 
that if G has girth at least 2g + 1 and m > 4n, then cr(G) > c-m?t9/n1*9, 


F3: [FPT10] For every k € N, there is a constant cz, > 0 such that in every drawing 
of a graph with n vertices and m > 3n edges, no two of which intersect in more than k 
points, there are disjoint sets of edges E,, E2, each of size at least cym?/n?, such that 
every edge in F; crosses all edges in E. 


F4: [PRTT06] Every graph G with n vertices and m edges satisfies 
er(G) > (1/31.1) -m?/n? — 1.06n 
F5: [PRTT06] Every graph G with n vertices and m > (103/6)n edges satisfies 
er(G) > (1024/31827) -m*?/n? 


Crossing Numbers, Bisection Width, and Cutwidth 


The crossing number is closely related to other important graph theoretical parameters. 


DEFINITIONS 


D5: The bisection width bw(G) of a graph G = (V,£) is the minimum number of 
edges whose removal divides G into two parts having at most 2|V|/3 vertices each. 


D6: Let G = (V,E) be a graph. For each bijection » : V > {1,2,3,...,|V|} and 
each i € {1,2,...,|V]}, let cw(,7) denote |{uv € BE: w(u) < i < w(v)}|. Then the 
cutwidth cw(G) of G is min, max; cw(2, 2). 


FACTS 


F6: [PSS96] Let G be a graph with n vertices of degree di, d2,...,d,. Then 


cr + (1/16) ode > (1/40)bw?(G). 
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F7: [DV03] Let G be a graph with n vertices of degree di, d2,...,dn. Then 
er(G) + (1/16) a4 > (1/1176)cew?(G). 
Since cw(G) > bw(G) for any graph G, this fact refines the previous fact. 


Crossing Numbers, Immersion, and Congestion 


Leighton [Le83] showed how to use immersions and congestion as a general technique 
to obtain bounds for crossing numbers. 


DEFINITIONS 


D7: Let G, = (Vi, F1) and Gg = (V2, Ex) be graphs. An immersion of G, in G2 is 
a pair (¢,w), where ¢: Vi — V9 is an injection and w associates to each edge e = wv in 
E, a path in G2 with endpoints ¢)(u) and ¢1(v). 


D8: Let G) = (Vi, £1) and G2 = (V2, E2) be graphs, and let (¢, W) be an immersion of 
G in Gp. For each e € Ep, define cg.(¢,W) = [{f € E:le € Y(f)|}. The congestion 
of the immersion (¢, 4), denoted by cg(¢, w), is maxcer, {cg.(¢, v)}. 


FACT 


F8: [Le83] Let G; = (Vi, £,) and Gp = (V2, Fy) be graphs, and let (4,1) be an 
immersion of G, to Gg with congestion cg(¢, 7). Then, 


cr(G1) _[Va]A2(Ga) 


C2) 2 Gu 2 


7.10.3. Crossing Numbers of Some Families of Graphs 


As with any graph-theoretical parameter, it is of natural interest to investigate the 
crossing number of particular (families of) graphs. Unlike most graph-theoretical pa- 
rameters, the crossing numbers of perhaps the most natural families of graphs (namely, 
the complete graphs K,, and the complete bipartite graphs Ky) are not known, except 
for a few cases. 


Complete Bipartite Graphs 


It is widely accepted that the field of crossing numbers can be traced to a question 
posed by Turan, while he worked in a labor camp during World War II [BW10]. In our 
current terminology, Turan raised the question of calculating cr(Kin,n). 


EXAMPLE 


E4: The drawing of Ks ¢ in Figure 7.10.2 has 24 crossings. It is straightforward to gen- 
eralize this to a drawing of Ky,» with Z(m,n) := |m/2]||(m — 1)/2||n/2] | (n — 1)/2] 
crossings, for any m and n. These drawings are attributed to Zarankiewicz [Za54]. 
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Figure 7.10.2: A drawing of K5,6 with 24 crossings. 


CONJECTURE 


In [Za54], Zarankiewicz claimed to have proved that cr(Kinn) = Z(m,n), for all 
positive integers m,n. Paul Kainen (1965) and Gerhard Ringel (1966) independently 
found a gap in Zarankiewicz’s argument (see [Gu68]). This gave rise to the following 
conjecture, one of the most important open crossing number problems to this day. 


Cl: (Zarankiewicz’s Conjecture) cr(Km,n) = Z(m,n) for all positive integers m,n. 


FACTS 


F9: For all positive integers m and n, cr(Kmn) < Z(m,n). This follows from the 
observation that there exist drawings of Kim.» with exactly Z(m,n) crossings. 


A straightforward counting argument shows the following. 


F10: (Parity Argument, [K170] ) If er(K2,-1,) = Z(2r—1, n) for some positive integers 
r,n, then cr( Korn) = Z(2r,n). 


It is not hard to prove that cr(K3n) = Z(3,n) for every n; for an elegant proof, 
see [HR90]. In [K170], Kleitman proved that cr(K5,n) = 7(5,n) for every n. In [Wo93], 
Woodall gave computer-aided proofs showing cr( 7,7) = Z(7,7) and cr(K7,9) = Z(7, 9). 
Using these facts and the Parity Argument, we have the following. 

F1l: cr(Kmn) = Z(m,n) for all m,n such that n > m, m < 6, and for (m,n) € 


3) 


{(7, 7), (7,8), (7,9), (7, 10), (8,8), (8, 9), (8, 10)}. 


The next three statements have computer-assisted proofs, bounding cr( Aim») using 


semidefinite programming techniques. 
3 cr(K7n) 
F12: [DMP limnsoo eo > 0.9687. 
RSO6] lim soo Foe > 0.9687 
F13: [DPS07] limp 5.0. 288) 0.9766 
nce Fg ay > 0.9766. 
Kon 
F14: [DPSO7] lim, yo KX?) 5 0.9669. 


Z(9,n) 
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Complete Graphs 


In [BW10], Beineke and Wilson discuss the origins of the investigation of the crossing 
number of the complete graphs K,,. It appears that the first to seriously devote time to 
produce drawings of K,, with as few crossings as possible was the British artist Anthony 
Hill, who eventually approached graph theorist Frank Harary with his findings, resulting 
in the joint paper [HH62]. 


EXAMPLE 


E5: The drawing of Kg in Figure 7.10.3 has 18 crossings. It is straightforward to gener- 
alize this to a drawing of K,, with Z(n) := (1/4)|m/2] |(m — 1)/2]|(m — 2)/2||(m — 3)/2| 
crossings, for any positive integer n. As described in [BW10], this paradigm for drawing 
Ky, was first devised by Hill. 


Figure 7.10.3: A drawing of Kg with 18 crossings. 


CONJECTURE 


C2: [HH62] cr(K,,) = Z(n), for every positive integer n. 


FACTS 


F15: For every positive integer n, cr(K,) < Z(n). This follows from the observa- 
tion (see the preceding example) that there exists a drawing of K,, with exactly Z(n) 
crossings. 


In analogy with complete bipartite graphs, a straightforward counting argument 
shows the following. 


F16: (Parity Argument) If cr(K2,_1) = Z(2r — 1) for some integer r, then cr(K2,-) = 
Z(2r). 
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F17: [EG73, PRO7] cr(K,,) = Z(n) for all n < 12. 
Fis: [DPS07| 


Other Families of Graphs 


Other families of graphs have also received considerable attention. Special interest has 
been devoted to graphs with good interconnection properties (such as the n-dimensional 
cube Q,, and the Generalized Petersen Graphs P(n,k)), and to the Cartesian products 
of some families of graphs. 


FACTS 
In the next statement, the upper bound was conjectured in [EG73] to be the crossing 
number of Q,. The lower and upper bounds are from [SV93] and [FDSV08], respectively. 


F19: 


1 ‘ 2 5 n? +1 | n> 
— 4" — Docs n) <4" = Va 
504 (n* + 1) < cr(Q io 5 


Turning to the Generalized Petersen Graph P(n,k), it is easy to verify that P(n, 2) 
is planar if n is even or n = 3. It is also easy to prove that the crossing number of 
P(5, 2) (the Petersen graph) is 2. Few exact results are known. 


F20: [EHK81] For every odd integer n > 7, cr(P(n, 2)) = 3. 


F21: [RS02] For every k > 3, cr(P(3k + h,3)) equals k + h if h = 0 or 2, and equals 
k+3 if h=1. The sole exception is P(9,3), whose crossing number is 2. 


For k < 3, it is readily checked that cr(P(3k,k)) is planar. By the previous fact, 
we have cr(P(3k,k)) = 2 for k = 3. The following statement, proved by Fiorini and 
Gauci [FG03], gives cr(P(3k, k)) for all other values of k. 


F22: cr(P(3k,k)) =k, for every k > 4. 


Substantial effort has been devoted to computing the crossing numbers of the Carte- 
sian products C,,0 C,,. In 1972, Harary, Kainen, and Schwenk observed that C,,0 Cy, 
can be drawn in the plane with (m— 2)n crossings, and conjectured that cr(C,,0C,) = 
(m — 2)n, for all integers m,n such that n > m > 2. This conjecture has been verified 
for small values of m (for every n). The next statement collects results from [RB78, 
BR80, DR95, KRS96, RSO1, ARO7]. 


F23: cr(C,,0C,) = (m — 2)n for all m,n such that n > m and m < 7. 
F24: [GS04] cr(C,,0C,,) = (m — 2)n for all m,n such that n > m(m + 1). 


The crossing number of the Cartesian product of the m-star S,, and the n-path P, 
was conjectured by Jendrol’ and Séerbova [JS82], and succinctly proved by Bokal [Bo07], 
using his novel “zip product” technique. 


F25: cr(S,,0P,) = (n—1)|2|| +], for all positive integers m,n. 


The crossing numbers of the Cartesian products of other graphs of small order with 
paths, stars, or cycles have also been investigated. Good collections of results in this 
direction are given in [K101] and [DK11]. 


920 Chapter 7. Topological Graph Theory 


7.10.4 Crossing-Critical Graphs 


As with other graph theoretical parameters, it is of natural interest to investigate 
those graphs that are critical, in the sense that every subgraph of the given graph has 
a smaller value of this parameter. 


DEFINITION 


D9: For a positive integer k, a graph G is k-crossing-critical (or simply k-critical) 
if cr(G) > k and every proper subgraph H of G satisfies cr(H) < k, and G is homeo- 
morphically minimal with this property. 


FACTS 


F26: The only 1-critical graphs are K33 and Ks. This is an immediate consequence 
of Kuratowski’s Theorem. 


A full classification of 2-critical graphs is a disproportionately harder task, as it 
has been known for some time that there are infinitely many 2-critical graphs. On the 
positive side, we have the following result. 


F27: [Ri88] There are exactly eight cubic 2-critical graphs. 


It is interesting to note that there exist graphs that are k-critical for some numbers 
k < cr(G), such as C30) C3, which is 2-crossing-critical but has crossing number 3. On 
the other hand, the following result from [RT95] shows that k-critical graphs cannot 
have an arbitrarily large crossing number. 


F28: If G is k-critical, then cr(G) < 3k + 16. 
As an immediate consequence of this, one obtains the following. 


F29: Every graph G with cr(G) = k has an edge e with cr(G \ e) > 2er(G) — 2. 

In [RT95], Richter and Thomassen conjectured that this last result could be remark- 
ably strenghtened, asking if there is a positive constant c such that every graph G with 
cr(G) = k has an edge e with cr(G \ e) > k— Vk. Fox and Cs. Téth proved a stronger 
version of this conjecture [FT06], showing it true, in particular, for dense graphs. Later 
Cerny, Kynél, and G. Téth [CKT08] further refined this result and proved the following. 


F30: For every €,y > 0, there is an n,.,y such that every graph G with n > n.y 
vertices and m > n'** edges has a subgraph G’ with m’ edges such that 


and 


In a result of a more structural character, Geelen, Richter, and Salazar [GRS02] proved 
that k-critical graphs have bounded tree-width (see §2.4.1). This was then strengthened 
by the next result, due to Hlinény [H103]. 
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F31: Let k be any positive integer, and let G be a k-critical graph. Then G has 
path-width (see §2.4.1) at most 6(72 log k + 248)k?. 


It was conjectured in [RS09] that this last statement could be further refined, to show 
that k-critical graphs have bounded band-width. Since graphs with bounded band- 
width obviously have bounded degree, this conjecture got disproved by the following 
result by Dvoraék and Mohar [DM10]. 


F32: For every k > 171 and every positive integer d, there exists a k-critical graph 
containing a vertex of degree at least d. 


This last statement implies that, for any positive integer d, there exist k-critical graphs 
(for & > 171) containing a subgraph isomorphic to Ky,g. Following on this theme, the 
next result was proved in [HS10]. 


F33: Let k,n be positive integers with n > 30k? + 200k. Then no k-critical graph 
contains a subdivision of K2 pn. 


The construction of critical graphs with prescribed properties has been a driving force 
behind much of the research on k-critical graphs. Particularly interesting examples of 
infinite families of critical graphs are the (by now) classical construction reported by 
Kochol in [Ko87] and the paradigm-shifting constructions given by Hlinény in [H108]. As 
an example of the knowledge gained in the quest of understanding the richness of critical 
graphs, we finally put forward the following mutually complementary statements. 


F34: [Bol0] Let r € (3,6) be a rational number and k an integer. There exists a 
convex continuous function f : (3,6) +7 such that, for k > f(r), there exists an infinite 
family of simple 3-connected crossing-critical graphs with average degree r and crossing 
number k:. 


F35: [HST12] For each fixed positive integer k, there are only finitely many k-crossing- 
critical simple graphs of average degree at least six. 


7.10.5 Algorithmical Aspects 


It is hardly surprising that computing crossing numbers is NP-hard. Until recently, 
very little was known regarding nontrivial conditions under which the crossing number 
can be approximated, and about restricted families of graphs computing the crossing 
numbers remains hard. In the last few years we have seen substantial progress on these 
fronts, as revealed by the collection of facts gathered below. 


DEFINITIONS 
D10: A graph G is a near-planar graph if it has an edge e such that G'\e is planar. 


D11: A graph G is an apex graph if it has a vertex v such that G \ v is planar. 


TERMINOLOGY: CROSSINGNUMBER is the decision problem: “Given a graph G and an 
integer k, is cr(G) < k?”. 
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FACTS 
F36: [GJ83] CROSSINGNUMBER is NP-complete. 


The next two statements indicate that computing the crossing number remains hard 
even for seriously restricted families of graphs. 


F37: [H106, PSS11] CRossINGNUMBER is NP-complete, even when restricted to cubic 
graphs. 


F38: [CM10] CROSSINGNUMBER is NP-complete, even when restricted to near-planar 
graphs. 


Grohe [Gr04] was the first to show that computing the crossing number is fixed- 
parameter tractable. His quadratic-time algorithm was later improved. 


F39: [KRO7] For each fixed integer k, there is a linear-time algorithm that determines 
whether cr(G) < k. 


F40: [GHLS08] There is a polynomial-time constant factor approximation algorithm 
for the crossing number of bounded-degree graphs embeddable in the projective plane. 


F41: [HS07, CH10] Let S be any fixed orientable surface. There is a polynomial-time 
constant factor approximation algorithm for the crossing number of bounded-degree 
graphs embeddable in S. 


F42: [CHM12] There is a polynomial-time constant factor approximation algorithm 
for the crossing number of bounded-degree apex graphs. 


F43: ([Ch11] There is a polynomial time algorithm that approximates the cross- 
ing number of a graph with n vertices and maximum degree A within a factor of 
O(n9/! poly(A log n)). 


F44:  [Cal2] There is a constant co > 1 such that, if P 4 NP, then there is no co- 
approximation algorithm for the crossing number, even when restricted to cubic graphs. 


7.10.6 Other Definitions of Crossing Number 


While the definition of crossing number we have worked with is the one that has 
attracted most research, there are many more (both reasonable and interesting) ways 
to define the crossing number of a graph. We shall briefly review below a few alter- 
native definitions of crossing number (closely following Székely’s view in [Sz04]; see 
also [PT00a]), along with some interesting results that involve them. For an author- 
itative and comprehensive treatise on the different definitions of crossing number, we 
strongly recommend the survey by Schaeffer [Sc12]. 


DEFINITIONS 


To put the definition of crossing number we have used in this chapter into the right 
context against other variants, we start by presenting this definition in a slightly different 
way. In the first four definitions below, D is a drawing of a graph G. 
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D12: The standard crossing number cr(D) of D is the number of pairs (x, {a, 3}), 
where x is a point of the plane and a and £ are open arcs of D representing distinct 
edges of G such that x € aN 8. The standard crossing number cr(G) of G is the 
minimum cr(D) taken over all drawings of G in the plane. 


D13: The pair crossing number pcr(D) of D is the number of pairs of open arcs a 
and @ of D representing distinct edges of G such that aN 8 #0. The pair crossing 
number pcr(G) of G is the minimum per(D) taken over all drawings of G in the plane. 


D14: The odd crossing number ocr(D) of D is the number of pairs a and £ of open 
arcs of D representing distinct edges of G such that |aM (| is odd. The odd crossing 
number ocr(G) of G is the minimum ocr(D) taken over all drawings of G in the plane. 


D15: The independent odd crossing number iocr(D) of D is the number of pairs 
a and £ of open arcs of D representing distinct edges of G that are not incident with a 
common vertex and such that |aM 6| is odd. The independent odd crossing number 
iocr(G) of G is the minimum iocr(D) taken over all drawings of G in the plane. 


D16: The minor crossing number mcr(G) is min{cr(H) | G is a minor of H}. 


D17: A drawing of a graph is rectilinear (or geometric) if all the edges are straight 
segments. The rectilinear crossing number T%(G) of a graph G is the minimum 
cr(D) taken over all rectilinear drawings of G in the plane. 


FACTS 
We start with some straightforward observations. 
F45: For every graph G, iocr(G) < ocr(G) < per(G) < er(G) < G&G). 
Tutte introduced iocr(G) and proved the following two facts in [Tu70]. 
F46: Ifa graph G satisfies iocr(G) = 0, then cr(G) = 0. 
F47: If D,D’ are two drawings of a graph G, then iocr(D) = iocr(D’) (mod 2). 


Also in [Tu70], Tutte asked whether iocr(G) = cr(G) for all graphs G. Pach and 
Toth [PTOO] raised the (still open) important question of whether or not, for all graphs 
G, one has per(G) = cr(G). They proved the following (see also [PSS07]). 


F48: cr(G) < 2(ocr(G))?. 


Pelsmajer, Schaefer, and Stefankovit [PSS08] gave an elegant construction proving 
the existence of graphs G for which ocr(G) < pcr(G) (and hence ocr(G) < cr(G)), 
which in particular answers Tutte’s question in the negative. The currently best pro- 
portionality constant separating pcr(G) and ocr(G) (for some graph G) was derived by 
Toth. 


F49: [To08] There exist graphs G such that 0.855 - per(G) > ocr(G). 


One strongly appealing feature of the minor crossing number (introduced in [BF M06]) 
is that it is minor-monotone, so that it blends well with the graph minors theory of 
Robertson and Seymour. A strong result involving this parameter is the following up- 
per bound by Bokal, Fijavz, and Wood [BF W08]. 
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F50: For every graph H there is a constant c = c(H) such that every H-minor-free 
graph G has mer(G) < c-|V(G)|. 


We remark that for the standard crossing number this last fact does not hold. Indeed, 
in the following related result, the dependence on |V(G)| and A(G) is best possible. 


F51: [DKMW08] Let H be a fixed graph. Then every graph G = (V, E) that excludes 
G as a minor has crossing number O(A(G) - |V(G)]). 


We also note that the general techniques and bounds in 7.10.2 were carried over to 
the minor crossing number in [BCSV10]. 

We finally mention three results involving the rectilinear crossing number, only 
scratching the surface of this important parameter that falls into the realm of com- 
binatorial geometry. The next fact was proved by Bienstock and Dean [BD93]. 


F52: For each integer k > 4, there is a graph G;, with cr(G) = 4 and @&(G) > k. 


Scheinerman and Wilf [SW94] discovered the following striking connection between 
cr(K,,) and Sylvester’s Four Point Constant g, from geometric probability [Sy84]. 


F53: 


Thus, the rectilinear crossing number of K,, is of interest elsewhere in mathematics. 
The exact value of (A) is known for n < 27 and for n = 30 [CHLV11, ACFLS12)]. It 
is also interesting that in general it differs from cr(K,). 


F54: [EG73] cr(K,) = @(K,,) if and only if n < 7 or n = 9. For all other values of 
n, cr(K,,) > @(K,). 


F55: [ACFLS10, ACFLS12] 


(Ky) 
nt 


0.37997 < lim < 0.38048. 


7.10.7 Crossing Sequences 


Any finite graph can be embedded in some (orientable or non-orientable) finite genus 
surface. Sirai [Si83] investigated how the crossing number of a graph increases as we 
move to host surfaces with smaller genus, until finally arriving at the sphere. Sirdii’s 
work is the first in a series of papers with particularly interesting and surprising results. 


NOTATION: The compact orientable surface of genus h will be denoted 5S), and the 
compact non-orientable surface of genus k will be denoted Nx. 


DEFINITIONS 
D18: The orientable crossing sequence of G is crgs,(G),crg,(G), crs,(G).... 


D19: The non-orientable crossing sequence of G is crs,(G), crn, (G), crn, (G).... 
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D20: A sequence ag, a1, a2... is a conver sequence if aj41 — ai42 < a; — aj41 for 
each 7. 


REMARKS 


R5: It is trivial to see that the (orientable or non-orientable) crossing sequence of 
every graph is non-increasing. Moreover, if crs,(G) 4 0 (that is, if G is nonplanar), 
then the sequence is strictly decreasing until we arrive at a (orientable or non-orientable) 
surface S' for which crg(G) = 0. 


FACTS 

F56: [Si83] If ao,ai,...,@,% is a convex sequence of integers such that ap > a, > 
ag > -:+: > ay = 0, then there is a graph whose orientable crossing sequence is 
(ao, 41,---,@%,0,0,...). There is also a graph whose non-orientable crossing sequence 
is (ao, @1,.-.,@%,0,0,...). 


F57: [{ABSO1] There exist graphs whose orientable crossing sequence is not convex. 


F58: [ABS01] For all positive integers ag,a1 such that apo > ai, there exists a graph 
whose non-orientable crossing sequence is (ag, @1,0,0,...). 


F59: [DMS11] For all positive integers ap, a1 such that ap > aj, there exists a graph 
whose non-orientable crossing sequence is (ao, a1,0,0,...). 


7.10.8 Applications of Crossing Numbers 


In a breakthrough paper, Székely introduced the “crossing number method” to give 
simple proofs of “hard” problems in combinatorial geometry, more specifically of inci- 
dence problems. At its heart it uses the Crossing Lemma for lower bounds. Below we 
offer a small sample of results that either use Székely’s method as a crucial ingredient 
or make essential use of its consequent incidence results. 


FACTS 


F60: ((Sz97]; first proved in [ST83]): Given n points and @ straight lines in the plane, 
the number of incidences among the points and lines is O((né)?/° + n + &). 


TERMINOLOGY: A class I of curves in the plane has k degrees of freedom and multiplicity- 
type s if (i) for any & points there are at most s curves of T' passing through all of them; 
and (ii) any pair of curves from IT intersect in at most s points. If P is a finite point set 
and C is a finite set of curves, then I(P,C) denotes the number of incidences between 
them. 


F61: [PS98] Let P be a set of m points and let C be a set of n simples curves all lying 
in the plane. If C has & degrees of freedom and multiplicity-type s, then 


I(P, C) = c(k, s) (mriee—0 ycar-ayron—9 smn), 


where c(k, s) is a positive constant that depends on k and s. 
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F62: [De98] The number of (k + 1)-sets that are possible with n points in ? is at most 
6.48n(k + 2)1/3, 


F63: Given n points and n families of concentric circles each with at most k circles in 
the plane, the maximal number I(n, k) of incidences between the points and the circles is 
O(n!°/7k5/7) [ATT98]. Moreover [STT02], I(n, k) is O(nt-4571 49-6286) (the latter bound 
is better than the former when n is large enough compared to k). 


We finally mention that Solymosi and Tao recently used the crossing number method 
to establish near-sharp Szemerédi—Trotter type bounds on the number of incidences 
between points and k-dimensional algebraic variaties in ¢ for various values of k and d. 
The precise result is too technical to state here; we refer the reader to [ST 12]. 


7.10.9 Suggestions for Further Reading 


Pach and Toth have a stimulating paper [PT00a] centered on crossing number prob- 
lems. Many remain open and we expect they will continue to serve as a guide to 
important advances in the theory. 

We highly recommend Beineke and Wilson’s lively account [BW10] of the early 
history of crossing numbers. 

The current authors have an earlier survey on crossing numbers [RS09]. While there 
is substantial overlap with this chapter, there are significant differences and we suggest 
the reader look there for supplementary material. 

We also recommend the survey by Mutzel [Mu09], which devotes a section to an 
overview of the computer-assisted exact computation of crossing numbers, a topic not 
included in the present survey. 

The survey by Schaeffer [Sc12] mentioned in Section 7.10.6 contains not only many 
variants of what a crossing number is, but also a very interesting, thoughtful discussion. 
We also mention the exhaustive (and useful) bibliography compiled and continually 
updated by Imrich Vrt’o [Vr]. 

The crossing number is only one parameter that measures the nonplanarity of a 
graph. Although slightly outdated, the survey [Li01] is still an excellent reference on 
many other nonplanarity parameters. 
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Glossary for Chapter 7 


amalgamation — of two graphs G and H: forming a new graph from their disjoint 
union by merging a subgraph of G with an isomorphic subgraph of H. 
___, edge: pasting two graphs together across an edge from each. 
__, vertex: pasting two graphs together across a vertex from each. 
annulus: synonym for cylinder. 
apex graph: a graph G in which there is a vertex v such that removing v from G 
results in a planar graph. 
Archimedean solid: a semi-regular polyhedron; it has regular polygons as faces and 
the same configuration of faces at each vertex. 
automorphism of a map: an isomorphism of the map onto itself. 
axiom system: a list of axioms for a mathematical structure. 
___, complete: an axiom system in which every well-formed statement can either be 
shown to be true or be shown to be false. 
___, consistent: an axiom system having no contradictions. 
___, independent: a system in which no axiom is derivable from the others. 
___, model for: an interpretation of the undefined terms so that each interpreted 
axiom is true. 
balanced incomplete block design (abbr. BIBD): a geometry of v points (each 
in r lines) and 6 lines (each containing k points) such that each pair of points belong 
to 2 lines. 
bar-amalgamation — of two disjoint graphs G and H: the result of running a new 
edge e between a vertex u of G and a vertex v of H; denoted G,, *- Hy. 
barycenter — of a face of a cellular graph imbedding: a point in the interior of the 
face, corresponding to the image of the center of the geometric polygon that the face 
represents. 
base graph — of a voltage graph construction: the graph to whose edges the voltages 
are assigned. 
base imbedding for an imbedded voltage graph (G + S,a): the imbedding (G > S) 
of the base graph in the base surface. 
base surface for an imbedded voltage graph (G — S,a): the surface S in which the 
voltage graph is imbedded. 
Belyi function: a meromorphic function from a (closed) Riemann surface S' onto the 
Riemann sphere So, with at most three singular values forming a subset of {0, 1, oo}. 
Betti number (£(G) — of a graph G: the number of edges in a co-tree, which is equal 
to |Eg| —|Ve| +1. 
BIBD: see balanced incomplete block design. 
bisection width bw(G) of a graph G = (V, E): the minimum number of edges whose 
removal divides G into two parts having at most 2|V|/3 vertices each. 
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boundary — of a 2-manifold M: the subspace of those points in M that do not have 
neighborhoods homeomorphic to open disks; instead, their fundamental neighbor- 
hoods are homeomorphic to half-disks. 

boundary-separating closed curve — in a region of a noncellular graph imbedding: 
a closed curve that separates that region so that at least one boundary component 
of the region lies on each side of the separation. 

boundary-walk specification — of a polygonal complex: a list of the signed boundary 
walks of the faces. 

bouquet B,,: the graph with one vertex and n self-loops. 

branch point — in the codomain S of a branched covering p : S + S$: a point where 
branching occurs. 

branch set — of a branched covering p : § — S: the discrete subset B C S to whose 
complement the restriction of the branched covering p is a covering projection. 

branched covering, combinatorial — of a surface S with a cellularly imbedded graph 
G by a surface S with a cellularly imbedded graph Gia face-to-face, edge-to-edge, 
vertex-to-vertex mapping that is topologically a branched covering with every branch 
point occurring in the interior of some face; exemplified by the natural projection 
associated with an imbedded voltage graph. 

branched covering, topological: a continuous function p : § > S between surfaces, 
whose restriction to the complement $ — B of a discrete subset BC S is a covering 
projection. 

branched covering space: the domain of a branched covering p : S — S of a surface 
S. 

cactus: a graph constructed from a tree T and a subset S C Vr by replacing each 
vertex of S' by a cycle. 

(d, g)-cage: a graph of minimum order among all d-regular graphs of girth g. 

canonical factors for an abelian group: the factors Z,,, of the canonical form. 

canonical form for an abelian group: the form Z,, x --- X Zm,, where m;|mj41 
for j =1...r. 

Cayley graph — for a group A and generating set X: the graph whose vertices are 
the elements of A and such that, for each element a € A and each generator x € X, 
there is a directed edge from a to az. 

Cayley map: an imbedding of a Cayley graph on a surface, possibly specified by a 
rotation scheme. 

0-cell — of a polygonal complex: see vertex of a polygonal complex. 

1-cell — of a polygonal complex: see edge of a polygonal complez. 

2-cell — of a polygonal complex: see face of a polygonal complex. 

2-cell imbedding — of a graph: see cellular imbedding. 

cellular imbedding — of a graph into a surface: a graph imbedding such that the 
interior of each face is an open disk. 

centroid of a triangle: the point common to the three medians of the triangle. 

cevian of a triangle: a line from a vertex of the triangle to the opposite side. 

chiral map: a map that is symmetrical, but not regular. 

chromatic number of a surface S: the least number of colors sufficient to properly 
color the faces (or vertices) of any map on S. 

circular imbedding: an imbedding where each face is bounded by a simple cycle. 

circumcenter of a triangle: the point common to the three perpendicular bisectors 
of the sides of the triangle. 

closed surface: a surface that as a topological space is compact and without boundary. 
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closed under minors — of a graph class: a graph class C such that for every graph G 
in C, all minors of G are also in C. 

closed-end ladder L,,: the graph obtained from the cartesian product P, x K2 by 
doubling the edges v; x Kz and v, x Kz at both ends of the path. 

cobblestone path J,,: the graph obtained by doubling every edge of the path P,. 

complete set of forbidden minors — for a class F closed under minors: a set M of 
minimal forbidden minors such that for every graph G that is not in F, there exists 
a graph in M that is a minor of G. 

composition of a graph G with a graph H: the graph obtained from copies of H 
corresponding to all vertices of G, by adding all possible edges between two copies 
corresponding to an adjacency of G. 

(r, k)-configuration: a geometry having every point in r lines, every line consisting of 
k points, and each pair of points in at most one line. 

__, symmetrical: an (r, k)-configuration such that r = k. 

congestion at an edge e of the host of a graph mapping f : G — H: the cardinal- 
ity |f~*(e)| of its preimage; terminology for modeling the emulation of distributed 
computation. 

congestion of the mapping f : G — H: the maximum congestion on any edge, 
taken over all edges of H; terminology for modeling the emulation of distributed 
computation. 

congruent imbeddings: two imbeddings f,, fo: G— S with hf; = foo for some 
surface homeomorphism h: S — S$ and some graph automorphism 0: G > G. 

connected sum — of two surfaces S and S’: a surface obtained by excising the interior 
of a closed disk in each surface and then gluing the corresponding boundary curves; 
denoted by S#S’. 

consistent orientation — of a polygonal complex: orientation of the faces such that, 
within a union of oriented boundary walks, none of the edges is traversed twice in 
the same direction. 

contractible closed curve — on a surface S: a simple closed curve C' on S, such that 
the closure of one of the components of S'— C is a disk. 

contractible edge — in a triangulation: an edge whose contraction does not create a 
multiple adjacency. 

contractible to the triangulation G — S: said of a triangulation that can be 
transformed into the given triangulation G — S by a sequence of edge contractions. 

convolution — of two sequences (a;) and (b;): a combinatorial operation that produces 
a sequence (cz), with cz, = agby + aybp_1 +--+ + axbo. 

co-tree: the edge complement of a spanning tree of a graph. 

covering or covering projection, combinatorial, — of a graph G by a graph G: an 
edge-to-edge, vertex-to-vertex mapping that is topologically a covering; exemplified 
by the natural projection associated with a voltage graph. 

covering or covering projection, combinatorial, — of a surface S with a cellularly 
imbedded graph G by a surface S with a cellularly imbedded graph G: a face-to-face, 
edge-to-edge, vertex-to-vertex mapping that is topologically a covering; exemplified 
by the natural projection associated with an imbedded voltage graph. 

covering or covering projection, topological: a continuous function p : X > X 
between locally arcwise connected topological spaces, in which every point of the 
codomain X has an open neighborhood U such that each arc-component of p~'(U) 
is mapped homeomorphically onto U by p. 

__, k-fold: a covering projection that maps & to 1. 
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__, regular — onto a space X: a covering projection onto X, such that there exists a 
group of covering transformations that acts freely and transitively on it. 
co-tree — the edge-complement of a tree. 
covering space: the domain of a covering projection p : ee a 
___, regular — of a space X: the domain of a regular covering projection onto X. 
covering transformation — for a covering projection p : X — X: an autohomeomor- 
phism h on X such that ph=p. 
covering with folds: a natural projection from the composition of a graph G with 
Km to the graph G. 
Coxeter complex: the barycentric subdivision of the tessellation {p,q}, formed by all 
mirrors of reflection symmetries. 
Coxeter group (of rank 3): a group with presentation by three generators po, (1, p2 
and the relations pj = pj = p3 = (pop1)? = (p1p2)? = (p2/0)" = 1. 
k-crossing-critical graph: a graph G with cr(G) > k such that every proper subgraph 
HT of G satisfies cr(H) < k, and such that G is homeomorphically minimal with this 
property. 
crosscap distribution — of a graph G: the sequence whose j2” entry is oF (G), starting 
with a (possibly empty) subsequence of zeroes, followed by the subsequence of the 
crosscap imbedding numbers, and then an infinite sequence of zeroes. 


__, polynomial — of a graph G: the polynomial Ig(y) = S07; (G)y’. 
j=l 


crosscap — on a surface: a subspace of the surface that is homeomorphic to a Mobius 
band. 
___, number - of a closed nonorientable surface S or of the sphere: the integer & such 
that S is homeomorphic to Nz; denoted 4(S); 0 for the sphere. 
crosscap number — of a graph: the minimum crosscap number of a surface in which 
the graph is imbeddable. 
__, range — of a graph G: the integer interval [F,,:.(G), Imax (@)]- 
__, symmetric of a group A: the smallest number c such that A has a Cayley graph 
imbedded in a nonorientable surface of crosscap number c. 
crossing number of a graph G in a surface S$, denoted crg(G): the minimum number 
of crossings of edges in a drawing of G in S. 
__, pair crossing number of a drawing D: the number pcr(D) of pairs of open arcs 
a and 8 of D representing distinct edges of G such that aN 8 4 @. 
__, pair crossing number of a graph: the minimum pcr(D) taken over all drawings 
of G in the plane. 
odd crossing number of a drawing D: the number ocr(D) of pairs a and 6 of 
open arcs of D representing distinct edges of G such that |aN {| is odd. 
odd crossing number of a graph G: the minimum ocr(D) taken over all draw- 
ings of G in the plane. 
minor crossing number mcr(G): the minimum cr(H) such that G is a minor 
of H. 
rectilinear crossing number ¢r(G) of a graph G: minimum cr(D) taken over 
all rectilinear drawings of G in the plane. 
crystallization of a map M: a quadruple (F;70,7,72) that encodes a map on any 
closed surface. In the quadruple, F' is the set of flags of M and, for 7 € {0,1, 2}, 7; 
is a fixed-point-free involution of F' transposing two incident flags sharing an edge 
of M**" colored i. The group (79,71, 72) is transitive on F’, (7279)? = 1, and 7279 is 
fixed-point-free. 
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current — on a directed edge e: the value a(e) assigned to edge e by a current assignment 
in a group called the current group. 

__, assignment — on a digraph G = (V,E) imbedded in a surface S: a function 
a from edge set E to a group 6; used to specify a derived digraph; it must be 
stated explicitly that the algebraic values on the edges are currents, rather than 
voltages. 

___, group: the group in which a current assignment takes its values; usually a finite 
group. 

current graph: a pair (G — S,a) comprising an imbedded digraph and a current 
assignment, it specifies a graph imbedding. 

cycle rank, — of a connected graph G = (V,£): the number |E| — |V| + 1, which is 
the number of edges in a co-tree of a spanning graph; denoted 3(G) (for Betti, an 
Italian mathematician). 

cycle rank: — of a possibly non-connected graph G = (V, E): the number |E| —|V| +c, 
where c is the number of components. 

cylinder: a surface homeomorphic to the unit cylinder in R°, i.e., to 

{(2,y,z) |e? +y? =1,0<2<1} 

dart: a directed edge. 

deficiency of a branch point — of order r in a regular n-sheeted branched covering: 
the number n — n/r. 

deficiency €(G) of a graph G: the minimum value of €(G,T), over all spanning trees 
T of G. 

deficiency €(G,T) of a spanning tree T — in a graph G: the number of odd compo- 
nents of the co-tree G — T. 

delta-Y transformation — in the theory of triangulations: see AY -transformation 
under transformation. 

dense imbedding: an imbedding of “large” face-width, where “large” is relative to 
the context. 

derived digraph — specified by a voltage graph (G = (V, E),a): the covering graph 
G® associated with the specified type of voltages, i.e., permutation or regular. 

derived digraph, »,,-permutation — for a voltage assignment a: E(G) + Sp: the 
digraph G° = (V°%, E®), with V* = V(G) x {1,...,n} and E* = E(G) x {1,..., nr}; 
if the edge e joins vertex u to vertex v in G, then the edge e; = (e, 7) joins vertex 
uj = (u,j) to the vertex vgj) = (v, a(J)). 

derived digraph, regular - for a voltage assignment a : E(G) — 8B: the digraph 
G° = (V°, E®), with V® = V(G) x B and E® = E(G) x B; if the edge e joins vertex 
u to vertex v in G, then the edge e; = (e, 7) joins vertex u; = (u,b) to the vertex 
Uba(e) = (v, ba(e)). 

derived graph: formally, the result of deleting directions from a derived digraph; 
informally, the derived digraph itself is also called a derived graph. 

derived imbedding, — for an imbedded voltage graph: the imbedding G* — S® of the 
derived graph into the derived surface; constructed as described in §7.4. 

derived imbedding, — for a current graph: the derived imbedding for the voltage graph 
of which it is the dual. 

derived surface for an imbedded voltage graph (G > S,a): the cellular 2-complex S® 
that results from fitting to each closed walk in the set Q of lifted boundary walks in 
the derived graph G® a polygonal region (whose number of sides equals the length 
of that closed walk). 

dessin d’enfant: a bipartite map with a fixed bipartition (coloring) of its vertices. 
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diagonal flip — in a triangulation: to switch the diagonal in a quadrilateral formed by 

two faces that meet on an edge (i.e., that edge is the “diagonal” ). 

__, P-equivalent under: two P-triangulations that can be transformed into each 
other by a sequence of P-preserving diagonal flips. 

__, P-preserving: a diagonal flip that preserves a specified property P. 

digon — in a graph imbedding: a two-sided face. 

disk — closed, open: a topological space homeomorphic, respectively, to the closed or 

to the open unit disk. 

drawing of a graph G in a surface S$: a mapping that assigns to the vertices of G 

distinct elements of S, and assigns to each edge a homeomorphic image of [0, 1], 
disjoint from the vertex points, except that the endpoints of the image of an edge 
are precisely the elements of S representing the endpoints of the edge. 

__, rectilinear: a drawing in which all the edges are straight segments. 

dual, topological: a concept due to Poincaré associated with an involutory mathe- 

matical property. 

__, graph — for a cellularly imbedded graph G in any closed surface S: the graph 
G* whose vertices are the barycenters of the faces of the imbedding G > S, such 
that through each edge e € E(G) there is an edge that joins the dual vertex in 
the region on one side of the edge to the dual vertex on the other side (a self-loop 
if a face meets itself on edge e). 

___, map — of a graph imbedding G — S: the map corresponding to the dual imbed- 

ding. 

, imbedding — of a graph imbedding G — S: the imbedding G* — S obtained 
while constructing the dual graph. 

, of a current graph (G = (V, FE) > S,a: FE — B): the imbedded voltage graph 
whose base imbedding G* — S, is dual to the imbedding G > S (which involves 
reversing the orientation from the primal imbedding surface, if S is orientable), 
such that for each primal directed edge e € E, the dual edge e* has voltage 
a*(e*) = a(e). 

ear: a path attached at its end-vertices to a graph; the name was inspired by some 

drawings in which such paths had the shape of human ears. 

ear decomposition — of a graph G: a partition of the edge set of G into an ordered 

collection Po, P,,---, P,, such that Po is a simple cycle and P;, 7 > 1, is a path with 
only its endpoints in common with Pp +---+ Pj_1. 

edge contraction, — in a graph: removing the edge e then identifying the two vertices 

u and v; topologically, the edge is shrunk homotopically to a point. 

edge contraction — in a triangulation: topologically shrinking an edge, and then 

excising the two degenerate faces (digons) that result. 

edge-suppression — of an edge of a cubic map: removal of an edge e incident with two 

faces and smoothing the end-vertices of e, thereby producing a smaller cubic map. 
edge-width — ew(G) of an imbedded graph G: the length of the shortest cycle in the 
graph that is non-contractible in the surface. 
elementary subdivision — of an edge: the operation of replacing the edge by a path 
of length two; a special case of the PL-topological concept of barycentric subdivision. 

emulation of distributed computation: porting a distributed algorithm from the 
parallel computer (guest) for which it is designed to a computer (host) with a different 
parallel architecture; modeled by a graph mapping from a graph model for the guest 
to a graph model for the host. 
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essential curve — on a surface: a simple closed curve that is not contractible on the 
surface; that is, either it does not separate the surface, or if it separates, then neither 
side of the separation is a disk. 
essential cycle — in an imbedded graph: a cycle that bounds no cellular region of the 
imbedding surface. 
Euclidean space group: a group of isometries of the Euclidean plane. 
euler characteristic, — of a closed surface S: the integer .(S) defined by y(S) = 2—2g 
if S is homeomorphic to g-torus S,, and x(S) = 2 —k, if S is homeomorphic to a 
non-orientable surface with k crosscaps Nz. 
euler characteristic — of a cellular imbedding of a graph G = (V, FE): the alternating 
sum |V| —|£| +|F|, where F is the set of faces. 
euler genus 7(S) — of a surface S: twice the number of handles if S is orientable, and 
the number of crosscaps if S is non-orientable. 
euler genus of a group A: the minimum number d such that the group A has a 
Cayley graph that imbeds in a surface of Euler genus d. 
___, symmetric of a group A: the smallest number d such that the group A has a 
Cayley graph imbeddable in a surface of Euler genus d. 
Euler (Polyhedral) Equation (sometimes “Euler Formula”) — for a cellularly imbed- 
ded graph G > S: the equation v—e+ f =2-— (S). 
face — of a graph imbedding: a component of the complement of the image of the graph. 
___, boundary of: the vertices and edges encountered while traversing the face bound- 
ary walk. 
__, boundary walk of: the closed walk that encircles the face; it may have repeated 
edges and repeated vertices. 
___, closed: the face and its boundary. 
__, open: the face without its boundary. 
face of a polygonal complex: a polygon used in the construction of the polygonal 
complex, viewed as a subspace of that complex. 
face-width fw(G — S) — of a graph imbedding G > S: the minimum value of the 
number |C'NG|, taken over all non-contractible cycles C' in the surface S; also called 
the representativity; 
fb-walk: abbreviation for face-boundary walk. 
fiber over an edge e — of a voltage graph (G = (V,E),a: E — 6): in the derived 
graph G*, the edge subset {e} x B = {e, : b € B}. 
fiber over a vertex v — of a voltage graph (G = (V,E),a: E — B): in the derived 
graph G®, the vertex subset {v} x B = {wy :b € B}. 
fiber over x — where ~ is a point in the codomain of a topological covering projection 
p: X — X: the set p71(a). 
finite geometry: a geometry whose point set is finite. 
flag: an ordered triple (Fo, F,, F2) of pairwise incident faces of a map of dimensions 0, 
1, and 2, respectively; represents a triangle in a map. 
free action — of a group on a graph: an automorphism group with no fixed vertices, 
except by the identity automorphism. 
freely acting group H of covering transformations for a covering projection p: xs 
X: a group such that no transformation except the identity has a fixed point in x 
fundamental polygon — for a closed surface S: a polygon whose edges are pairwise 
identified and pasted so that the resulting polygonal complex has only one face and 
so that it is homeomorphic to the surface S. 
f-vector — for a graph map: see vector. 
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genus distribution polynomial — of a graph: the polynomial Ig(x) = x 15(G)x). 
j=0 


genus distribution sequence — of a graph G: the sequence whose j£” entry is 7;(G), 
starting with a (possibly empty) subsequence of zeroes, followed by the subsequence 
of the orientable imbedding numbers, and then an infinite sequence of zeroes. 

genus of a surface S: the number of handles for an orientable surface (and sometimes, 
the number of crosscaps for a nonorientable surface); the surface S, has genus g; 
denoted y(S). 

genus of a graph: the smallest genus of any orientable minimum genus. 

genus imbedding — of a graph: an imbedding of the graph in a surface of smallest 
possible genus; short for minimum genus imbedding. 

genus of a group A: the minimum number g such that A has a Cayley graph that 
imbeds in a surface of genus g. 

__, symmetric A: the smallest number g such that A has a Cayley graph symmet- 
rically imbedded in an orientable surface of genus g. 

genus range of a graph G: the integer interval [Ymin(G), Ymax(G)]. 

geometry: a pair (P,L) where P is a non-empty set and L is a non-empty collection 
of subsets of P. 

__, lines of: elements of the set L. 
___, points of: elements of the set P. 

graph-encoded map — often abbreviated GEM: a particular system for describing a 
map using colored graphs. 

(p,q,r)-group A: a group with presentation 

A= (x,y,2:2°% =y? = 2? =1, (ay)? = (yz)? = (wz)" =1) 


, proper: a (p,q,r) group such that the subgroup generated by xy and yz has 
index two. 
group action on a surface: a subgroup of the homeomorphism group of the surface. 
guest graph — for a graph mapping: the domain of the mapping; terminology used 
when modeling the emulation of distributed computation; see host graph. 
half-disk: a topological space homeomorphic to the unit half-disk. 
Halin graph: a graph formed by joining pairs of consecutive leaves (in a pre-order or 
post-order traversal) of a plane tree, so that a cycle passes through the leaves. 
hereditary property — under minors: a property such that whenever a graph has it, 
then so do all of its minors. 
homeomorphic graphs: two graphs that become isomorphic after smoothing all their 
degree-2 vertices. 
host graph for a graph mapping: the codomain; terminology used when modeling the 
emulation of distributed computation; see guest graph. 
Hurwitz group: a (2,3, 7)°-group. 
hypermap: a generalization of a graph imbedding to a hypergraph representation on 
a surface. 
hyperregion — of an imbedding of a hypergraph in a surface: a component of the 
complement of the image of the hypergraph in the surface. 
imbedded voltage graph: a pair (G > S,a), such that (G, a) is a voltage graph such 
that S' is a closed surface in which the graph G is (cellularly) imbedded. 
imbedding — of a graph: an imbedding of the topological realization of the graph. 
__, 2-cell: see cellular imbedding. 
__, cellular: an imbedding of a graph G on a surface S such that the components of 
S \ G are open disks. 
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imbedding — of a topological space: an immersion which is (globally) one-to-one. 

___, congruent imbeddings: imbeddings f;, fo : G — S for which there exists 
a homeomorphism h : S — S and a graph automorphism 0 : G > G with 
hf = foo. 

___, equivalent imbeddings: two imbeddings f;, fo: G—- S with hf; = fo for some 
homeomorphism h: S > S. 

immersion — of a topological space: a continuous mapping that is locally one-to-one. 

incenter of a triangle: the point common to the three internal angle bisectors of the 
triangle. 

interior of a contractible cycle C on a surface S: the component of S — C that is 
homeomorphic to the plane. 

irreducible triangulation of Nj, crosscap-type: an irreducible triangulation on the 
Klein bottle that splits into two triangulations on the projective plane. 

irreducible triangulation of N2, handle-type: an irreducible triangulation of the 
Klein bottle that contains no separating cycle of length 3. 

irreducible triangulation: a triangulation that has no contractible edge. 

isomorphic triangulations: two triangulations on a surface such that there is an 
auto-homeomorphism on the surface mapping one skeleton onto the other. 

isomorphism of maps: a homeomorphism of the respective surfaces that induces a 
graph isomorphism of the respective graphs. 

isotopic triangulations: two triangulations on a surface one of which can be trans- 
formed continuously on the surface into the other. 

kernel: an imbedded graph G such that for every proper minor H, there is a curve C 
with u(H,C) < u(G,C). 

Kepler—Poinsot regular star polyhedra: some self-intersecting realizations of reg- 
ular maps. 

Kirchhoff current law (KCL) — at a vertex v of a current graph: a possible condition, 
namely, that the net current at v is the group identity. 

Kirchhoff current law (KCL) — on a current graph: a possible condition, namely, 
that KVL holds at every vertex. 

Kirchhoff voltage law (KVL) — on a closed walk W in a voltage graph: a possible 
condition, namely, that the net voltage on W is the identity of the voltage group. 
Kirchhoff voltage law (KVL) — on an imbedded voltage graph: a possible condition, 

namely, that KVL holds on every face boundary walk in the graph. 

Klein bottle Nz: a closed nonorientable surface obtained by identifying the pairs 
of points {(a,y,—1),(x,—y,1)} on the two boundary components of the cylinder 
{(x,y, 2) | 2 +y? =1,-1< z < 1}; its crosscap number is 2. 

Kuratowski’s Theorem: the theorem that every non-planar graph contains a home- 
omorphic copy either of Ks or of K33. 

large edge-width imbedding: an imbedding where the edge-width is strictly larger 
than the length of the longest facial cycle. 

large-edge-width map: a map whose edge-width is greater than the number of edges 
in any face boundary. 

Levi graph of a geometry: a graph whose edges join incident point/line pairs of that 
geometry. 

LEW-imbedding: short for large edge-width imbedding. 

lift of a walk W — in a voltage graph (G,a): a walk W in the derived graph that is 
mapped isomorphically onto W by the natural projection. 

line;: an element of a geometry; esp. an affine line in a real Euclidean space. 

line: an edge of a graph. 
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link of a vertex v in a triangulation: the cycle around v through all its neighbors. 

load at a vertex v in the host of a graph mapping f : G > H: the cardinality | f~'(v)| 
of its preimage; terminology for modeling the emulation of distributed computation. 

load of the mapping f : G > H: the maximum load at a vertex, taken over all 
vertices of H; terminology for modeling the emulation of distributed computation. 

2-manifold: a topological space in which each point has a neighborhood that is home- 
omorphic either to an open disk or to a half-disk. 

map: a cellular imbedding of a graph on a surface. 

maximum crosscap imbedding — of a graph: a cellular imbedding into a closed 
nonorientable surface of maximum crosscap number. 

maximum crosscap number of a graph G ~ also called the mazimum nonorientable 
genus: the largest integer k such that the graph G has a cellular imbedding in the 
nonorientable surface N;,; denoted Fno2(G). 

maximum crosscap number ~— of a graph: the maximum of the set of integers k such 
that G has a cellular imbedding in the nonorientable surface N;; 0 if the graph is 
planar; denoted Fmax(G). 

maximum genus — of a graph G: the largest integer g such that the graph G has a 
cellular imbedding in the orientable surface S,; denoted Ymaz(G). 

maximum genus imbedding — of a graph: an imbedding of the graph into a closed 
orientable surface of maximum genus. 

medial graph — M(G) of an imbedded graph G: an imbedded graph whose vertices are 
the edges of G and whose edges join two vertices corresponding to two consecutive 
edges in a face boundary of G. 

Menger graph of a geometry: a graph whose edges join collinear points of a geom- 
etry. 

minimal forbidden minor ~ for a class ¥ of graphs closed under minors: a graph G 
that is not in F, but such that every proper minor of G is in F. 

minimal triangulation: a simplicial polyhedral map such that the contraction of any 
edge results in a map that is no longer polyhedral. 

minimum crosscap imbedding - of a graph: an imbedding into a closed nonori- 
entable surface of minimum crosscap number; an imbedding in the sphere if possible. 

minimum crosscap number of a graph G — also known as the minimum nonori- 
entable genus: the smallest integer k such; 0 if the graph is planar; denoted 7,,,;,,(@). 

minimum crosscap number ~ of a graph: the minimum of the set of integers k such 
that the graph is imbeddable in the nonorientable surface N;,; denoted Fmin(G) or 
4(G). 

minimum genus — of a graph G: the minimum integer g such that the graph G' has 
an imbedding into the orientable surface S, of genus g; denoted ymin(G) or 7(G). 

minimum genus imbedding — of a graph: an imbedding of the graph into a closed 
orientable surface of minimum possible genus. 

minor of a graph G: a graph formed from G by a sequence of edge deletions and edge 
contractions. 

minor-minimal imbedded graph: an imbedded graph such that the deletion or 
surface contraction of any edge lowers the face-width. 

minor of a map M: a map M obtained from map M by deleting and/or contracting 
edges. 

Mobius band: a surface obtained from a 2-x-2 square {(z,y)|-l <a <1,-l<y<1l} 
by pasting the vertical sides together with the matching (—1,2) > (1, —2). 

monogon ~ in a graph imbedding: a one-sided face. 
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mu-invariant u(G,C) — for a cycle C in an imbedded graph G: half the minimum 
length of a closed walk W, taken over all walks W in the radial graphs that are 
homotopic to C. 

natural action of a group A on the Cayley graph C(A, X): left multiplication by 
elements of A. 

natural projection for a voltage graph (G = (V, E),a: E > B): the graph mapping 
G° — G comprising the vertex function v, + v and the edge function e, + e. 
(Thus, the natural projection is given by “erasure of subscripts”.) 

natural projection for an imbedded voltage graph (G > S,a): the extension of the 
natural projection p : G* — G to the surfaces, so that it maps the center of each 
polygon f in the derived imbedding G* — S® to the center of the region of the 
imbedding G > S bounded by p(bd(f)). 

near-planar graph: a graph G that has an edge e such that G \ e is planar. 

near triangulation: a rooted map in which every nonroot face is a 3-gon. 

Nebesky nu-invariant v(G): an invariant used in calculating the crosscap number of 
a graph. 

neighborly polyhedral map: a polyhedral map in which every pair of distinct vertices 
is joined by an edge. 

k-nest — in an imbedding: a sequence C),...,C, of disjoint contractible cycles such 
that C; is in the interior of Cj+1. 

net current at a vertex v of a current graph: for an abelian group, the sum of the 
inflowing currents; for a non-abelian group, the product of the inflowing currents in 
the cyclic order of the rotation at v. 

net voltage on a walk in a voltage graph: for an abelian group, the sum of the 
voltages on the walk, taken in the order of traversal; for a non-abelian group, the 
product of the algebraic elements in its voltage sequence, in cyclic order (which is 
unique up to conjugacy). 

non-contractible cycle — in an imbedded graph: a cycle that is non-contractible on 
the surface. 

non-orientable 2-manifold: a 2-manifold that contains a subspace homeomorphic to 
the Mobius band. 

non-orientable surface with k crosscaps Nz: a connected sum of k copies of the 
projective plane Nj. 

non-revisiting path: a path p in the graph of a map M such that the set pM F is 
connected, for each face F of M. 

non-separating cycle — in an imbedded graph: a cycle whose removal separates the 
surface. 

orientable 2-manifold: a 2-manifold which is not non-orientable. 

orientation reversing curve — on a surface: a simple closed curve whose regular 
neighborhood is a Mobius band. 

oriented boundary walk — of a face of a graph imbedding in an oriented surface: the 
closed walk in the 1-skeleton that results from traversing the face boundary in the 
direction of orientation. 

oriented polygon: a polygon together with a direction (clockwise or counterclockwise) 
of traversal of its boundary, designated to be preferred. 

oriented polygonal complex: polygonal complex together with a consistent orienta- 
tion. 

orthocenter of a triangle: the point common to the three altitudes of the triangle. 

outerplanar graph: a graph that can be drawing in the plane so that every vertex 
lies on the exterior face. 
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overlap matrix — for a general rotation system p of a graph G and a spanning tree T: 
the matrix M, 7 = [m;,,;] whose entries are given for all pairs of edges e;,e; of the 
cotree G — T by 

1 ift J and pure(p) |r+este, is nonplanar 
myj= 41 ift= 7 and edge i is twisted 
0 otherwise 
The notation pure(p)|r+e,+e, means the restriction of the underlying pure part of 

the rotation system p to the subgraph T + e; + e;. 

panel — of a triangulation: a 3-cycle in the skeleton that always bounds a face in any 
imbedding of the skelton graph in that same surface. 

panel structure — of a triangulation: the composite structure of a triangulation and 
its panels. 

panel structures, equivalent: two panel structures whose skeletons with all panels 
inserted form homeomorphic 2-complexes. 

parallel lines — of a geometry: lines with no point in common. 

partially balanced incomplete block system (abbr. PBIBD): a geometry of 
v points (each in r lines) and 6 lines (each containing & points), together with a 
strongly regular graph (whose vertices are the points of the geometry) such that two 
non-adjacent points belong to 1 lines and two adjacent points belong to Az lines. 

pasting topological spaces X and Y — along homeomorphic subspaces: obtaining a 
new topological space from the original ones by identifying the points of the home- 
omorphic subspaces under a homeomorphism. 

permutation scheme: a particular system for describing a map using a pair of per- 
mutations. 

Petrie dual of a map M: a map M? with the same underlying graph as M, whose 
face boundaries are closed walks in Gy, such that any two consecutive edges, but 
not three, belong to a face of M. 

Platonic solids: the five regular geometric solids — tetrahedron, octahedron, cube, 
dodecahedron, icosahedron. 

pinched open disk: a topological space obtained from several copies of open disks by 
identifying their respective centers to a single vertex. 

planar graph: a graph whose minimum genus is 0. 

planarizing collection of cycles: aset C),...,Cg of cycles in an imbedded graph such 
that cutting along all of the C; simultaneously yields a connected graph imbedded 
in the plane. 

planarizing curve — for a nonplanar region of a noncellular graph imbedding: a sepa- 
rating closed curve such that all of the boundary components lie to one side of the 
separation and all of the genus lies to the other. 

point,: a point of Euclidean space or a topological space. 

point,: a vertex of a graph. 

polygonal complex: roughly, a topological space obtained from a set of oriented 
polygons by pasting some of these polygons to each other (and to themselves) along 
their sides. 

polyhedral imbedding: an imbedding such that the intersection of any two face 
boundaries is either empty or a path. 

polyhedral map: a map M whose face boundaries are cycles, and such that any two 
distinct face boundaries are either disjoint or meet in either a single edge or vertex. 

__, weakly neighborly: a polyhedral map for which every pair of vertices is con- 
tained on a face. 
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preferred direction — of the traversal of a polygon boundary: a chosen direction of 
traversal of sides of the polygon (clockwise or counterclockwise). 

projective plane N,: a closed surface obtained from the closed unit disk by identifying 
pairs of boundary points that are diametrically opposite relative to the center of the 
disk. 

projective plane: the nonorientable surface of genus 1. 

2-pseudomanifold: a topological space in which each point has a neighborhood that 
is homeomorphic either to an open disk, to a half-disk, or to a pinched disk. 

pseudosurface: a 2-pseudomanifold, usually assumed to be connected. 

punctured surface: a surface with one boundary component. 

quadrangulation: a graph imbedding all of whose faces are 4-sided. 

quadrilateral: a 4-sided face of a graph imbedding. 

radial graph R(G) of an imbedded graph G: an imbedded bipartite graph whose 
vertices are the vertices and faces of G, and whose edges join incident elements. 

ramification point of a covering: see branch point. 

realization — of a map: an imbedding of the map into Euclidean space E% such that 
each face is a plane convex polygon and adjacent faces are not coplanar. 

regular map: a map whose automorphism group acts transitively on the set of flags. 

representativity p(G) — of an imbedded graph G: same as the face-width of the 
imbedding. 

Ringel—Youngs Theorem: the theorem that the chromatic number of the orientable 
surface S, equals the Heawood number 


poe 


rooted graph: a graph with a distinguished vertex or edge, called the root-vertex or 
root-edge. Sometimes a graph has more than one root-vertex or root-edge. 

rooted imbedding: an imbedding with a distinguished vertex v, an edge e incident 
with v, and a face f incident with e. 

rooted map: a map in which a flag has been distinguished. 

rotation (global) — on a graph: an assignment of a rotation at each vertex. 

rotation at a vertex: a cyclic permutation on the set of half-edges incident to the 
vertex. 

rotation system: a purely combinatorial description of an imbedding of a graph G on 
a surface, by giving a rotation at each vertex of G. 

semicellular graph imbedding -— of a graph G: an imbedding G > S' whose regions 
are planar, but which may have more than one boundary component. 

separating closed curve — on a surface: a simple closed curve the excision of which 
splits the surface into two components. 

p-sequence — of a polyhedral map: the sequence {p;}, where p; is the number of i-gonal 
faces. 

v-sequence — of a polyhedral map: the sequence {v;}, where v; is the number of vertices 
of degree 7. 

signed boundary walk — of a face of a polygonal complex: the list of the signed edges 
that occur on an oriented boundary walk of that face. 

signed edge — in a polygonal complex: the occurrence of an oriented edge or of its 
reverse edge within a walk in the 1-skeleton of the polygonal complex. 

similar imbeddings: two imbeddings such that one can be changed into the other by 
a sequence of YA- and AY-transformations and the taking of geometric duals. 
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simple map: a map in which each vertex has degree 3. 

simplicial map: a map where each face boundary is a 3-cycle. 

skeleton, or 1-skeleton — of a polygonal complex: the graph consisting of the vertices 
and edges of the polygonal complex. 

skew polyhedron: a realization of a polyhedral map in R®, for d > 3. 

skew vertex — in a triangulation: a vertex whose skeleton has at least two different 
cycles that contain all of its neighbors. 

smoothing — a degree-2 vertex: an operation that removes a degree-2 vertex v then 
adds a new edge between the two neighbors of v. 

sphere Sy: a surface homeomorphic to the standard sphere {(z, y, z) | @7+y?+2? =1} 
in R®. 

standard triangulation on the sphere: an n-vertex triangulation on the sphere 
whose skeleton is isomorphic to the join P,-2 + Ko. 

star neighborhood of a vertex v in a triangulation: the wheel obtained by joining 
the link of v to v. 

Steiner triple system: a balanced incomplete block design with k = 3 and \ = 1. 

stratified graph for a graph G: a graph in which each imbedding of G is represented 
by a vertex, and in which each edge represents a transition between two imbeddings. 

strong symmetric genus of a group A: the smallest number g such that the group 
A has a Cayley graph with a strongly symmetric imbedding in an orientable surface 
of genus g. 

strongly cellular imbedding — of a graph: an imbedding such that the closure of 
each face is a closed disk. 

strongly noncellular graph imbedding G — S: an imbedding with at least one 
nonplanar region. 

strongly noncontractible closed curve — in a region of a noncellular graph imbed- 
ding: a curve such that cutting it open reduces the genus of the region. 

strongly symmetric imbedding of a Cayley graph C(.A, X) in an orientable sur- 
face S: an imbedding such that the natural action of A on C(A,X) extends to an 
orientation-preserving action on the surface S. 

surface: a 2-manifold, usually assumed to be connected, compact, and without bound- 
ary, unless otherwise declared. 

surface minor — of an imbedded graph: another imbedded graph in the same surface 
formed by a sequence of edge deletions and contractions in the surface. 

surface with k holes: a surface obtained by removing the interiors of k disjoint disks 
from a closed surface. 

symmetric imbedding of a Cayley graph C(A, X) in a surface S: an imbedding 
such that the natural action of A on C(A, X) extends to the surface S. 

symmetrical map: a map with at most two orbits under the action of the automor- 
phism group on the set of flags. 

symmetry group of an imbedding f : G > S: the subgroup of automorphisms that 
are symmetries of an imbedding f; denoted Sym(f). 

symmetry of an imbedding f : G > S: an automorphism o € Aut(G) such that 
hf = fo for some homeomorphism h: S — S. 

tessellation {p,q}: the classical tiling of the sphere, Euclidean plane, or hyperbolic 
plane into p-gons, of which gq are incident at each vertex. 

topological realization — of a graph: a topological space obtained from the graph by 
first assigning to each of its edges a closed interval and then identifying endpoints 
of intervals according to the coincidences of the corresponding endpoints of edges of 
the graph. 
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g-torus Sy: a connected sum of g copies of a torus; this surface is usually called the 
orientable surface of genus g. 

torus S;: the orientable surface of genus 1; a closed surface obtained by rotating a 
circle {(a, y, z) | (a — 2)? + y? =1,z = 0} around the y-axis. 

total imbedding distribution — of a graph G: the bivariate polynomial 


Iq(a,y) = Iq(x) + Ta(y) = Sue aig S07, (@y 
j=0 j=l 


AY-transformation — in the theory of triangulations: the operation of deleting the 
three edges joining three mutually adjacent vertices and inserting a new vertex with 
new edges to all three vertices; the inverse of a Y A-transformation. 

Y A-transformation — in the theory of triangulations: a graph formed by deleting a 
vertex v of degree three and adding in a new 3-cycle incident with its neighbors. 
transitively acting group H — of covering transformations for a covering projection 
p: eee? Gee group whose restriction to every fiber is a transitive permutation 

group. 

triangle group: a group of isometries generated by rotation about the vertices of a 
triangle with angles 1/p,a/q,7/r; the symmetry group of the tessellation of type 
{p,q}. 

__, full: a group generated by the reflections in the sides of a triangle with angles 
w/p,1/q,n/T. 

triangle: a 3-sided polygon, a figure in plane geometry; see centroid, cevian, circum- 
center, incenter, orthocenter. 

triangle: a 3-sided face of a graph imbedding. 

triangles: a 3-cycle of a graph. 

triangular imbedding: an imbedding that imbeds a graph with all faces 3-sided. 

triangulates a surface — a possible graph property: having a triangular imbedding in 
some surface. 

triangulation of a surface: a simplicial map where each face boundary is a 3-cycle. 

, Catalan: a triangulation on a surface-with-boundary whose boundary includes 
all vertices. 

, clean: a triangulation such that every 3-cycle in the skeleton bounds a face. 

, combinatorially equivalent: two triangulations that have the same set of face 
boundary cycles. 

, eulerian: a triangulation with each vertex of even degree. 

, frozen: a triangulation such that no edge can be flipped without giving the 
skeleton a double adjacency. 

, k-irreducible: a triangulation such that each edge is contained in an essential 
cycle of length at least k. 

, isomorphic triangulations: two triangulations G; — S and Gz — S such that 
there is a homeomorphism h : S > S such that h(Gi) = Go. 

, isotopic triangulations: two triangulations Gj — S and Gg > S' such that 
there is a homeomorphism h : S > S with h(G,) = G2 that is isotopic to 
the identity mapping on S. (Roughly speaking, this means that one can be 
transformed continuously on the surface into the other.) 

, minimal clean: a clean triangulation minimal with respect to edge contractions. 

, minimal — of a surface: a triangulation on the surface having the fewest vertices. 

, k-minimal: the same as a k-irreducible triangulation. 

, pseudo-minimal: a triangulation such that no sequence of diagonal flips trans- 
forms it into one having a vertex of degree 3. 
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__, tight: a triangulation G — S such that, for any partition of V(G) into three 
nonempty subsets V,, V2 and V3, there is a face vivgu3 € F(G > S) with v; € Vj. 
__, untight: a triangulation that is not tight. 
type-{p,q} map: a map with p edges incident with each vertex and q edges incident 
with each face. 
underlying cellular imbedding — of a semicellular graph imbedding: the imbedding 
obtained by cutting each non-cell region open along a maximal family of boundary- 
separating closed curves and capping the holes with disks. 
underlying graph of a map: the 1-skeleton. 
underlying semicellular imbedding — of a strongly noncellular graph imbedding 
G — S: the imbedding obtained by cutting each non-cell region open along a max- 
imal family of boundary-separating closed curves and then capping the holes with 


disks. 
unimodal sequence {a,,}: a sequence such that there exists at least one integer M 


such that 
Qm—-1 < am forall m< M and am >Gm+41 for all m>M 

uniquely imbeddable on a surface S — a possible graph property: having a unique 
imbedding on the surface (up to a suitable equivalence). 

unit disk — closed, open: respective subsets {(2, y) | 7+y? < 1} and {(a, y) | 27+y? < 
1} of the Euclidean plane together with the inherited Euclidean topology. 

unit half-disk: a subset {(x,y) | x > 0,2? + y? < 1} of the Euclidean plane together 
with the inherited Euclidean topology. 

upper-imbeddable graph: a graph G whose maximum genus is equal to |G(G)/2], 
where 6(G) is the cycle rank of G. 

f-vector — for a graph map: the triple (fo, fi, f2) where f; is the number of i-dimen- 
sional faces of the map. 

vertex — of a polygonal complex: the image of arbitrarily many polygon corners that 
have been pasted together when building the polygonal complex. 

vertex-amalgamation — of two disjoint graphs G and H: the result of identifying a 
vertex u of G and a vertex v of H. Notation: G, * Hy. 

vertex-face graph: same as the radial graph. 

vertex-face incidence graph: same as the radial graph. 

vertex splitting: an operation on a map inverse to edge contraction — a single vertex 
is replaced by two vertices joined by an edge. 

vertex-transitive action of a group of automorphisms on a graph: a group such that 
for any pair of vertices, there is an automorphism taking one vertex to the other. 

vertex-transitive map: a map whose automorphism group acts transitively on the 
set of vertices. 

voltage — on a directed edge e: the value a(e) assigned to e by a voltage assignment. 

voltage assignment - on a digraph G = (V,£): a function a from edge-set E to a 
group 6; used to specify a derived digraph. 

voltage graph: a pair (G,a), where G is a digraph and a : Eg > B is a voltage 
assignment, an algebraic specification of a derived graph. 

voltage group: the group in which a voltage assignment a: Eg — B takes its values. 

voltage sequence on a walk W = vp, €1, 1, €2,---;€n; Un in a voltage graph (G,a): 
the sequence of voltages a1,...,@n encountered, where a; = a(e;) or a(e;)~1, re- 
spectively, depending on whether edge e,; is traversed in the forward or backward 
direction. 
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walk in a voltage graph (G,q): a walk in G as if it were undirected, so that some of 
its edge-steps may proceed in the opposite direction from the assigned direction on 
the edge it traverses. 

k-walk: a spanning walk that visits no vertex more than k times. 

weakly neighborly polyhedral map: see polyhedral map. 

wheel-neighborhood, having a — a possible property of a vertex v: the property 
that any two face boundaries containing v intersect in a path. 

Whitney flip: a transformation of an imbedding of a 2-connected graph that replaces 
a subgraph by its mirror image. 

Xuong tree TJ in a graph G: a spanning tree whose deficiency is equal to the deficiency 
of the graph G. 

YAY-equivalent graphs: two graphs such that one can be changed into the other by 
a sequence of YA- and AY-transformations. 

Y A-transformation: see transformation. 
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INTRODUCTION 


Extremal graph theory is concerned with inequalities among functions of graph in- 
variants and the structures that demonstrate that these inequalities are best possible. 
Accordingly, in its wide sense, it encompasses most of graph theory. Nevertheless, there 
is a clearly identifiable body of core extremal results: in this all-too-brief review we shall 
present a selection of narrowly interpreted extremal results. 

Since by now there are many thousands of papers on extremal graph theory, no short 
survey of extremal graph theory has any hope of being complete. There is no doubt 
that the selection of topics in this survey, in which we shall concentrate on the basic 
graph invariants such as size, maximal and minimal degrees, connectivity, number of 
r-cliques, and independence number, strongly reflects the tastes and preferences of its 
authors. 


CONVENTIONS AND NOTATIONS 


C1: Unless explicitly stated, all graphs are assumed to be defined on the vertex set 
[nm] = 41, 23..237) 
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C2: G(n) stands for a graph with n vertices and G(n,m) stands for a graph with n 
vertices (“of order n”) and m edges (“of size m”). Thus, the statement 


“if in G = G(n) with n > 3, every vertex has degree at least n/2, then G 
is Hamiltonian...” 


means that every graph of order n > 3 and minimal degree n/2 is Hamiltonian. 
N1: v(G) and e(G) denote the numbers of vertices and edges in a graph G. 
N2: A(G) and 6(G) stand for the maximal and minimal degrees of G. 


N3: Ifwisa vertex of a graph G, then '¢(w) is its set of neighbors, and dg(u) = |Te(u)| 
is its degree. We use d(u) and ['(u) instead of dg(w) and I'¢(u) when it is clear which 
graph G is intended. 


N4: Ifthe graphs G; and G2 are disjoint graphs, then G1 + G2 denotes the join of G1 
and G2 (see Subsection 1.1.3), that is, the union of G, and G2 together with some new 
edges joining every vertex of G, to every vertex of G2. 


N5: We denote by k,(G) the number of copies of the complete graph K, in the graph G. 


DEFINITIONS 


D1: Given graphs H and G, we say that G is H-free if G has no subgraph isomorphic 
to H. 


D2: An s-clique is a complete subgraph on s vertices. Thus, k,(G) equals the number 
of s-cliques of the graph G. 


D3: Given a graph G, the clique number w(G) of G is the order of its largest clique. 
D4: The independence number a(G) of G is the clique number of its complement. 


D5: A graph is k-connected if the deletion of fewer than k of its vertices leaves it 
connected. The vertex connectivity «(G) of a graph G is the maximal number & such 
that G is k-connected. 


EXAMPLES 


Let us illustrate the difference between the wide and narrow interpretations of extremal 
graph theory with two examples. 


El: Let f(n) be the minimal number of triangles needed to cover all the edges of a 
complete graph with n vertices. Then 


f(n) < (5)i3 


and equality holds iff there is a Steiner triple system of order n. This assertion is not 
really a result in extremal graph theory, but leads to design theory. 


E2: Consider the statement that “every 2-connected graph with n vertices and minimal 
degree k contains a cycle with at least min{2k,n} vertices”. This is clearly a result in 
extremal graph theory with the narrow interpretation: it expresses a relationship involv- 
ing the number of vertices, the minimal degree, the connectivity, and the circumference 
(maximal length of a cycle). 
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8.1.1 Turan-Type Problems 


The quintessential problem in extremal graph theory is the following question due to 
Turan: given 3 < r < n, what is the maximal size graph G(n) that does not contain K,.? 
Equivalently, what is the maximal size of a K,-free G(n)? 


DEFINITION 
The Turan problem has been substantially generalized. 


D6: Let {F,}°_, be a sequence of families of graphs, and let ®(n, F;,) be the set of 
graphs G(n) that are H-free for every H € Fy. 


e The function ex(n, F;,) of n is called the extremal function of the sequence 
{Fn}. 

e The graphs G € ®(n, F;,) for which e(G) = ex(n, F;,) are called extremal graphs. 

e In this context, the families {F;,}°-_, are called forbidden graphs. 


e The aim to find or estimate ex(n, F;,) and to determine the extremal graphs is 
called a Turdn type problem for the families of forbidden graphs {F;,}>—,. 


Turan’s Theorem and Its Extensions 


The fundamental theorem of Turan has been the driving force of extremal graph theory 
for more than six decades. 


DEFINITIONS 


D7: Let n> r > 2 be integers. The Turdn graph T,(n) is the complete r-partite 
graph whose classes are as nearly equal as possible. 


D8: The Turdn number t,(n) is the size of the Turan graph T,(n). 


FACTS 


Fl: Ifn=rs+t(0<t<r-1), then T,(n) has ¢ classes of cardinality [n/r] and 
r —t classes of cardinality |n/r|. Therefore, 


r—-1 
2r 


2 > t,(n) > a 
re eo 


F2: Mantel [Man07] If a graph G = G(n) is K3-free, then e(G) < |n/4]. 


F3: Turdn’s theorem [Tur41] If a graph G = G(n) is K,4,-free, then e(G) < t,(n); 
and if e(G) =t,(n), then G = T,(n). 


F4: Zykov [Zyk49], Erdés [Erd62a] If a graph G = G(n) is K,+1-free, then for every 
s =2,...,r we have k,(G) < k.(T,(n)); and if ks(G) = ks(T,-(n)) for some s such that 
2<s<r, then we have G = T,.(n). 
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F5: Erdés [Erd70] For every K,.1-free graph G, there exists an r-partite graph H with 
V(H) = V(G), such that dg(u) < dy(u) for every u € V(G). If G is not a complete 
r-partite graph, then H may be chosen so that dg(u) < dy(u) for some u € V(G). 


F6: KhadZiivanov [Kha77], Fisher and Ryan [FiRa92] If the graph G is K,+1-free, 
then for every s = 1,...,7—1 we have 


1/s 1/(s+1) 
kis(G) s [ ks+1(G) 
(") ale Sa) 
F7: Motzkin—Straus inequality [MoSt65] If a graph G = G(n) is K,+1-free and 
X1,.--,%, are nonnegative numbers then 


S ae =< ye 


veEV(G) uel (v) veEV(G) 


2 


F8: Bomze inequality [Bom97] If the graph G = G(n) is K,+1-free and if the 


numbers 21,...,2%» are nonnegative and not all zero, then , 


S- Lan +5 Oe < (1-3) ae (1) 


vEV(G) uel(v) uEV(G) 


Let %u,,---,2u, be the nonzero numbers in {a1,...,@,}. Equality in (1) is attained if 
and only ifqg=r, ty, =-++: = ®y,, and the vertices ui1,...,u, are an r-clique in G. 


Structural Properties of the Graphs G(n,t,(n) + 1) 


Turdn’s theorem guarantees that every G(n,t,(n) + 1) contains a K,41, but further 
investigations revealed a lot more properties of such graphs. We present below three 
topics of considerable interest. 


NOTATIONS 


N6: K,.(s1,...,8,) denotes the complete r-partite graph with classes of size 51,..., ,, 
respectively. The graph K,*(s1,...,8,) is obtained from K,(s1,..., 8) by adding an 
edge to the first specified class, i.e., of order s1. 


N7: For all natural numbers n,m,r let 6(n,m,1r) denote the maximal value such that 
every graph G(n,m) has an r-clique R with >> {d(u) : u€ R} > d(n,m,r). 

FACTS 

F9: Bollobds and Thomason [BoTh81], Erdés and Sés [ErSo83] For r > 3 every 
G(n,t,(n) + 1) has a vertex u with degree d(u) > (1—1/r— (1+ Vr)~+) n and such 
that ['(u) induces more than t,_;(d(u)) edges. 


F10: Bondy [Bon83a], [Bon83b] For r > 3 every G(n,t-(n) + 1) has a vertex u of 
maximal degree such that [(u) induces more than t,_1(d(w)) edges. 


956 Chapter 8. Analytic Graph Theory 


F11: Bollobds [Bol99] Let G = G(n,t,-(n) + a) where a > 0. Let wu be a vertex of 
maximal degree d(u) = n — k. Then e(G [I'(u)]) > t-(d(u)); and the inequality is strict 
unless k = |n/r|, d(w) = [(r — 1)n/r], the set V\I'(u) is independent, and every vertex 
of I'\(w) is joined to every vertex of V —T(w). 


F12: Erdés [Erd63] For every ¢ > 0 there exists a number c = c(e) > 0 such that 
every G(n, |n?/4| +1) contains a KZ (clogn,n'~*). 

F13: Erdés and Simonovits [ErSi73] For every g and n sufficiently large, every 
G(n,t,(n) + 1) contains a K*(q,...,¢)- 


F14: Edwards [Edw77], [Edw78] It 3 <r <8,n>r? and m > t,(n), then 


2 
5(n,m,r) > 


n 


F15: Faudree [Fau92] If r > 3, n > r?(r —1)/4, and m > t,(n), then 
rm 
d(n,m,r) > —— (2) 
n 


REMARKS 


R1: Ina sense, Fact F13 is best possible, since if H is a fixed graph that occurs in any 
graph G = G(n,t,-(n) +1) then H Cc Ky (q,...,q) if ¢ is sufficiently large. However, 
Fact F12 suggests that extensions are still possible, although we are not aware of any 
such extension. 


R2: Bollobas and Nikiforov showed that Fact F15 holds for every n. 


R3: The result in Fact F16 confirms a conjecture of Bollobdés and Erdés [BoEr76], 
and it is essentially best possible, since if G is regular, then equality holds in (2). On 
the other hand, if m < t,_1(n), then d(n,m,r) = 0. It is a difficult open question to 
determine 6(n,m,r) for tp-1(n) <_m < t,(n) (see, e.g., [CEV88]). 


Books and Generalized Books 


The study of books was initiated by Erdés in 1962 [Erd62b] and has attracted much 
effort since then. Nevertheless, the Turan problems about books, except for the simplest 
case, are largely open. 


DEFINITION 


D9: For gq >1,r > 1anr-book is the graph BY consisting of g distinct (r+1)-cliques, 
sharing a common r-clique. 
e The value q is called the size of the r-book; we write bk“) (G) for the size of the 
largest r-book in a graph G. 
e We call 2-books simply books and write bk(G) for bk)(G). 


FACTS 


F16: Dirac [Dir63] Every G = G(n, t,(n) +1) contains a K-42 with one edge removed. 
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F17: Edwards [EdMS], KhadZiivanov and Nikiforov [KhNi79] 


#(o(+[]"))>5 


and this inequality is essentially best possible in view of the following graph. Let n = 6k. 
Partition [n] into 6 sets Au, Aj, Aj3, Aoi, Ago, Ao3 with |Ai1| |Aj2| |Aj3| k-1 
and |Ao1| = |A22| = |Ao3| =k +1. For 1 < j <k <3 join every vertex of A;; to every 
vertex of Aj,, and for 7 = 1,2,3 join every vertex of Aj; to every vertex of Ag;. The 
size of the resulting graph is greater than |n?/4| + 1, and its booksize is n/6 + 1. 


F18: Erdés, Faudree, and Rousseau [EFR94] If m > (r — 1)n?/2r and if n is 
sufficiently large, then 


3r— 4 
(r) | 
bk") (G(n,m)) > arr 1” 


For g >r > 2, s > 0 define the graph G with V(G) = [r] x [q] x [s] and join two vertices 
(21, Y1, 21) and (#2, yo, 22) if and only if x; # x and y; ¥ yo. Setting 


1 1 2 
n=rqs and m= (1 )(: )4 
q Py 2 


we have G = G(n,m), and for 1 <k <r, 


bk (G) = (1-*) (1 =) 


F19: Erdés, Faudree and Gy6ri [EFG95] There exists a number c > 1/6 such that 
if G = G(n) and 6(G) > n/2, then bk(G) > cn. 


Vertex-Disjoint Cliques 


The next two results are milestones in extremal graph theory. In particular, the proof 
of Hajnal-Szemerédi theorem, although somewhat simplified by Bollobads in [Bol78], is 
still very difficult. 


DEFINITION 
NOTATION: The union of s vertex-disjoint copies of a graph G is denoted by sG. 


D10: Let H and G be graphs such that v(G) = ku(H). If kH C G then G is said to 
have an H-factor. 


FACTS 


F20: Corrddi—Hajnal theorem [CoHa63] Let n,k be natural numbers with n < 3k 
and s = |n/k], and let t = k — (n— ks). If G = G(n) and 6(G) > 2k, then G contains 
k; vertex-disjoint cycles of length at most s+ 1, and ¢ of them are of length at most s. 
In particular, if 6(G) > 2n/3, then G contains |n/3] K3. 


F21: Hajnal—Szemerédi theorem [HaSz70] If G = G(n) and A(G) < r, then V(G) 
can be partitioned into r+ 1 independent sets such that the sizes of any two sets differ 
by at most 1. 
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8.1.2 The Number of Complete Graphs 


An exciting and difficult problem is to determine min k,(G) for a given value of 
k,.(G). In spite of the few illuminating results to be presented below, the general problem 
remains largely unsolved. 


FACTS 
F22: Rademacher [Erd62b] 


n(e(ol8))) > 


and this inequality is best possible. 


F23: Lovdsz and Simonovits [LoSi83] If 1 < |n/2], then 


n(o(o[e]+)) 28 


and this inequality is best possible. 


F24: Let 0 <1 < n/2r and suppose the graph G is obtained by adding / disjoint edges 
to one of the larger classes of the Turan graph T;.(n). Then k,41(G) is given by 


n+ 
Tr 


r—2 
kran(G@) = flrs) =] | 
i=0 
F25: Erdés [Erd69] For every r there exist c = c(r) > 0 and no = no(r), such that if 
n > no and 0 <1 < cn, then 
kr 1(G(n, tr(n) +1) > fr(n4 1) 
and this inequality is best possible. 


F26: Fisher [Fis89] If G = G(n,m) and n?/4 < m < n?/3 then 


— I9n3 — 2_ 3/2 
kg(G) > 9nm — 2n _ 3m) 


and this is best possible up to a term of order O(n?). 


F27: Nordhaus and Stewart [NoSt63], Moon and Moser [MoMo62] If G = G(n) 
and k,(G) > 0 then 
ksri(G@) So hs(G) 

k(G) ks_1(G) 


F28: Bollobas [Bol76] Suppose that 2<s<r<n. Let the function ¢(a) be defined 


in the interval [0, (”)] such that for every g=s,...,n, 


(i) @(ks(Tq(m))) = kr (Tq(n)); 
(ii) @ is linear in the interval [k,(Ty-1(n)), ks(Tq(n))]. 


Then 


(s? — 1) 


>s 


Kir (G) 2 o(ks(G)) 


Section 8.1. Extremal Graph Theory 959 


8.1.3 Erdds—Stone Theorem and Its Extensions 


The fundamental theorem of Erddés and Stone has attracted the attention of re- 
searchers for more than 50 years; no doubt this will continue in the future. The theorem 
can be viewed as a considerable extension of Turan’s theorem: slightly more than t,.(n) 
edges in a graph of order n guarantees not only a K,4, but a K,+1(q) for q fairly large. 
Equivalently, the Erdés—Stone theorem solves asymptotically the Turan problem for a 
fixed family of forbidden graphs. 


NOTATION 


N8: For ¢ > 0 and natural 2 <r <n, let g(n,r,¢) denote the maximal number gq such 
that every G(n, |(1 —1/r +¢)n?]) contains a K,41(q) for n sufficiently large. 


FACTS 


F29: Erdés—Stone theorem [ErSt46] For « > 0 and 2 <r <n, the function g(n, 7, ) 
tends to infinity when n tends to infinity. Since T).(n) is r-chromatic, this implies the 
result of Erdés and Simonovits [ErSi66] that if F = {F\,...,F),} is a fixed family of 
graphs and r + 1 = min; (Fj) > 2, then 


-1 
ex(n, F) = = n? + 0(n?) 
Tr 


F30: Bollobdés and Erdés [BoEr73] There exist constants c,,c2 > 0 such that 
cilogn < g(n,r,€) < cologn 


F31: Bollobds, Erdés, and Simonovits [BES76] There exists a > 0 such that if 
0<e<1/r then 
alogn 


Gln; 0,2) 2 1 
r log = 


There exists €, > 0 such that if 0 << ¢ <¢, then 


] 
g(n,7,2) < 384 
log = 
F32: Chvatal and Szemerédi [ChSz83] 
as logn 
n, Tr, _— Eee aaa ie 
3 500 log 2 


F33: Bollobds and Kohayakawa [BokKo94] There exists an absolute constant a > 0 
such that if 
r>2, O0<y<1l, O0<e<I1/r 


then every graph G of sufficiently large order n with e(G) > (1 —1/r +¢)n? contains a 
K,41(s,m,...,m,l) such that 


logn logn 


L> aelt7/2n7 


m > a(1—7) > 


s > a(1—7) 


rlog 2’ logr’ 
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F34: Ishigami [Ish02] There exists an absolute constant 6 > 0 such that if 
r>2, O0<y<1, and 0<e<I1/r 


then every graph G of sufficiently large order n with e(G) > (1 —1/r +e)n? contains a 
K,41(s,m,...,m,l) such that 


logn 


and I> n7 


> BI 
Tr? m > B( ieee 


The Structure of Extremal Graphs 


The structure of extremal graphs is fairly well understood in the case of a fixed family of 
forbidden graphs. Moreover, the stability theorems of Erdés and Simonovits give useful 
information about the structure of a graph without forbidden subgraphs, provided the 
size is close to the maximum. 


FACTS 


F35: Stability theorem. Simonovits [Sim68] For every r there is some c = c(n) such 
that if | < cn then every K,+1-free graph G(n,t,(n) — 1) is r-chromatic. 


F36: Stability theorem. Erdés [Erd68], Simonovits [Sim68] Let H be a graph with 
x(F) = r+1 > 3. For every ¢ > 0 there exists 6 > 0 such that if G is a H-free 
graph of order n and e(G) > ((r — 1)/2r — 6)n? then there is a K,(n1,...,n,) with 
ny +...+n, =n that can be obtained from G by changing fewer than en? edges of G. 


F37: Erdés [Erd68], Simonovits [Sim68] Let F = {Fi,...,F,} be a fixed graph 
family with r+ 1 = min; y(F;) > 3 and suppose that F has an (r+ 1)-coloring in which 
one of the color classes contains t vertices. Then 


ex(n,F) = (‘—) n? + O(n2-!/t) 


If G € ®(n, F) is such that e(G) = ex(n, F’), then 6(G) = (1 — 1/r)n + o(n) and all of 
the following hold: 
(i) the vertices of G can be partitioned into r classes each of size n/r + o(n); 
(ii) each vertex is joined to at most as many vertices of its own class as to any other 
class. For every €¢ > 0 the number of vertices joined to at least en vertices of their 
own class is o(n); 
(iii) there are O(n?—!/*) edges joining vertices of the same class. 


F38: Simonovits [Sim68] Let n and s be fixed integers. If n is sufficiently large, then 
every graph G = G(n) with 


AG)S2 Gas Yale Wns v+(5’) 


contains sK,+41, unless G = K,-; + T,(n—s-—1). 
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8.1.4 Zarankiewicz Problem and Related Questions 


The problem of Zarankiewicz is the counterpart of Turdn’s theorem for bipartite 
graphs; this problem has turned out to be extremely difficult — it seems that even 
today we are very far from a satisfactory solution. 


NOTATION 
N9: Let z(m,n, s,t) denote the largest size of an n-by-m bipartite graph not containing 
the complete biparatite graph Ko(s,t), and set z(n,t) = z(n,n, t,t). 
FACTS 
F39: 2ex(n,{Ko(s,t)}) < z(m,n,s,t) < ex(2n, {Ko(s,t)}). 
F40: K6vary, Sé6s, and Turan [KST54] If2<s<mand2<t<n, then 

z(m,n, s,t) < (s—1)/#(n-—t+1)m!-“* + (t-—1)m 
and 

z(n,t) < (t—1)V/*n?-/* + O(n) 


F41: Reiman [Rei58] 
(i) 2(n,2) < (n/2)(. + Vin=3); 
(ii) for every n = q? +q+1, where gq is a power of a prime, 


a(n,2) < $(1+ Vin—3) =(g-Y@? +441) 
(iii) limy +o 2(n, 2)n—3/? = 1. 


F42: Erdés, Rényi, and Sés [ERS66], Brown [Bro66] Let q be a power of a prime. 
Then for the cycle Cy = K (2,2), we have 


1 1 +1 
549+ 1)? < ex(g? +4 +1,{Cr}) < Sala +1)? + (3) 
a (n, {Ca}) 
< ex(n, af) 1 
jim, — 3/2 3 


The Erdés—Rényi graph giving the lower bound in (3) has for vertices the g?+q+1 
points of the projective plane PG(2, q) over the field of order g, and two points (21, y1, 21) 
and (2, y2, 22) are joined if and only if x172 + yiy2 + 2122 = 0. 


F43: Fiiredi [Fur83] For every natural number gq, 


1 
ex(q? +q+1,{Ca}) < sala +1)” 
and if g is a power of a prime, then 


1 
ex(q? +q+1,{Ca}) = sala +1) 


962 Chapter 8. Analytic Graph Theory 


K,-Free Graphs with Large Minimal Degree 


In 1973 Erd6s and his collaborators initiated the study of K,-free graphs with large 
minimal degree. It turned out that under certain conditions, the chromatic number 
of such graphs is bounded, and as later investigations showed, their structure is well 
determined. Despite intensive efforts the general questions remain open, the most chal- 
lenging of which is the conjecture that every K3-free G = G(n) with 6(G) > n/3 is at 
most 4-chromatic. 


DEFINITIONS 


D11: We say that a graph G is homomorphic to a graph H if there exists a mapping 
f :V(G) > V(#) such that (u,v) € E(G) implies that (f(u), f(v)) € E(A). 


D12: The kth power of a cycle C,, is a graph G with V(G) = [n] and (i,7) € E(G) 
if and only if |¢ — j] = 1,2,...,4 mod n. 


D13: The square of a cycle is its second power. 


EXAMPLES 
E3: Mycielski graphs [Myc55] Define the sequence of graphs M1, M2,... as follows: 
set M, = Ko; suppose that M,_, is already defined, and let V(M,_-1) = [n]. Set 
V(M,) = [2n + 1], and let E(M,) be the union 

E(Ms-1) U {(i,7 +): (G9) € B(Ms-1)} U {Qn + Li) in <i 2n} 


For every i, the graph M; is K3-free and y(M;) =i. In particular, the graph Mg is Cs; 
the graph M3 is also known as the Groétzsch graph. 


E4: Andrdsfai graphs [And62] Set A; = Ko and for every i > 2 let A; be the 
complement of the (i — 1)th power of C3;_1. For every i, the graph A; is K3-free and 
y(M;) = 3. In particular, the graph Ag is Cs; the graph A3 is also known as the Mébius 
ladder. 
FACTS 


F44: Andrdasfai, Erdés and Sés [AES74] If a graph G = G(n) is K,4,-free with 


minimal degree 
3 
> = 
d(G) > (1 ae :) n 


then G is r-chromatic. 


F45: Erdés-Hajnal-Simonovits graphs [ErSi73] If ¢ > 0, h > 2 and n is sufficiently 
large, then there exists a K3-free graph G(n) with 6(G) > (1/3 — e)n and y(G) > h. 


F46: Haggkvist [Hag82] Every K3-free graph G = G(n) with 6(G) > 3n/8 is homo- 
morphic to Az = Cs and so satisfies the inequality x(G) < 3. 


F47: Haggkvist [Hag82] For every natural number k, there exists a 4-chromatic, 
K3-free, 10k-regular graph of order 29k. 
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F48: Jin [Jin95] Every K3-free G = G(n) with 6(G) > 10n/29 is homomorphic to the 
graph Ag and so satisfies the inequality .(G) < 3. 


F49: Chen, Jin, and Koh [CJK97] Every K3-free 3-chromatic G = G(n) with 
6(G) > n/3 is homomorphic to some graph A;. If y(G) > 4, then M3 C G. 


F50: Brandt [Bra00] If G(n) is a d-regular maximal K3-free graph with d > n/3, then 
x(G) < 4. 


F51: Haggkvist and Jin [HaJi98] Let G = G(n) be K3-free and C5-free. If 6(G) > 
n/4, then G is homomorphic to C7. The bound n/4 is best possible, as there is a 
K3-free, C5-free, 3k-regular G(12k) that is not homomorphic to C7. 


8.1.5 Paths and Trees 


One of the most famous unsolved problems in extremal graph theory is the Erdés— 
Sods conjecture: 


Every graph G(n, |(k — 1)n/2| + 1) contains all trees of order k. 


The conjecture is true for many types of trees and under special conditions for the 
graph, but the general case remains open. 


NOTATION 


N10: P, denotes the path of order k. 


FACTS 


F52: Erdés—Gallai theorem [ErGa59] Every G = G(n) with e(G) > (n — 1)k/2 
contains a Py4i. If n = q(k — 1) +1, then there exists a graph G = G(n, (n — 1)k/2) 
containing no Py+1. 


F53: Faudree and Schelp [FaSc75] If n = kt+rand0<r<t, then 


ex(n, Pra) = (3) Ea @ 


and the extremal graphs are known. 


F54: Brandt and Dobson [BrDo96] Every graph G(n, |(k — 1)n/2] +1) of girth at 
least 5 contains all trees of order k. 


F55: Saclé and WodZniak [SaWo97] Every C4-free graph G(n, |(k — 1)n/2]| 4+ 1) 
contains all trees of order k. 


F56: Wang, Li and Liu [WLLO0] Every graph G(n, |(& — 1)n/2| + 1) whose edge- 
complement is of girth at least 5 contains all trees of order k. 


F57: Dobson [Dob02] Every graph G(n, |(k — 1)n/2| +1) whose edge-complement is 
K (2, 4)-free contains all trees of order k. 
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8.1.6 Circumference 


DEFINITIONS 


D14: The circumference of a graph G is the length of its largest cycle. It is denoted 
by c(G). 


D15: The girth of a graph G is the length of its smallest cycle. 


NOTATIONS 
N11: The set of cycle lengths of a graph G is denoted by C(G). 
N12: ec(G) is the largest even number in C(G) and oc(G) is the largest odd number 
in C(G). 
FACTS 
F58: Erdés and Gallai [ErGa59] If 2< k <n then 
(Gln, (k(n -1)/2) +1) =k 


F59: Bollobdas and Haggkvist [BoHa90] If G = G(n) and 6(G) > n/k, then c(G) > 
[n/(k —1)] and this inequality is best possible. 


F60: Egawa and Miyamoto [EgMi89] If G = G(n) and if d(u)+d(v) > [2n/k] when- 
ever u and v are two nonajacent vertices, then c(G) > [n/(k — 1)] and this inequality 
is best possible. 


F61: Dirac [Dir52] If G = G(n) is 2-connected with 6(G) = 6 < n/2, then c(G) > 20. 


F62: Voss and Zuluaga [VoZu77] If G = G(n) is a 2-connected, nonbipartite graph 
with 6(G) = 6 < n/2, then 


oc(G) > 26-1, ec(G) > 26 


8.1.7 Hamiltonian Cycles 


The theory of Hamiltonian graphs is one of the most popular areas of graph theory. 
Here we present several well-known results with an “extremal” flavor. 


DEFINITIONS 

D16: A graph G = G(n) is said to be Hamiltonian if n € C(G). 

D17: The closure of a graph G(n) is obtained by successively joining every two 
nonadjacent vertices u and v with d(u) + d(v) > n. 

FACTS 

F63: Dirac’s theorem [Dir52] If G = G(n), n > 3, and 6(G) > n/2, then G is 


Hamiltonian. 


Section 8.1. Extremal Graph Theory 965 


F64: Shi [Shi92], Bollobas and Brightwell [BoBr93] Let G = G(n), and let S be 
the set of vertices of degree at least n/2. If |S| > 3, then there is a cycle in G that 
includes every vertex of S. 


F65: Ore’s theorem [Ore60] If G = G(n) with n > 3, and if d(u) +d(v) > n whenever 
u and v are two nonajacent vertices, then G is Hamiltonian. 


F66: Pédsa’s Theorem [Pos62] Let G = G(n) with n > 3. If for every k with 
1<k < (n—1)/2, the number of vertices of G of degree not exceeding k is less than 
k, and for odd n the number of vertices of degree (n — 1)/2 does not exceed (n — 1)/2, 
then G is Hamiltonian. 


F67: Closure Lemma of Bondy and Chvatal [BoCv76] A graph is Hamiltonian if 
and only if its closure is Hamiltonian. 


F68: Chvatal’s theorem [Chv72] Let G = G(n) with n > 3 and with vertex degrees 
d(1) <...<d(n). If for every k < (n — 1)/2 either d(k) > k or d(n—k) > n—k, then 
G is Hamiltonian. 


F69: Chvatal—Erdés theorem [ChEr72] If a(G) < «(G), then G is Hamiltonian. 


F70: Fan and Haggkvist [FaHa94] If G = G(n) and 6(G) > 5n/7, then G contains 
the square of C,,. 


8.1.8 Cycle Lengths 


Erdés proposed the sum )*> {1/r:r¢€C(G)} as a measure of the wealth of cycle 
lengths in a graph G. He stated a conjecture that led to the following two results. 


FACTS 


F71: Gyarfads, Komlés, and Szemerédi |GKS84] There exists a number c > 0 such 
that for every graph G = G(n,m) we have 


S" {1/r sr € C(G)} > clog(2m/n) 


F72: Gyarfds, Prémel, Szemerédi, and Voigt [GPSV85] If & is sufficiently large 
and 2m > (1+ 1/k)n, then 


S¢ {1/r sr € C(G)} = (300k log k)~? 


Cycles of Consecutive Lengths 


In this section we present several sufficient conditions in terms of the size and minimal 
degree for the existence of large intervals in the set C(G) of cycle lengths. 


FACTS 


F73: Bondy-Simonovits theorem [BoSi74] Every graph G(n, |100kn1/*| +1) con- 
tains the cycle Cy, for k < 21 < kn1/*. 
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F74: Verstraéte [Ver00] Every graph G(n, |8(k — 1)n'/*| +1) contains the cycle Cy, 
fork 21 kn/*, 


F75: Fan [Fan02] If G is a graph with 6(G) > 3k, then G contains k + 1 cycles 
Co, C1,...,Cz such that 


k+1< |Co| <...<|Czl, IC;| — |Ci-1| = 2, (1 <i < k) 
and ICx| — |Ce-1| <2. 


F76: Gould, Haxell, and Scott [GHS02] For every c > 0 there exists a constant 
k, = k(c) such that if G = G(n) and 6(G)cn, then G contains a cycle of order t for every 
even t € [4, ec(G) — k] and for every odd t € [k, oc(G) — ki. 


Pancyclicity and Weak Pancyclicity 


In 1971 Bondy introduced the concept of pancyclicity that soon became a topic of 
intensive study. We present below only few of the known results. 


DEFINITIONS 
D18: A graph G = G(n) is called weakly pancyclic if C(G) is an interval. 


D19: A graph G = G(n) is called a pancyclic graph if C(G) = [3, n]. 


FACTS 


F77: Bondy [Bon71] If G = G(n, |n?/4|) is Hamiltonian, then G is pancyclic unless 
G = K((n/2] , [n/2]). 
F78: Bondy [Bon71] If G = G(n, |n?/4| + 1), then c(G) > [(n+3)/2] and G is 


weakly pancyclic. 


F79: Amar, Flandrin, Fournier, and Germa [AFFG83] If n > 102, G = G(n) is 
Hamiltonian and 6(G) > 2n/5, then G is pancyclic. 


F80: Shi [Shi86] If n > 50, G = G(n) is Hamiltonian and for every two nonadjacent 
vertices u and v, d(u) + d(v) > 4n/5, then G is pancyclic. 


F81: Brandt, Faudree and Goddard [BFG98] If 6(G) > n/4+ 250, then the graph 
G is weakly pancyclic unless the order of the shortest odd cycle of G is 7, in which case 
C(G) = {4,6,7,...,c(G)}. 


F82: Brandt, Faudree, and Goddard [BFG98] If G is a 2-connected nonbipartite 
graph of sufficiently large order n with 6(G) > 2n/7, then G is weakly pancyclic. 


F83: Brandt [Bra97] Every G(n, |(n — 1)?/4] +2) is weakly pancyclic or bipartite. 


F84: Bollobds and Thomason [BoTh99] Every graph G(n, |n?/4| —n+59) is weakly 
pancyclic or bipartite. 
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8.1.9 Szemerédi’s Uniformity Lemma 


The Uniformity Lemma of Szemerédi, whose power and versatility could hardly be 
overemphasized, is one of the most remarkable tools in discrete mathematics. Loosely 
stated, it guarantees that every dense graph has some finite rough structure, which, 
surprisingly often, is the basis of successful attacks on difficult combinatorial problems. 
The Blow-up Lemma of Komlds, Sarkozy, and Szemerédi, a close relative of the Unifor- 
mity Lemma, has been used to solve a number of difficult graph embedding conjectures. 
For comprehensive surveys of this area see [KoSi96], [Kom99], and [Kom00]. 


DEFINITIONS 
D20: Let a>0. A tower of a of length k is the function 
where the exponentiation is done k times. 


D21: A bipartite graph with classes A, B is called an c-uniform pair if for every 
pair of vertex subsets X C A and Y C B with |X| > e|A| and |Y| > ¢|B| we have 
E(X,Y) E(A,B) 
IX||]¥] [ATI BI 
D22: A bipartite graph with classes A, B is called an (e€,6)-super-uniform pair if 
it is e-uniform and 


<eé 


d(u) > 6|B],...,d(v) 2 6|Al 


whenever every u € A, uv € B. 


FACTS 


F85: Szemerédi’s Uniformity Lemma [Sze76] For every ¢ > 0 there exist numbers 
No = no(e) and kg = ko(e) such that for every graph G of order n > ng there is a 
partition V(G) =Vo UV, U... UV satisfying these criteria: 

(i) k < ko(e); 

(ii) [Vo] < en, [Mil =... =|Vel5 

(iii) all but ek? pairs (V;,V;) are e-uniform. 
F86: The function ko(e) in Szemerédi’s Uniformity Lemma is bounded from above by 
a tower of 2s of length e~°. 


F87: Gowers’ bound [Gow98] There exist constants ¢g > 0 and c > 0 such that for 
0 <e< & , there is a graph G whose vertices cannot be partitioned according to criteria 


(i)—(iii) of Szemerédi’s Uniformity Lemma unless k is as large as a tower of 2s of length 
cea! /16, 


F88: Blow-Up Lemma of Komlés, Sarkézy, and Szemerédi [KSS97] Fix a graph 
R with V(R) = [r]. For every 6, A > 0 there exists ¢ > 0 such that the following holds. 
Fix a natural n and let Vi,...,V, be r disjoint sets of size n. Define the graphs R(n) 
and G as follows: 
(i) Set V(R(n)) = UV; and for every (i, 7) € E(R), place all edges between V; and Vj. 
(ii) Set V(G)) = UV; and for every (i,7) € E(R), place an (¢, 6)-super-uniform pair 
between Viand V;. 


If H C R(n) and A(H) < Athen HCG. 
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Applications of the Uniformity and Blow-up lemmas 


The Uniformity Lemma and the Blow-up Lemma are powerful tools in graph theory. 
We present below only four of their applications but, in fact, many results described in 
other sections are also obtained applying these two lemmas. 


FACTS 


F89: Komldés, Sdérkézy, and Szemerédi [KSS98] For every « > 0 and natural 
number k there exists a number no = no(k), such that if 


1 
n> No, G=G(n), and 6(G) > 1 yay) 
then the graph G contains the kth power of a Hamiltonian cycle. 


F90: Alon and Yuster [AlYu96] For every ¢ > 0 and natural number h there exists 
a number no = no(e, h), such that if H is a graph of order h with x(H) =k and 


n>no, G=G(hn), and 46(G)> (1 - - +] hn 


then the graph G contains an H-factor. 


F91: Komlés, Sdrk6zy, and Szemerédi [KSS01] Let H be a graph of order h and 
x(H) = k. There exist numbers c = c(H) and no = no(H) such that if 
1 
n>, G=G(hn), and 4d(G)> (1 _ :) hn+c 


then G contains an H-factor. 


F92: Komlés, Sdérkézy, and Szemerédi [KSSOla] For every « > 0, there exist 
numbers c and no such that for n > no, every graph G = G(n) with 6(G) > (1/2+¢)n 
contains every tree T of order n such that A(T) < cn/logn. 


8.1.10 Asymptotic Enumeration 


An intriguing question is how many graphs with given properties are there. For 
certain natural properties like “G is K,-free” or “G has no induced subgraph isomorphic 
to H” satisfactory answers have been obtained. 


DEFINITIONS 
D23: A graph property is a graph family closed under isomorphism. 


D24: A graph property P is called a monotone property if G € P implies H € P 
for every subgraph of G. 


D25: A graph property P is called an hereditary property if G € P implies H € P 
for every induced subgraph of G. 


D26: For any graph property P, set P? = {G: Ge P,v(G) =n}. The logarithmic 
density of P” is the value 


en(P) = (logs |P")/ (5) 
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D27: Let 0 < s <r be integers. A graph H is called (r,s)-colorable if its vertices 
can be colored in r colors, so that the vertices colored with the ith color are a clique for 
1<2z<_s, and an independent set otherwise. 


D28: The coloring number r(P) of a hereditary property P is the largest integer 
such that for some s, the family P contains every (r, s)-colorable graph. 


NOTATION 


N13: Given a hereditary property P, let exjna(n, P) denote the maximal number of 
edges in a graph Go = G(n) for which there is a graph G, = G(n) with V(G1) = V(Go) 
and E(G,)N E(Go) = 9, so that every graph G with G; C G C Go UG, belongs to P”. 


FACTS 


F93: Erdés, Kleitman, and Rothschild [EKR73] The number k,, of the K;..,-free 
graphs of order n is given asymptotically by 


1 
logs kn = (1 —=+ o(t)) n? 
r 


F94: Erdés, Frankl, and Rédl [EFR86] Let H be a graph with y(H) =r+1. The 
number h,, of the H-free graphs of order n is given asymptotically by 


1 
logs hn = (1 —-+ o(1)) n° 
r 


F95: Kolaitis, Proémel, and Rothschild [KPR87] For every n let k,, be the number 
of K,41-free graphs G(n) and h,, the number of r-chromatic graphs G(n). Then 


F96: A property P is monotone if and only if there exists some sequence of graphs 
F,, F2,... such that P is the collection of graphs having no subgraph isomorphic to an 
F;. 


F97: A property P is hereditary if and only if there exists some sequence of graphs 
F\, Fo,... such that P is the collection of graphs having no induced subgraph isomorphic 
to an F;. The coloring number of r(P) is exactly the maximal r such that for some 
0<s<rno F; is (r,s)-colorable. 


F98: Prémel and Steger [PrSt91] There exist numbers cy) and c; such that the 
number t,, of G(n) with no induced C4 is given by 
th = (ep + o(1))a”’ /4+n— (log n)/2 


where r = 0,1, r =n mod 2. 


F99: Prémel and Steger [PrSt92], [PrSt93] Fix a graph H, and let P be the hered- 
itary property “G has no induced subgraph isomorphic to H”. Then 


-1 

n 1 
im exina(n, P ae NT Fo 
dim exinaln.P)(5) (P) 
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F100: Alekseev [Ale92], Bollobaés and Thomason [BoTh95] Let P be a hereditary 
property. Then 
and the limit 

c(P) = lim c,(P) 


n—->co 


exists. 


F101: Scheinerman and Zito [ScZi94] For every hereditary property P, one of the 
following is true: 
(i) for n sufficiently large |P”| is identically 0,1 or 2; 
(ii) |P"| = @(1)n* for some integer k > 1; 
(iii) for some c2 > cy > 0, CT < |P”| < ch; 
(iv) for some c > 0, |P”| > n°. 


F102: Bollobas and Thomason [BoTh97] For every hereditary property P, 


-1 


o(P) = lim exina(n, P) e z i 


F103: Balogh, Bollobds, and Weinreich [BBW00], [BBW01], [BBW02] For every 
hereditary property P one of the following is true: 
(i) there exists a collection of polynomials {p;(n)}*_, such that for n sufficiently large 
|P"| = dojo Pi(r)i”; 
(ii) for some integer k > 1, |P”| = n‘ 
(iii) nGte())n ea |P”| < non’). 


(iv) for some integer k > 1, |P”| = n‘ 


1-1/k+o0(1))n. 


1-1/k+0(1))n?/2_ 


8.1.11 Graph Minors 


The study of graph minors was initially motivated by the conjecture of Hadwiger 
that every r-chromatic graph has K,. as a minor. However, from the extremal point of 
view, minors happen to be of their own fascinating interest. 


DEFINITIONS 
D29: Let G and H be graphs. We say that H is a minor of G, and we write 
G>H 


if there are disjoint sets W(u), u € V(A), such that W(u) induces a connected graph 
in G, and for every (u,v) € E(G), there is an edge between W(u) and W(v). 


D30: Let u(H) be the minimal number ju such that e(G) > v(G) implies that G > H. 


FACTS 


F104: Mader [Mad67], [Mad68] 
U(K,) < 8r logy r 
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F105: Bollobads, Catlin, and Erdés [BCE80], de la Vega [Fer83] For some C > 0, 


u(K,) > Crv/logr 


F106: Kostochka [Kos82], [Kos84], Thomason [Tho84] 
w(K) = O(rVogr) 


F107: Thomason [Tho01] There is an explicit constant a = 0.319... such that 


w(K) = (a+ o(1))rvV/logr 


F108: Myers and Thomaen [MyTh02] Given a graph H of order n, set 
= min — i —w(u)w(v) _ 
(A) = min — S- w(u) with S- n =n 
u€V(H) (u,v)€E(HA) 
where w(u) are nonnegative real numbers assigned to the vertices of H. Then 
w(H) = (ay(H) + 0(1))rvV/logr 


8.1.12 Ramsey—Turan Problems 


Ramsey—Turan problems are in fact Turdn-type problems with with restriction on 
the independence number. For a comprehensive survey of this topic see [SiSo01]; we 
present below only some of the highlights of the area. 


NOTATIONS 


N14: Let Fi,...,F; be fixed graphs. Let RT,(n,Fi,..., Fs, f(m)) denote the max- 
imal size of a graph G(n) with a(G) < f(n) whose edges can be colored in s colors 
so that there is no F; in the i** color. We write RT(n,Fi,...,Fs,f(n)) instead of 
RT,(n, Fi,..., Fs, f(m)) when s is understood. 


N15: Let R(s) be the maximal number R such that one can color the edges of the 
complete graph Kp in s colors, so that there is no monochromatic triangle and so that 
each star is colored in at most (r — 1) colors. 


FACTS 


F109: Erdés graph [Erd61] For every k there exists ¢ > 0 such that if n is sufficiently 
large there exists a graph F,,, = G(n) with girth g(G) > k and independence number 
a(G) < ni~€, 


F110: Erdés and Sés [ErSo70] 
r—1 
2r 


The lower bound comes from the following graph: take the Turan graph T;.(n) and add 
to each of its classes a copy of F,,3, where s is the size of the class. 


RT (n, Kor41, 0(n)) = n? + o(n”) 
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F111: Bollobds and Erdés [BoEr76a] For every ¢ > 0 and n sufficiently large, there 
exists a K4-free graph BE, = G(n) with a(BE,,) < en and |d(u) — n/4| < en for every 
ué€V(BE,,). Thus, 


1 
RT(n, K4, o(n)) > run + o(n?) 
F112: Szemerédi [Sze72] 


1 
RT(n, K4, 0(n)) < gu + o(n?) 


F113: Erdés, Hajnal, Sés, and Szemerédi [EHSS83] 


_ 3r—5 4» 


RT (n, 2r, o(n)) maa + 0(n?) 


To prove the lower bound consider the following graph: for 1 = [4n/(3r—2)] take 
BE, + T,~-1(n — 1), and add to each of the parts of T,_1(n — 1) a copy of F'.3, where s 
is the size of the part. 


F114: Erdés and Sés [ErSo70] 
1 
RT (n, K3, K3,0(n)) = rue + o(n?) 


F115: Erdés, Hajnal, Sés, and Szemerédi [EHSS83] 


=] 
n? + 0(n”) 


RT;(n, K3,..., K3,0(n)) = Rey 


F116: Erdos, Hajnal, Simonovits, Sés, and Szemerédi [EHSSS93} 


RT,(n, K3, K4,0(n)) = ; (1 - 5) n? + o(n?) 
RT,(n, K3, K5,0(n)) = ; (1 - =) n? + 0(n?) 


If p and q are odd integers then 


1 
RT; (n, Cp, Ca, o(7)) = ra ae o(n”) 
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INTRODUCTION 


The field of random graphs came into its own with papers of Erdds and Rényi in 
1959-61. Earlier it had surfaced mainly in probabilistic proofs, where facts about ran- 
dom graphs were used to prove the existence of graphs with desired properties. Then, 
progressively, many interesting features of the random graphs themselves were discov- 
ered. For instance, a large random graph can be relied upon (in a sense to be made 
precise) to have diameter 2 and to contain any arbitrary fixed subgraph. Consequently, 
it will contain large complete subgraphs, and thus will have large genus and will not be 
k-colorable for small k. A number of models of random graphs are commonly studied, 
including variations arising from areas such as communication networks. Several mono- 


graphs have been devoted to the subject ({Bo01], [Pa85], [Ko99], [JaLuRu00], [Sp01]). 


8.2.1 Random Graph Models 


NOTATION: For 88.2, let n be a positive integer and let p be a real number, 0 < p < 1, 
and q = 1—p. An n-vertex simple graph G has vertex set V = [n] = {1,...,n}. The 
number of edges in the complete graph Ky, is N = ae 

NOTATION: Probability and expectation are denoted by P and E, respectively, and 


variance is denoted by Var. With no base shown, log denotes the natural logarithm, 
with base e. 
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DEFINITIONS 


Here we consider the two most common random graph models. 


D1: For0 < p< 1, the binomial (or Bernoulli) random graph, denoted by G(n, p), 
is a probability space whose underlying set is the set of n-vertex graphs. The probability 
function is determined by specifying that the edges of kK, occur independently with 
probability p each. Equivalently, the probability of any given graph with m edges is 
defined to be 


m_/N-—m 


pe4@ 


D2: For0<m<_N, the uniform (or Erdés—Rényi) random graph, denoted by 
G(n,m), is the uniform probability space on those graphs with exactly n vertices and 
m edges. Thus, the probability of any n-vertex m-edge graph is 


(n) 


TERMINOLOGY NOTE: To denote that G is a random graph with the probability distri- 
bution of G(n,p), we write G € G(n, p); alternatively we write G(n,p) for such G. 


D3: An event is a subset of the graphs in whichever model is under discussion. Given 
a graph probability space, every graph property Q defines an event in a natural way, 
being the set of graphs with property @. This event is also denoted by Q. 


D4: A graph property Q is increasing if a graph G has property Q whenever one of 
its spanning subgraphs has Q. 


D5: A graph property Q is convex if G has property Q whenever G; C G C G2 for 
some G1, G2 both having property Q and with the same vertex sets as G. 


D6: Decreasing properties are the complements of increasing properties. 


D7: Monotone properties are either increasing or decreasing. 


EXAMPLES 
El: Let Q be the graph property is complete. Then for G(n, p), we have 
P(Q) =p” 


E2: Let Q be the property vertex 1 is isolated. For the Bernoulli graph G(n,p), we 
have 


P(Q)=q""" 
For the Erdés—Rényi graph G(n,m), we have 


ee) 


P(Q) = a 
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E3: The properties in the two previous examples are monotone and hence convex, 
as are the properties G has a subgraph in F for any family of graphs F, and G has 
minimum degree k for any k. 


E4: The property G has diameter exactly 2 is neither increasing nor decreasing, but 
is convex, whereas G has a vertex of degree exactly 2 is not convex. 


REMARK 


R1: The independence of the edges in G(n,p) tends to simplify calculations of proba- 
bility. 


Asymptotics 


Most of the interest in random graphs lies in the asymptotic behavior as n > oo, with 
p= p(n) and m = m(n) functions of n. 


DEFINITIONS 


D8: An event A, holds asymptotically almost surely (a.a.s.) if P(A,) > 1 as 
n — co. This applies to events A,, defined on any sequence of probability spaces 
indexed by n, such as the Bernoulli random graph G(n,p) with p = p(n) a function of 
n, or the Erdés—Rényi random graph G(n,m) with m = m(n). 


NOTATION: Suppose that |f| < ¢g, for some functions f(n), g(n), and ¢(n). 
e If d(n) is bounded, then we write f = O(g). 
e If é—>0asn— o, then we write f = o(g) or alternatively, f <g org > f. 
e If f = O(g) and g = O(f), then we write f = O(g). 
e If f(n) = (1+ o0(1))g(n), then we write f ~ g. 


CONVENTION: The appearance of o(g) in a formula denotes a function f for which 
f = 0(g), and the same convention applies to O(g) and O(g). 

NOTATION: If S is a statement about a sequence of random variables involving any of 
these notations, rather than an event, we write “a.a.s. S” to mean that all inequalities 
|f| < ¢g that are implicit in S hold a.a.s. 

TERMINOLOGY NOTE: Our definition of o(g) is nonstandard, equivalent to the usual 
definition, but also accommodating the a.a.s. versions. For instance, [JahuRu00] use 
f = Oc(g), f = Ec(g) and f = o,(g) for a.as. f = O(g), aas. f = O(g) and a.as. 
f = 0(g), respectively. 

TERMINOLOGY NOTE: Elsewhere, the notations a.e. (almost every), whp (with high 
probability), or a.s. (almost surely) are sometimes used instead of a.a.s. 


D9: Let X1, X2,... be random variables and \ > 0 constant. We say that X = Xy is 
asymptotically Poisson with mean » if 
—dy\k 
P(X =k)= —~* +0(1) 
for all fixed integers k > 0, as n — oo. This also applies if 4 = A(n) is a bounded 
function of n. 
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D10: For a random variable X with 0 < VarX < oo, the standardized variable is 
X = (X —EX)/VVarX 
D11: We say that X = X,, is asymptotically normal if for all fixed values of a, 


1 a 
P(X <a) =o(l)+—e fo“ Pae as N — 00 


FACTS 


F1: (See [Bo01] and [JaLuRu00].) Let Q be a graph property and 0 < p = p(n) < 1 
such that pgN — co. If m = m(n) is a positive integer function, define x = x(m,n) by 
m=pN+2V/pqN. 

(i) If Q is a.a.s. true in G(n,m) whenever x is bounded, then Q is a.a.s. true in G(n, p). 
(ii) The converse of (i) is true if Q is convex. 


F2: (Following from [Bo79], for example.) Let 0 < p < 1, let H be a fixed graph, and 
F an induced subgraph of H. Then a.a.s. for G € G(n,p), every isomorphism of F' with 
an induced subgraph of G extends to an isomorphism of H with an induced subgraph 
of G. It follows that for k fixed, 

(i) a.a.s. every vertex in G is in a complete subgraph of size k and in an independent 
set of size k, 

(ii) a.a.s. G has diameter 2 and is k-connected; moreover, deleting any k vertices from 
G leaves a graph of diameter 2. 


REMARKS 


R2: Fact Fl shows that the two models under consideration share many properties. 
For instance, the properties in Fact F2 hold also in G(n,m) with m ~ cn?, any constant 
0<c< 1/2. For this reason, we usually limit ourselves to stating properties of just one 
model, when the corresponding property holds in the other model by Fact F1. 


R3: The topics of random graphs, and asymptotic enumeration of graphs, are inti- 
mately intertwined; many results in either area owe their existence to techniques from 
the other, especially with the model G(n,m). For example, one may derive the asymp- 
totic number of graphs with n vertices, m edges and no triangles, by multiplying (*) by 
the probability that G € G(n,m) has no triangles. Such results can be equally appealing 
when stated in either form. For uniformity we state only the random graph form here. 


8.2.2. Threshold Functions 


DEFINITIONS 
D12: A threshold function for a property Q in G(n,p) is a function f(n) such that 
for G € G(n,p), with p = p(n), 


0 ifp=o(f) 
P(G has Q) > ‘° fe p00) 


or alternatively, such that this is true with 0 and 1 interchanged. 
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D13: A threshold function f for a property Q is sharp if for every fixed € > 0, 


0 ifp<(1-f, 
P(G bs Q) > {! ifp>(1+6)f 


(or the same with 0 and 1 interchanged). 


FACTS 
F3: [BoTh87] Every monotone property has a threshold function in G(n, p). 


F4: [FrkKa96] Every monotone property with a threshold function f in G(n,p) such 
that log(1/f) = o(logn) has a sharp threshold function. 


F5: [AcFr99] For fixed k > 3, the property of being k-colorable has a sharp threshold 
function in G(n, p). 


F6: [ShSp88] Let A be a property expressible in the first-order theory of graphs, that 
is, using variables to represent vertices, using the equality and adjacency relations, and 
the usual Boolean connectives, and the quantifiers V, 4. For any irrational a,0 <a <1, 
in G(n,p) with p = n~°, A is either a.a.s. true or a.a.s. false. 


F7: [ErRé60] For G € G(n,p) and fixed k > 1, (logn)/n is a sharp threshold function 
for the minimum vertex degree of G being at least k. 


F8: [BoFr85] For G € G(n,p) and fixed k > 1, (logn)/n is a sharp threshold function 
for the property that G has k edge-disjoint Hamilton cycles. 


Strengthened versions of Facts F7 and F8 are given later. 
REMARKS 


R4: For a given monotone property, all sharp thresholds are clearly asymptotically 
equal. 


R5: Statements analogous to Facts F3 and F4 also hold in more general probability 
spaces concerning random subsets of a set (with an extra symmetry condition, in the 
case of Fact F4). 


R6: Threshold functions are known for many properties, and there are many proper- 
ties for which more accurate information is known than the mere existence of a sharp 
threshold. 


R7: Fact F6 is an example of a zero-one law in random graphs; see [JaLuRu00] and 
[Sp01] for much more on this topic. 


R8: Facts F7 and F8 together imply that (logn)/n is a sharp threshold function for G 
being k-edge-connected [ErRé61], having a matching which either is perfect (if n is even) 
or meets all but one vertex (if n is odd) [ErRé66], having a Hamilton cycle [KoSz83]. 


R9: Threshold functions are also defined in G(n,m): f is a threshold function for a 
property Q if P(G has Q) > 0 for m = o(f) and P(G has Q) — 1 for f = o(m) (or with 
0 and 1 interchanged). Sharp threshold functions are then defined in the obvious way. 
From Fact F1, snlogn is a threshold function in G(n,m) for the properties mentioned 
in Facts F7 and F8, and is, moreover, sharp. 
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8.2.3 Small Subgraphs and the Degree Sequence 


DEFINITIONS 


D14: The maximum density of a graph G is 


|E(F)| 
IV (F)| 


u(G) = max f : FCG, |V(F)| > of 


D15: A graph G is strictly balanced if its maximum density is achieved uniquely by 
F=G. 


FACTS ABOUT SMALL SUBGRAPHS 


NOTATION: Here H is a fixed graph with at least one edge, and Xz (Xz) denotes the 
number of subgraphs of G € G(n,p) (G € G(n,m), respectively) isomorphic to H. 


F9: [Bo81] Let H be strictly balanced with k vertices and j > 2 edges, and automor- 
phism group of order a. Let c > 0 be fixed and p = cn—*/J, Then Xy is asymptotically 
Poisson with mean c? /a. 


F10: (See [Bo01].) For arbitrary H, a threshold function for {Xq > 0} is n~ Ve), 


F11: [Ru8g] If |E(4)| > 1 then the distribution of Xq is asymptotically normal if and 
only if np““#) — oo and n?(1— p) > oo. 


F12: [Ja94] If |E(H)| >1,m> Vn, N—m> Vn and ns(m/N)“™ — oo, then Xyq 
is asymptotically normal. 


F13: [JaLuRu90] For every p = p(n) < 1, e~¥#/G-?) < P(Xy = 0) < ce P(™*) where 
Uy = min{E(Xp) : FC H, |E(F)| > 0}. 


F14: [OsPrTa03], [PrSt96] Let P(n,m) = P (G(n,m) ae For all € > 0, 


P(Xx, =0)~ P(n,m) if m>(1+ ni? Viogn 


and P(Xx, =0) > P(n,m) if n/2<m< (1- 6) 3n3/2 Jlogn. 


F15: [Wo96] For p = 0(n~?/3), 
P(X, = 0) wv erben teint able! 
For d= m/N = o(n-?/3), 
P(X, =0)~ es 
F16: [PrSt92] For G € G(n, 4), P(X = 0) ~ Pr(G is k—colorable) iff H has chromatic 


number & + 1 but contains a color critical edge (i.e., an edge whose omission from H 
reduces the chromatic number to k). 


FACTS ABOUT THE DEGREE SEQUENCE 


NOTATION: The number of vertices of degree k in a random graph is denoted D;, and 
d, >--- > d, is a descending ordering of the degrees of the vertices. 
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F17: [ErRé61] Let & be a fixed natural number, x a fixed real, and m = m(n) = 
sn(logn + kloglogn + x + 0(1)). In G(n,m), P(dn = k +1) > e~* */*, and a.as. 
dyn =k ork +1. (Note that d,, is the minimum vertex degree.) 


F18: [Bo01] Let k = k(n) be a natural number, and for fixed € > 0 let en~3/? < p= 
p(n) < 1—en~3/?, If ED;, — c then in G(n, p), the random variable D;, is asymptotically 
Poisson with mean c. (Note that ED; =n(",')p*(1—p)"-*.) 


F19: [BaHoJa92] If k = k(n) and either (i) np > 0 and k > 2, or (ii) np is bounded 
away from 0 and (np)~!/?|k — np| > oo, then the random variable D, is asymptotically 
Poisson in G(n,p), in the sense that the total variation distance between Dy = Dx(n) 
and a Poisson random variable Z = Z(n) tends to 0 as n — oo. (In some cases 


EZ(n) # O(1).) 


F20: [Bo01] Suppose that p(1—p)n > (log n)? and y is a fixed real number. Then for 
every fixed m, in G(n,p) 


m-1 


: es —ky /p) 
Jim P(dm < f(n,p,y)) =€ » e*4 Jk! 


where f(n,p,y) = pn + y/2p(1 — pynlogn(1 fogweee 4 yo ). 


F21: [Bo01] The random graph G(n, p) a.a.s. has a unique vertex of maximum degree 
and a unique vertex of minimum degree iff np(1 — p) > logn iff G(n,p) a.a.s. has a 
unique vertex of maximum degree or a unique vertex of minimum degree. 


> 
Il 


F22: [McWo97, Theorem 2.6] Let B,(n) be a sequence of n independent binomial 
variables, each Binom(n—1, p) (which is the distribution of the degree of any given vertex 
in G(n,p)). Consider the degree sequences D,(n) of G(n,p), and Dm(n) of G(n,m). Let 
A be any event defined on sequences, and assume that either logn/n? < p(1— p) « 
n—/2 or liminf p(1 — p)logn > 2/3. Then the following hold: 


(i) If m = pN is always an integer, then the probabilities of the event A in the two 
models D,,(n), and B,(n) restricted to sequences with sum 2m, differ by o(1). 


(ii) Choose p’ from the normal distribution with mean p and variance p(1 — p)/(2N), 
truncated to the unit interval (0,1). Then the probabilities of the event A in the 
two models D,(n), and 6,/(n) restricted to sequences with even sum, differ by 


o(1). 


REMARKS 


R10: The facts about small subgraphs concern the number of subgraphs isomorphic 
to a fixed graph H. Similar results on the number of induced copies of H were obtained 
by Janson (see [JaLuRu00, Chapter 6]). The property of having an induced subgraph 
isomorphic to H is not monotone; nevertheless there will usually be a “local threshold” 
near which the probability moves from o(1) to 1 — o(1) as the edge density increases, 
and later a second (disappearance) local threshold, where it changes back to o(1). For 
the existence of a copy of H vertex-disjoint from all other copies, again there are two 
local thresholds, with significant pioneering results in [Su90]; see also the discussion in 
(JaLuRu00]. Threshold results on covering every vertex by a copy of H were given by 
Spencer [Sp90], as part of more general results on extending all partial embeddings of 
k vertices of H. For threshold results on the property that every coloring of the edges 
of G(n, p) contains a monochromatic copy of a given graph G,, see [R6Ru95]. 
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R11: On probabilities in the tail of the distribution of Xy and x. H, exponentially 
small upper bounds have been obtained but are not always sharp ([Vu01],[JaRu02]). 
There are a number of other papers estimating the probability of nonexistence of a 
given subgraph, or equivalently results on the number of graphs which do not contain 
the subgraph, e.g., [PrSt96a]. 


R12: Fact F17 (and a number of similar ones) are stated with limiting probability 
strictly between 0 and 1, covering the whole range. For monotone properties, this implies 
the results for limiting probabilities equal to 0 and 1. For example, from Fact F17 it 
follows that with m = $n(logn + kloglogn + 2), 


P(d, >k+1)7>1forz—-oo and P(d, >k+1)— 0 for x > —co 


R13: Many properties of the degree sequences D,(n) and D,,(n) of the random graphs 
G(n,p) and G(n,m), respectively, follow from Fact F22. It is conjectured in [McWo97] 
that the restriction on p can be relaxed to simply p(1 — p) > logn/n?, which covers all 
p of any interest whatsoever. 


8.2.4 Phase Transition 


Erdés and Rényi initiated the study of the random graph G(n,m) as an evolving 
object, growing from a sparse, disconnected graph for small m to a highly connected 
graph for large m, and finally a complete graph when m = N. The biggest issue in 
this study has been the phase transition at m ~ $n, where increasing m by o(n) can 
change the size of the largest component from a.a.s. O(log n) to a.a.s. nearly a constant 
times n. Here we state properties of G(n,m) or G(n,p) in a large neighborhood of this 
phenomenon. Most of these translate from one model to the other by Fact F1. We begin 
with a simple statement shown by Erdés and Rényi, the tripartite nature of which leads 


to the term double jump. 
FACT 


F23: [ErRé60] Fix c > 0, and for c > 1 define b = b(c) so as to satisfy b+ e~°° = 1 (see 
Figure 8.2.1). Let Z denote the number of vertices in the largest component (called the 
giant) in G € G(n,m) where m = |cn/2]|. Then a.a.s. 


O(log n) ifc <1, 
L= ¢ Q(n?/3) ife=1, 
(b+o0(1))n ife>1. 


More precise examination by Bollobas began a revelation of details on how the phase 
transition takes place. The phase transition was eventually shown to have width of 
the order of n?/°. So we call m = n/2 + O(n?/3) the critical phase; before that is 
subcritical, and after is supercritical. Retrospectively, the significance of the “double 
jump” is mainly historical, as it manifests itself only when requiring m to be a fixed 
constant times n. 


DEFINITIONS 


D16: The excess of a graph with n vertices and m edges is m — n. 


988 Chapter 8. Analytic Graph Theory 


Cc 
Figure 8.2.1: The growth of the giant. 


D17: A connected graph is complex if its excess is at least 1 (so there are at least 
two cycles). 


D18: The k-core of a graph is the largest subgraph with all its vertex degrees at 
least k. (It is quite easy to see that the k-core is unique.) 


FACTS 
For many of these, specific bounds on the error terms are known but unstated here. 


Throughout the phase transition 


F24: [Ja93] For any m(n) and any k < n?/9, the random graph G(n,m) a.a.s has no 
complex component with fewer than k vertices. 


F25: [Bo01, Theorem 5.15] For any p(n) and any k > n?/%, the random graph G(n, p) 
a.a.s. contains no component which is a tree of order at least k. 


F26: [ErRé60] If 0 < c 4 1 is fixed and m ~ cn/2, then the size of the largest tree 
component in G(n,m) is a.a.s. (a + o(1)) logn where a = a(c) = 1/(c— 1— loge). 


Figure 8.2.2 charts the size of the largest tree component. 


Subcritical phase: n/2—m > 2/3 


F27: [ErRé60] If 0 < c < 1 is fixed and m ~ cn/2, then the probability that G(n, m) 
is a forest is asymptotic to e°/ 2402/4, /T — c, and the expected total number of vertices 
belonging to cycles tends towards c?/(2 — 2c). 


F28: (See [JaLuRu00].) Let r > 1 be fixed and n?/3 <s <n. In G(n,n/2—), the r 
largest components are a.a.s. trees of order (1/2 + 0(1))(n/s)? log(s*/n?). 
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Figure 8.2.2: The size of the largest tree component. 


F29: [JaLuRu00] For m < n/2, the probability that G(n,m) contains a complex 
component is less than 2n?/(n — 2m). (Note that this tends to 0 in the subcritical 
phase.) 


F30: [Bo01 Corollary 5.8] For p = c/n, 0 <c < 1, and any w = w(n) > ~w, GE G(n,p) 
a.a.s. has at most w vertices in unicyclic components. Consequently, the length of the 
longest cycle is a.a.s. at most w. 


F31: [JaLuRu00 Section 5.4] Let n?/3 <s <n. For any w =w(n) — 00, the length ¢ 
of the longest cycle in G(n,n/2— s) a.a.s. satisfies 0/w <n/s < tw. 


Critical phase: m = n/2 + O(n?/3) 


F32: [JaLuRu00 Section 5.5] Let m = n/2 + O(n?/3) and let rq be the number of 
components of G(n,m) with excess a. For any w = w(n) + 00, a.as. 1,51 47a < w, 


and the total number of vertices in complex components of G(n,m) is at most wn2/°, 


F33: [JaKnLuPi93 Theorem 5] Let m = n/2+O(n'/3) and fix q > 1. The probability 
that in G(n,m) there are exactly r; components of excess 7 for 1 < i < q and none of 


greater excess is 
ANS OE? ee Ce rl ay 
es LO /3 
(5) V3 TL rel. gh ar)! ae) 


where r = 7} +2r2+---+qrq and the c; are (easily computed) constants. The probability 
that there are no components of excess 2 or more is 1/3 cosh \/ + O(n-/9) = 0.9325. 


F34: [LuPiWi94], see also [A197] Let m = n/2 + cn?/? where c is constant. If (5;, E;) 
gives the size and excess of the ith component of G(n,m), listed so that S$; is nonin- 
creasing with i, then the random sequence (n~?/3,, E,), (n~?/3$5, E2),... converges 
in distribution to some random sequence (X11, X12), (X21, X22),... as nN > oo. 
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F35: (See [JaLuRu00 Section 5.5].) If m = n+ O(n?/%) and w = w(n) > oo then the 
length @ of the longest cycle in G(n,m) a.a.s. satisfies €/w <n'/3 < bw. 


Supercritical phase: m — n/2 >> n?/° 


F36: [Bo84 and Lu90] Define b so as to satisfy b+ e~2°"/" = 1. If m—n/2 > n2/9 
then in G(n,m), a.a.s. there is a complex component with bn + O(n?/2) vertices, while 
every other component is not complex and has less than n2/° vertices. 


F37: [Lu91] Let n?/? <s <n. The longest cycle in G(n,n/2 +s) a.a.s. has length 
between (16/3 + 0(1))s?/n and (7.496 + 0(1))s?/n. 


F38: [Lu91] For any w = w(n) > oo, the length ¢ of the longest cycle in G(n,n/2 + s) 
outside the largest component a.a.s. satisfies 0/w <n/s < fw, as does the length of the 
shortest cycle in the largest component. 


F39: [Lu90], [JaKnLuPi93], [PiWo03] Define 6 as in Fact F36, c = 2m/n and t = c—cb. 
By Fact F36 we may assume the largest component is unique. Let Y; denote the number 
of vertices in the 2-core of the largest component of G(n,m), Y2 the number of vertices in 
the largest component not in the 2-core, and Y3 the excess of the largest component. For 
m = O(n) with m—n/2 > n?/3, aas. Y; ~ (1—t)bn, Yo ~ thn and Y3 ~ b(c+t—2)n/2. 
Furthermore, each of the three variables is asymptotically normally distributed, and so 
are the numbers Yj + Y2 (a.a.s. ~ bn) of vertices and Y; + Y2 + Y3 (a.a.s. ~ b(c + t)n/2) 
of edges in the largest component. 


REMARKS 


R14: Let G’ denote the graph obtained by deleting the largest component from G(n,m) 
(or, if there is more than one largest component, deleting all of them). From Fact F239, 
it follows that form = O(n) and m—n/2 > n?/, G' a.a.s. has any particular property 
which is a.a.s. true for G(n’,m’) as n’ + co when m’ ~ Stn’. (See Theorem 5.24 in 
[JaLuRu00] for a more precise statement.) Here te~’ = ce~°, t € (0,1). 


R15: Other interesting facts about the phase transition can be seen by viewing the 
random graph as a process (see Section 8.2.8). 


8.2.5 Many More Properties of Random Graphs 


NOTATION: The connectivity of a graph G is denoted «(G), the edge connectivity X(G), 
the minimum vertex degree 5(G), the independence number a(G), and the chromatic 
number x(G). 


FACTS ON CONNECTIVITY, DIAMETER, MATCHINGS, CYCLES AND PATHS 


The next fact implies the threshold for the property of being k-connected, and this 
coincides with having minimum degree k. 
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F40: [ErRé61] For fixed k > 0 and m = m(n) = $n(logn + k log logn + « + o(1)), 
P(K(G(n,m)) =k) >1—e7° /" and aas. K(G(n,m)) =k or k+1 

F41: [BoTh85] For G € G(n,p) and any p, a.a.s. K(G) = A(G) = 6(G). 


F42: [Bo81a] Let p be fixed, 0 < p < 1, andt = |n/"|. For G € G(n,p) let Gp = G and 
Gi = Gi_-1—; (1 <i <t), where v; is any member of the set $; of vertices of minimum 
degree in G;_ 1. Then a.a.s. |S;| = 1, so v; is uniquely determined, for 1 <i <t. Also, 
a.as. K(G;) = 6(G;) for all 7, and 6(G;41) > 6(G;) + t for 0 <i <t. 


F43: [Bo81b] For constant c > 0 and d = d(n) > 2 an integer, we define p by 
p?n?—! = log(n?/c). If pn > (logn)?, then for G € G(n,p), a.a.s. the diameter of G is 
either d or d+ 1, and the probability it is d tends towards e~°/. Corollaries of this are: 
if p?n — 2logn > o0 and n?(1— p) > oo then a.a.s. G(n,p) has diameter 2; 

ifm < N and 2m?/n3 — logn > oo then a.a.s. G(n,m) has diameter 2. 


Facts F44 and F45 discuss properties for which the obvious necessary condition, mini- 
mum degree at least 1, has the same threshold (see Fact F17). The first is the classic 
special case of the second. 


F44: [ErRé66] For p = p(n) = (logn + # + o(1))/n, the probability that G € G(n, p) 


has a perfect matching tends to e~° 


as n — oo with n restricted to the even integers. 
F45: [LuRu91] Let T be a tree with at ¢t > 2 vertices. For p = p(n) = (logn+ a+ 
0(1))/n, the probability that G € G(n,p) has a T-factor (i.e., a spanning subgraph each 


e~" as n— oo with n restricted to 


of whose components is isomorphic to T’) tends to e~ 
the integers divisible by t. 


More generally, one can ask for the threshold of existence of a spanning subgraph with 
all components isomorphic to a given graph H (assuming n is divisible by |V(H))]). 
The case when H is a triangle was an outstanding unsolved problem for many years. 
The threshold was shown for strictly balanced graphs by Johansson, Kahn and Vu to 
coincide with the threshold for the property that every vertex is contained in a copy of 
H. We give only the triangle case here. Their argument also extends to solve a well 
known problem of Shamir on perfect matchings in hypergraphs. 


F46: [JoKaVu08] Restrict n to integers divisible by 3. For the property that the vertices 
are covered by a set of disjoint triangles, n~?/3(log n)!/° is a threshold function. 


Fact F47 concerns a property for which the obvious necessary condition, minimum 
degree at least 2, has the same threshold. (See also Remark 9 and Fact F76.) 


F47: [KoSz83] For p = p(n) = (logn + loglogn + x + o(1))/n, the probability that 


G € G(n,p) has a Hamilton cycle tends to e~* as n > co. 


F48: [AjKoSz81] Let f(c) be the supremum of all § such that G(n,p = c/n) aa.s. 
contains a path of length at least Gn. Then f(c) > 0 for c > 1, and lime. f(c) = 1. 
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FACTS ON INDEPENDENT SETS AND CHROMATIC NUMBER 


F49: [BoEr76], [Ma76] Let c < 1 and suppose that n~° < p = p(n) < ¢ for all 
56 > 0. For fixed € > 0 the independence number a(G) of G € G(n,p) a.a.s. satisfies 
ry <a(G) < re where 

r; = |2log, n — 2log, log, np + 2 log,(e/2) +1+ (—1)’e/p], 


with b= 1/(1—>p). 


F50: [Fr90] Let ¢ > 0 and r; = |2p~ 1! (log np— log log np +log(e/2) + (—1)*e) |. For some 
constant C., the independence number a(G) of G € G(n,p) a.a.s. satisfies the inequality 
ry <a(G) < re provided that C./n < p= p(n) < log~? n. 


F51: [Bo8g], [Lu91] Let c < 1 be constant. If 1/n < p= p(n) < c, then for G € G(n, p) 
a.as. x(G) ~ nlog Tp/(2 log np). 


F52: [AcNa06] Let p = d/n for constant d > 0. The chromatic number of G € G(n, p) 
is a.a.s. either kg or kg + 1, where kg is the smallest integer k such that d < 2k logk. 
Moreover, if d lies in the interval [(24 — 1) logk, 2k logk)], then for G € G(n,p), a.a.s. 
x(G) =k +1. 


F53: [A193], [Kr00] Let € > 0. If p(n) > n-1/4+¢, then a.a.s. for G € G(n, p), we have 
x(G) ~ x1(G), where ;(G) is the choice number (or list-chromatic number) of G. 


F54: [PiSpWo96] For fixed & > 3, the existence of a k-core in G(n,p) has a sharp 
threshold function p = cz/n where cy = infysof{ue"/f(u,k)}, f(u,k) = 9, we /il. 
(The k-core of a graph is the maximum subgraph of minimum degree at least k.) 


F55: [Mo96] Let c, be as above and dy = sup{d : y(G(n,d/n) < k}. Then cy 4 dy 
for k > 4. (From Fact F54, we have cz, < dx, since a graph with no k-core can always 
be (k — 1)-colored.) 


FACTS ON PLANARITY, GENUS, AND CROSSING NUMBER 


F56: [ErRé60] For planarity in G(n,m), m = n/2 is a sharp threshold function. More 
precisely [LuPiWi94], there is a function f such that 0 < f(x) < 1, f(a) > 0 as 
xz — —o0, f(x) 4 1 as x — 00, and P(G(n,m) is planar) > f(c) for m = n/2 + en2/3 
(where c is constant). 


F57: [ArGr95] If p?(1—p)? > 8(logn)*/n, the genus of G(n, p) is a.a.s. (1+0(1)) Gpn?. 
F58: [R6Th96] For every integer i > 1, if n~/“+) «< p « n-“-Y/", the genus of 
G(n,p) is a.a.s. (1 + 0(1))(é/4(i + 2))pn?. 


F59: [SpT602] The expected value of the crossing number of G € G(n,p) (ie., the 
minimum number of crossing points in a drawing of G in the plane, no three edges 
crossing at the same point) is o(p?n*) if p ~ 1/n and O(p?n*) if p = c/n for fixed c > 1. 


FACTS ON EIGENVAUES, AUTOMORPHISMS, AND UNLABELED GRAPHS 


F60: [FiiKo81] Let A; > A2 >... > Ay be the eigenvalues of the adjacency matrix of 
G and let p(n) =c. Then a.a.s. in G(n, p), 
Ai ~ np and max |As| = 2\/pgn + O(n’? log n) 
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F61: [Wr70] The expected number of automorphisms of G € G(n,m) tends to 1 iff 
min(m, N — m)/n — (log n)/2 > oo. 


The next fact follows from this and explains why random unlabeled graphs are rarely 
studied: for many purposes they have the same properties as random labeled graphs. 


F62: [Bo01] Let Q be a property of graphs of order n, 0 < c¢ < 1, and suppose that 
m = m(n) is such that min (m,N — m)/n— (logn)/2 + oo. Let G € G(n,m) and 
choose an unlabeled graph H on n vertices and m edges uniformly at random. Then 
P(G has Q) > c iff P(H has Q) > c. 


8.2.6 Random Regular Graphs 


There is an abundance of interesting models of random graphs besides G(n, p) and 
G(n,m). One of the most common is a restriction of G(n,m), to graphs with specified 
degree sequence. Random regular graphs are a special case of this. 


NOTATION: For d > 0 and dn even, G,,q is the probability space containing just the 
d-regular graphs on n vertices, all being equiprobable. 


FACTS 


The next fact comes easily from the enumeration formulae of Bender and Canfield 
[BeCa78] or alternatively from the model of random regular graphs given by Bollobds 
[Bo80]. (See the survey [Wo99], for example.) 


F63: For fixed d and any fixed graph F' with more edges than vertices, a random 
regular graph G € G,q a.a.s. contains no subgraph isomorphic to F’. 


F64: [Bo80], [Wo81] For d > 0 and k > 3 fixed, the number of cycles of length k in a 
graph in G € G,.q is asymptotically Poisson with mean (d—1)*/2k. 


F65: [KrSuVuWo01], [CoFrRe02] For 3 < d = d(n) < n—4, a random regular graph 
G € Gn is a.a.s. d-connected. 


F66: [BoFe82] Fix d > 3 and e > 0. The diameter D(G) of G € G,,q¢ a.a.s. satisfies the 
inequalities 


d—2 
1+ [logg_, | + jos ( bd log n)| < D(G) < 14 flogg_,((2+ 6)dnlogn)] 


F67: [McWo84] If 3 < d = d(n) = o(,/n), then the expected number of automorphisms 
of G € Gna tends to 1 as n > oo. (c.f. Fact F61.) 


F68: [KiSuVu02] If 3 < d= d(n) < n—4, then G € G,q a.a.s. has only the trivial 
automorphism. 


F69: [KrSuVuWo01], [CoFrReRi02] If 1 < d = d(n) < 0.9n, then for G € Gna, a.a.s. 
a(G) ~ 2logd/log(n/(n — d)) and x(G) ~ n/a(G). 


F70: [RoWo94] For fixed d > 3, arandom regular graph G € G,y,q a.a.s. has a Hamilton 
cycle and, for odd d, a.a.s. has edge chromatic number equal to d. Indeed, [KiWo01] for 
fixed d > 3, G € Gy q a.a.s. has a partition of its edge set into the edges of g Hamilton 
cycles (for d even), or 44+ Hamilton cycles and a perfect matching (for d odd). 
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F71: [KePéWol0] For fixed d > 3, a random regular graph G € G,,.¢ asymptotically 
almost surely has chromatic number either k or k — 1, where k is the smallest integer 
satisfying d < 2(k — 1) log(k — 1). If, moreover, d > (2k — 3) log(k — 1), then the value 
is k is a.as. 


F72: [ShWo07] A random 4-regular graph G € G,,,4 asymptotically almost surely has 
chromatic number 3. 


F73: [KrSuVuWo01], [CoFrRe02] For 3 < d= d(n) < n—1, a random regular graph 
G € Gy is a.a.s. hamiltonian. 


F74: [Mc81] For fixed d > 2 and |y| < 2d —1, the proportion of eigenvalues of the 
adjacency matrix of G,,q which are at most y is a.a.s. 


y = — 2 
a(t) + f d ee 1) - Pa. 
—2/d—1 Qn (d — x?) 


F75: [Fr08] For d even and fixed, the second-largest eigenvalue (in absolute value) of 
the adjacency matrix of Gyq is a.a.s. at most 2/d — 1+ o(1). 


F76: [BrFrSuUp99, Lemma 16] The second-largest eigenvalue (in absolute value) of 
the adjacency matrix of Gna is a.a.s. O(Wd) provided d= 0(\/n). 


REMARKS 


R16: There are interesting relationships between G,,,qg and other random graph models, 
expressed in terms of contiguity (see [Wo99] or [JaLuRu00}). 


R17: The behavior of the size of the largest independent set, smallest dominating set, 
and related functions of G,,q is not very well determined (see the bounds in [Wo99]}). 


R18: There are many results on random graphs with given degree sequences, usually 
obtained by the same methods as for random regular graphs. For instance, random 
graphs with given degree sequence, all elements of which lie between 3 and d < n%-, 
a.a.s. have connectivity equal to minimum degree [Lu92]. Properties of the emerging 
giant component in random graphs with given degree sequences are studied in [MoRe98]. 


8.2.7 Other Random Graph Models 


A vast number of other random graph models, and closely related probabilistic mod- 
els, have received much attention. Not attempting a complete or balanced treatment, 
we give a relevant pointer for some of the main models, either to a recent or significant 
result or to a major source of information: random trees (these are especially relevant to 
the average case analysis of many algorithms) [BaBoDeF]1GaGo02], [MeMo98], [Dr09]; 
random hypergraphs [FrJa95], [KrVu01]; random digraphs [Ka90]; random subgraphs 
of the cube [Ri00]; random geometric graphs [Pe03]; superposition models [Wo99], [Gr- 
JaKiWo02]; k-in, k-out models [CoFr94]; random mappings [Ko86]; random instances of 
k-SAT [BoBoChKiWi01]; random graphs with independent edges but of unequal prob- 
abilities [LuSh95], [BoJaRi07]; random maps (graphs embedded on surfaces) [GaWo00]}; 
random planar graphs (for instance in [McStWe05]). 
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8.2.8 Random Graph Processes 


A random graph process is a family of random graphs indexed by time. Some random 
graph processes are useful in proofs of facts about standard random graph models, but 
the following one in particular has been studied because of the interesting interpretation 
and extensions of threshold results which it enables. 


DEFINITIONS 


D19: The standard random graph process G (n) begins with no edges and adds 
new edges one at a time, each selected uniformly at random from those not already 
present. (Formally, G(n) is a Markov chain. Its state at time m (when it has m edges) 
is clearly equivalent to the random graph G(n,m), and so it is commonly represented 
as the sequence {G(n,m)}m>o-) 


D20: The hitting time of a graph property Q is min{m : G(n,m) has Q}. 


FACTS 


As before, asymptotic statements regarding G (n) refer to the passage of n to infinity. A 
number of statements have been proved which show that the first edge giving a graph a 
certain property is a.a.s. also the first edge for which another (simpler) property holds. 


F77: [Bo84] The hitting time for the property of possessing a Hamilton cycle is a.a.s. 
equal to the hitting time for having minimum degree at least 2. 


F78: [BoTh85] The hitting time for possessing a perfect matching is a.a.s. equal to the 
hitting time for having minimum degree at least 1. Indeed, [BoFr85] the hitting time 
for possessing |k/2| edge-disjoint Hamilton cycles and, if k is odd, a matching of size 
|n/2| disjoint from these cycles, is a.a.s. equal to the hitting time for having minimum 
degree at least k. 


F79: [BoTh85] For any function k = k(n), the hitting time for being k-connected is 
a.a.s. equal to the hitting time for having minimum degree at least k. 


F80: Let F, denote the length of the first cycle to appear in G(n). Then [Ja87] for 
fixed j > 3, P(Fn = j) ~ pj = af to-let/2+0?/4,/T—¢ dt: > j=3Pi ~ 1, but on the 
other hand [FlKnPi89] the expected value of F;, is asymptotic to n!/°, 

F81: [JaKnLuPi93] The probability that G(n) at no time contains more than one 


complex component is (1 + 0(1)) $2. 


REMARK 


R19: Many other random graph processes have been studied. The world-wide-web 
provides motivation for a number of these (see [BoRiSpTu01] and [CoFr03], for exam- 
ple). Some of the other random graph processes of interest include processes modeling 
the growth of the giant component [AIPi00], processes which randomly delete from a 
graph in order to find packings [AIKiSp97], simple processes for generating graphs with 
given maximim degree [RuWo92], and processes generating random planar graphs by 
adding edges at random maintaining planarity [GeScStTa08], but these are only a small 
representative sample. 
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INTRODUCTION 


In any group of six people there are always either three who know each other or 
three mutual strangers. This same statement in the language of graph theory is that 
if each edge of a complete graph Kg is colored either Red or Blue, then there is either 
a Red triangle (43) or a Blue triangle (3). Moreover, this conclusion is not true for 
Ks, so six is a minimum such number. This is a special case of a much more general 
observation of F. P. Ramsey [Ra30]. He observed that for all positive integers m and n, 
there is an integer r such that if each edge of a K, is colored either Red or Blue, then 
there will be either a Red K,,, or a Blue K,,. The smallest such integer r is denoted by 
r(m,n), and is called the (m,n)-Ramsey number. Ramsey graph theory is the study of 
such numbers and the corresponding graphs. More generally, the number of colors is 
not restricted to just two, the monochromatic graphs are arbitrary — not just complete 
graphs, and the graph being edge-colored is not restricted to being complete. 


8.3.1 Ramsey’s Theorem 


Ramsey’s original theorem applies to general set theory and has implications to 
many areas of mathematics other than combinatorics and graph theory. For combina- 
torial results related to Ramsey’s theorem, see [GaRoSp90], [Pa78], [Ne96], and survey 
articles [Bu74], [Bu79], [ChGr83], [GrRo87], and [Ra02]. A simplified version of Ram- 
sey’s theorem applicable to finite graphs is our starting point. 
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FACT 

F1: (Ramsey’s Theorem [Ra30]) Given positive integers k,n1,n2,--- ,nz > 2, there 
is a least positive integer r(n1,n2,--- ,m%) such that, for any partition C1,C2,---C, of 
the edges of a complete graph K, with p > r(nj,n2,--- ,nx), there is for some i a 


complete subgraph K,,, all of whose edges are in C;. 


DEFINITIONS 


D1: The number r(n1,72,--- ,nx) is called the Ramsey number for the k-tuple 
(n1, ne, ae , Nk): 


D2: The partition of the edges of a complete graph K, into k sets is described as a 
coloring of the edges of K, with k colors, or more specifically a k-edge-coloring of Kp. 


Ramsey Numbers for Arbitrary Graphs 


Ramsey’s theorem implies the existence of a “monochromatic” complete subgraph in the 
appropriate color in any edge-coloring of a sufficiently large complete graph. Since any 
graph G on m™ vertices is isomorphic to a subgraph of K,,, an immediate consequence 
of Ramsey’s theorem is the existence of the Ramsey numbers for arbitrary graphs. 


DEFINITIONS 


D3: The (generalized) Ramsey number r(G 1, G2,--- ,G,) for any collection of k 
graphs {G1, Go, --- , Gn} is the least positive integer n such that for any k-edge-coloring 
of Ky, there is for some 7 a monochromatic copy of G; in color 7. 


D4: Given k > 2 and graphs Gj, Go,-:-: ,Gx, a graph F is said to arrow the k-tuple 
(G1, Go,--- ,G,) if for any k-edge-coloring of F there is for some i a monochromatic 
copy of G; in the i*” color. This is denoted by F —> (G,,G2,:--,G,). Thus, the 
Ramsey number r(Gi, Go, --- ,G,) is the smallest order of a graph F’ such that F’ —> 
(Gi, Go,-+- ,Gn). 


D5: The size Ramsey number 7(G,G2,--- ,G,) is the smallest size (i.e., number 
of edges) of a graph F' such that F —> (Gj, G2,--- ,Gn). 


D6: A graph F is (Gi, Go,--- ,Ge)-minimal if F —> (Gi, Go,--- ,G,), but no proper 
subgraph of F' also arrows. 

REMARKS 

R1: If for each 7, G; = Kn,, then r(m,n2,--+ , nk) = 7(Kn,, Kn,++ » Kn, )- 


R2: Classical Ramsey graph theory deals with the case when each of the required 
monochromatic graphs is complete, while generalized Ramsey graph theory involves the 
generalization to arbitrary graphs. 


R3: This leads to asking questions about the structure of and the number of “different” 
graphs that arrow. Ramsey minimal graphs are considered in 8.3.6. 
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8.3.2 Fundamental Results 


The vast majority of Ramsey graph results concern 2-colorings, so these are featured. 
Several useful facts are immediate consequences of the definition. 


FACTS 


F2: For any pair of graphs G; and G2, r(G1,G2) = r(G2,Gi). More generally, the 
order of the graphs is not important for any number of graphs. 


F3: For any graphs G,, and G', of orders m and n, respectively, r(Gm, Gn) < r(m,n). 
F4:; For n> 2, r(2,n) =r(n,2) =n. 
F5: Erdés and Szekeres [ErSz35] For m,n > 3, 

r(m,n) < r(m—1,n) + r(m,n-—1) 


with strict inequality if both r(m —1,n) and r(m,n— 1) are even. A consequence of 


this is 
r(mti1jn+1)< Gea. 
m 


REMARKS 


R4: The Erdds-Szekeres theorem gives a finite upper bound for the Ramsey numbers 
of all pairs of finite graphs. There are corresponding bounds for any finite number of 
colors and collections of finite graphs. 


R5: To prove that the Ramsey number r(G, H) = p, normally two steps are taken. A 
proof is given to show that any 2-edge-coloring of K,, say a Red-Blue coloring, yields 
either a Red G or a Blue H, and then a Red-Blue coloring of a Ky_1 is exhibited that 
has neither a Red G nor a Blue H. 


R6: In the case of a 2-edge-coloring, say with Red and Blue, it is sometimes more 
convenient to just denote a subgraph F’, which represents the subgraph induced by the 
Red edges. Then, the complement F' of F' denotes the Blue subgraph. 


EXAMPLE 


El: To show that r(3,3) = 6, observe that in Figure 8.3.1 there is a Red-Blue coloring 
of Ks with no K3 in either color, and observe by the result of Erdés and Szekeres 


r(3,3) <r(2,3) +7r(3,2) = 3+3=6. 


8.3.3 Classical Ramsey Numbers 


Determining classical Ramsey numbers is quite difficult, and the number of non- 
trivial classical Ramsey numbers that are known precisely is very limited. Only one 
nontrivial multicolor (at least three colors) classical Ramsey number is known and only 
nine nontrivial two color Ramsey numbers r(m,7) are known, which is strong evidence 
of the difficulty in determining Ramsey numbers. 
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Red Blue 
Figure 8.3.1: r(K3, K3) > 5. 


Ramsey Numbers for Small Graphs 


Figure 8.3.2 contains the known classical Ramsey numbers r(k,1) along with the best 
upper and lower bounds for small values of m and n. The argument & ranges from 3 
through 10 and runs along the main diagonal, and the argument / runs along the top 
row. The exact Ramsey values are centered, upper bounds are the top entries, and 
lower bounds are the bottom entries. For instance, r(3,8) = 28, and r(4,8) is between 
56 and 84. The references for each of the exact values are also listed in the table along 
with references for some of the upper and lower bounds, where the leftmost column is 
used to indicate the argument / for the particular Ramsey number. For instance, it 
was established in [Ke64] that r(3,6) = 18. A listing of Ramsey numbers much more 
complete than Figure 8.3.2 can be found in an excellent dynamic electronic paper by 
Radzisowski [Ra02], which is updated periodically. 


FACT 


F6: Greenwood and Gleason [GrG155] (3, 3,3) = 17. 


REMARK 


R7: The earlier work determining classical Ramsey numbers was done by Greenwood 
and Gleason [GrGI155], Kéry [Ke64], and by Graver and Yackel [GrYa68]. Lower bounds 
were established with explicit but sophisticated colorings using algebraic techniques, 
and upper bounds were established using graph theory techniques. More recently Exoo, 
McKay, and Radzisowski, among other mathematicians, have used computational tech- 
niques, powerful algorithms, and more computing power to sharpen upper and lower 
bounds for larger Ramsey numbers. Numerous improvements have been made in the 
past few years. However, the gaps between upper and lower bounds are still enormous 
for even many small values of m and n. 


Asymptotic Results 


Considerable study of the asymptotic behavior of the Ramsey number r(m,n) has not 
yielded sharp asymptotic results. There have been numerous improvements in the upper 
bound of Erdés and Szekeres (Fact F5). The lower bounds for r(m,n), or in particular 
for r(n,n), are far from the upper bounds. The only case in which sharp asymptotic 
results have been obtained is m = 3. Shearer [Sh83] proved an upper bound for r(3, 7), 
and Kim [Ki95] verified an asymptotically sharp lower bound. 


1006 Chapter 8. Analytic Graph Theory 


GreGI55 


McRd97 


Ka66 


McZ92 


Figure 8.3.2: The classical Ramsey numbers r(k, /). 
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FACTS 


F7: [Th88] For all 1 <m <n, there is an absolute constant c such that 


r(m+1n4t1) < emlesn)/2n+evlogm is h ") 
_ m 


and in particular 
r(nti,n+1)< n7i/2+evlogn i) : 
F8: For some constant c’ and for all positive n, 
r(n,n) > ln2r/?, 
F9: ((Sh83], [Ki95]) There are absolute constants c and c’ such that 
cn? /logn < r(3,n) < c'n?/logn. 


Probabilistic techniques are used to prove the existence of colorings, but specific color- 
ings are not exhibited. 


8.3.4 Generalized Ramsey Numbers 


There has been more activity and considerably more results in generalized Ramsey 
theory than in any other area of Ramsey graph theory. It would be impossible to survey 
even a fraction of the results, so we will review just a few of the highlights. 


Initial Generalized Ramsey Results 


FACTS 
F10: [GeGy67] For positive integers 2<m<n, 
r(Pm; Pn) =n +|m/2| — 1. 


The lower bound for r(P, Pn) comes from the coloring determined by the graph F = 
Kn-1U K\m/2|-1- The graph F' contains no connected graph with n vertices and so no 
P,,, and the complement F is a bipartite graph with no P,,. 


F11: [BuRo73] Let n1,n2,---nz be positive integers with s of them being even and 


k > 2. Then 
k 


1 gies Kins ais King) = Soni ~ 1) +a, 


i=l 


where a = 1 if s is positive and even and a = 2 otherwise. For k = 2, 
r(Kim, Kin) =m+n-e, 


where € = 1 if m and n are both even and 0 otherwise. 
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REMARK 


R8: There are regular graphs of any order p and degree k < p except when k and p are 
odd, and, in this case, nearly regular graphs exist. The lower bounds for the Ramsey 
numbers in Fact F11 depend on colorings derived directly from regular or nearly regular 
graphs. 


Ramsey Numbers for Trees 


There are many classes of pairs of trees for which the Ramsey number r(Ti,, T,) is not 
known. However, for all such numbers that are known, r(Timn, Tn) < m+n-— 2. 


FACT 
F12: [BuRo73] When at least one of m or n is even and T,, and T,, are stars (Tm = 


Kimi and T, = Ki n-1), 
r(Tm,Tn) =m+n—2. 


CONJECTURES 


C1: Tree Conjecture [BuEr76] For any trees T,, and T,, with m,n > 2, 
r(Im,Tn) <m+n—2. 
C2: Erdés—Sés Conjecture Any graph G with n vertices and at least n(k —2)/2+1 
edges contains any tree Ti, (k > 2) as a subgraph. 
REMARKS 
R9: The Erdds—S6s conjecture implies the Tree Conjecture. 


R10: Personal communication has indicated that M. Ajtai, J. Komolés, M. Simonovits, 
and E. Szemerédi [AKSS] have proved that the Erdés—Sdés conjecture is true for n 
sufficiently large. The paper has not been published. 


Cycle Ramsey Numbers 


The Ramsey numbers of cycle graphs appear to have some of the same characteristics 
as the Ramsey numbers of trees. 


FACTS 
F13: ([Ro73a], [Ro73b], [FaSc74]) If 3 < m <n with (m,n) ¥ (3,3), (4,4), then 


2n—1 when m is odd, 
(Cm, Cr) =4 n+ B-1 when m and n are even, and 
max{n + 5 —1,2m—1} when m is even and n is odd. 
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For 3-edge-colorings the examples for cycle Ramsey numbers have similar properties, 
but determination of the numbers is much more difficult. 

F14: [RoYa92] r(Cs,Cs,Cs) = 17. 

F15: [FaScSc] r(C7, C7, C7) = 25 

F16: [Lu99] For all n > 4, r(Cn,Cn, Cn) < (44+ 0(1))n. 


CONJECTURE 


C3: Bondy and Erdés Conjecture [BoEr73] For n > 5 and odd, 
r(Ch, Cn, Cr) = 4n — 3. 


EXAMPLE 


E2: For n > 3 and odd, consider the 3-edge-coloring, say with Red, Blue, and Green, of 
a K4(n—1)- The Red subgraph is 4k,,_1, the Blue subgraph is isomorphic to 2Kp—1,.n-1 
and contains all of the edges between the first two and the last two of the complete 
graphs in Red, and the remaining edges are Green and form a Ka(n—1),2(n—-1). For n 
odd there is no Red C,,, since no component of the Red subgraph has n vertices, and 
there is no Blue or Green C;, since these graphs are bipartite and have no odd cycles. 
Thus, r(Cn,Cn, Cn) > 4n — 4 for n odd. 


F17: [KoSiSk05] For n odd and sufficiently large, r(Cn,Cn,Cn) = 4n — 3. 


F18: [BeSk09] For n even and sufficiently large r(Cn, Cn, Cr) = 2n. 


Good Results 


Results of [BoEr73] and [Ch77] on complete graphs and trees (see Facts F20 and F21) 
motivated new lines of investigation into generalized Ramsey numbers. 


DEFINITIONS 


D7: [Bu81] If x(G) is the chromatic number of G, then the chromatic surplus of G 
is the largest number s = s(G) such that in every vertex coloring of G with y(G) colors, 
every color class has at least s vertices. 


D8: [Bu81] A connected graph H of order n > s(G) is called a G-good graph if 
r(G, H) = (x(G@) — 1)(n—1) + s(G). 


EXAMPLES 


E3: Consider a Red-Blue coloring of a K(m_—1)(n—1) in which the Blue graph is m — 1 
vertex disjoint copies of a complete graph Ky _1 ((m—1)Ky_1) and the Red graph is 
the complementary graph, Kn, no,---nm_1, Where ny = Ng =+-+Nm-1 = n—1. There is 
no Blue T;,, and in fact no Blue connected graph with n vertices, and there is no Red 
Kmn- 


1010 Chapter 8. Analytic Graph Theory 


E4: The 2-edge-coloring of Example E3 gives the lower bound for r(Ky,,T;,) and also 
for r(Km, Cy). Moreover, there is no graph with chromatic number m in the Red graph. 
This coloring implies that if the chromatic number .(G) = m and H is any connected 
graph of order n, then r(G, H) > (m— 1)(n — 1). 


E5: For p= (x(G)—1)(n—1)+s(G)—1, consider the Red-Blue edge-coloring of K, in 
which the Blue graph consists of x(G) — 1 disjoint complete graphs of order n — 1 and 
one complete graph of order s(G) — 1, and the Red graph is the complementary graph. 
There is no Blue G and there is no connected graph of order n in Red for n > s(G). 


FACTS 
F19: [Bu81] If H is any connected graph of order n > s(G), then 
r(G, H) > (x(G) — )(n— 1) + 8(G). 
F20: (Bondy and Erdés [BoEr73]) If m > 3 and n > m? — 2, then 
r(Km,Cn) = (m—1)(n-1) +1. 


It was conjectured in [EFRS78b] that Fact F20 is true for n > m except for n =m = 3, 
and also verified for m = 3. It has now been verified for m = 3,4,5,6, (see [ChS71], 
[YHZ99], [BJYHRZO00], and [Sc03], respectively), and in addition proved in [Ni03] for 
m>3andn>4m-4 2. 


F21: (Chvatal [Ch77]) For integers m,n > 1, 
r(Km,Tn) = (m—1)(n—1) +1. 


This theorem, which can be stated as any tree T;, is Ky,-good, has been generalized 
in many ways. The two main approaches have been to replace K,, by a graph with 
chromatic number m or to replace the tree T;,, by a connected sparse graph. 


F22: [BuFa93] A graph G satisfies r(G,T,) = (m — 1)(n — 1) +1 for all trees T,, of 
sufficiently large order n, if and only if s(G) = 1, and there is a y(G)-vertex coloring of 
G such that the graph induced by two of the color classes is a subgraph of a matching. 


F23: Let G be an arbitrary graph and H a connected graph of order n. Then there 
are positive constants c,c1, C2, and a such that H is G-good if n is sufficiently large and 


BEFRS80a]_ G = K3,n > 4, and |E(H)| < (17n +1)/15, or 


ae 
(ii) [BEFRS80a] G= Km,m> 4,|E(A)| <n+en2/("-), or 
(iit) [EFRS85| |E(H)| <n+cn%, and A(H) < con®, or 
(iv) [BEFRS82b] G = Com4i1 and |E(H)| < (1+ c¢s3)n, or 
(v) [BuEr83] G = Ks; and H = K, + C,, (wheel), or 
(vt) [FaRoSh91] G= Com+1 and H = Ko + Ky_2. 
REMARK 


R11: A comprehensive summary of “good” results can be found in [FaRoSc92]. 
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Small Order Graphs 


Most of the generalized Ramsey numbers for very small order graphs were determined 
in papers by Chvatal and Harary [ChHa72], Clancy [C177], and Hendry [He89a]. Figure 
8.3.3 pictures all of the graphs with at most five vertices that have no isolated vertices. 
The graphs are described using standard graphical operations such as +, —, and U along 
with e, where Ge H is a graph (not unique) obtained from G and H by identifying one 
vertex from each graph. 


-~AAUMNKORA RS 


Ke P3 Ki,3 Ka — P3 Ka Ko U P3 
A? 13 19 20 21 22 
AWnwWdedecsn 
KoU K3 Ps Ky4 Ki,3¢¢€ C4aee K3e P3 K302Kq K3¢Ki2 Ka3 K3 0 K3 
; 23 ; 24 ; 25 ; 26 g 27 : 28 : 29 > A A 32 : 33 
(K4—e)ee (Kqa—e)ee Kz — Ps Kx —P, K5—(P3Ue) Ks—Kg Ks—Ki,3 Ky —P3 Kp —2Ko Ks —e Ks 


Figure 8.3.3: Graphs of order < 5 without isolates. 


REMARKS 


R12: Figures 8.3.4 and 8.3.5 give the diagonal Ramsey numbers or the sharpest known 
bounds for the Ramsey numbers for all graphs of order at most five without isolated 
vertices. Additional information on these exact numbers and the bounds can be found 
in [Ra02]. 


R13: The Ramsey numbers for several other classes of small order graphs have been 
determined, and there is an excellent survey of this type of result in [Ra02]. 


R14: Ramsey numbers for the pair (K3,G) where G is an arbitrary graph of order p 
have been determined for p < 6 in [FaRoSc80], for p = 7 or 8 in [Br98], and for p = 9 
in [BrBrHa98}. 


R15: The diagonal Ramsey numbers r(G,G) for all graphs with at most seven edges 
and without isolated vertices can be found in [He87]. 


R16: Ramsey numbers for almost all pairs (G,T) where G is a connected graph of 
order at most 5 and T is an arbitrary tree were calculated in [FaRoSc88]. 


REMARKS 


R17: Figures 8.3.4 and 8.3.5 (Part 1 and Part 2), which can be found in [FaSh83a], gives 
the generalized Ramsey numbers for all pairs of graphs without isolated vertices and 
with five or less vertices. The references for the numbers in these figures appear below 
the diagonal, where [CH] represents [ChHa72], [C] represents [C177], and [H] represents 
[He89a], and for the single entries [h] represents [He89b], [b] represents [BoHa81], [e] 
represents [ExHaMe88], [y] represents [YuHe95], and [m] represents [McRa95]. 
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Figure 8.3.4: Generalized Ramsey numbers for small graphs - Part 1. 


Section 8.3. Ramsey Graph Theory 1013 


e 
No} 
i) 
Oo 
i) 
pare 
No 
nN 


23 24 2 26 27 28 29 300631 (382 33 


56 56 & & F bb & 5S 5 B&F BSB & 5S 5 5 
1 5 5 5&5 BF F 5 6 5 5 FF F FF BB *F 9 

2 9 9 9 9 9 9 9 9 9 9 9 9 WW WU 14 
3 6 6 6 6 6 6 6 6 6 6 6 6 6 6 7 

4 7 & @ % % 8 FF 7 10 10 7 #10 ~~ 18 
5 7 «t @ FF % FF F & FF 8 HW MW 9 H BB 
6 7 % 8 F %F FF YF F §F OD DH OW 9 TD 
7 9 9 9 9 9 9 9 9 9 9 10 10 I 1 14 
8 9 9 10 9 10 10 9 10 10 1 tl 1 HW 13 16 
9 13 13 #14 #18 #413 #+18 «138 «©180«€©61406«©1406«18)6«6©618)«€617)« «6195 
10 6 & 6 Y € £ F FF  F PF RW FF |. 9 
11 9 9 9 9 9 9 9 9 9 9 9 9 WW WW 14 
12 9 9 6 9 9 9 9 9 9 9 18 18 9 18 17 
13 9 9 9 9 9 9 9 9 9 9 18 18 9 18 17 
14 9 9 7 9 9 9 9 9 9 9 18 18 9 18 17 
15 9 9 8 9 9 9 9 9 9 9 18 18 9 18 17 
16 9 9 9 9 9 9 9 9 9 9 128 18 W 18 #17 
17 9 9 9 9 9 9 9 9 9 9 1 18 W 18 #17 
18 9 9 9 9 9 9 9 9 9 9 18 18 1 18 17 
19 «[20] 9 9 9 9 9 9 9 9 10 18 18 9 18 #17 
20 10 10 10 10 9 10 10 13 #14 «14 «138 «17 ~~ 20 
21 9 9 9 9 9 10 10 18 18 1 138 17 
22 i0 10 O08 10 10 Hl GB BW Hh 2 iw 
23 1 20 40 10 af 23 18 M 138. 47 
24 25] 9 9 9 10 138 1 1 12 «17 
25 10 10 13 13 13 +138 #17 ~=©620 
26 10 1 4 4 8 I 2% 
27 14 14 14 #15 17 += 20-22 
28 [29] 18 18 17 19 25 
29 [30] 18 17 19 25-28 
30 15 17 27-29 
31 22 30-34 
32 y 43-49 
33 m 


Figure 8.3.5: Generalized Ramsey numbers for small graphs - Part 2. 
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Linear Bounds 


By using Szemerédi’s regularity lemma, [CRST83] verified the following conjecture of 
Erdés. Chen and Schelp [ChSc93] subsequently extended the class of “linearly bounded” 
graphs to a larger class of c-arrangeable graphs, and this extension has some nice ap- 
plications. 


CONJECTURE 


C4: Erdés Conjecture [BuEr75] If G is a graph of order n with maximal degree A, 
then r(G,G) has an upper bound that is linear in n. 


DEFINITION 


D9: A graph G is c-arrangeable if the vertices of G can be ordered in such a way 
that for any vertex v, each adjacency of v that succeeds v in the order has at most c 
adjacencies that precede v in the order. 


FACTS 


F24: [CRST83] if G is a graph of order n with maximal degree A, then r(G,G) < c-A-n 
for some positive constant c. (This verifies the Erdés Conjecture.) 


F25: [ChSc93] If G is a carrangeable graph of order n, then there is an upper bound 
for r(G, G) that is linear in n. 


F26: [ChSc93] (Corollary) If G is a planar graph, then r(G,G) has an upper bound 
that is linear in the order of G. 


F27: [RoTh96] If G is a graph of bounded genus, then r(G,G) has an upper bound 
that is linear in the order of G. 


CONJECTURE 


C5: Bounded Density Conjecture [BuEr75] If the average degree of each subgraph 
of a graph G of order n is at most c’, then there is a constant c = c(c’) such that 
r(G,G) < cn. 


8.3.5 Size Ramsey Numbers 


Increased interest in the size Ramsey number 7(G,H) was created in a paper by 
Erdés et al. [EFRS78a]. 
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General Bounds 


FACTS 


F28: |E(G)| +|E(H)|—-1< AG, H) < (“G®). 


(Km, Kn) = ("%”)), and 


F29: [EFRS78a] For m,n > 1, 
ft 
ra Kim; Kin) =m+n— 1. 


(ii) 
REMARKS 


R18: Any graph F' such that F —> (G,H) must have at least |E(G)| + |E(H)| -1 
edges, and the fact that K,~¢,4) —> (G,H) implies that r(G, H) < Cee): 

R19: It is natural to investigate the relationship that 7(G, H) has with both (4) 
and |E(G)| + |E(4)| — 1. Both extreme possibilities occur. 


Linear Bounds 


The size Ramsey #(Ky,m, Ki) is linear in m and n, while the number of edges in the 
complete Ramsey graph for the pair (£1, A1,,) is quadratic in m and n. Beck ([Be83], 
[Be90]) answered some of the questions posed in [EFRS78a] by showing that there were 
large classes of graphs for which the size Ramsey number has a linear property or near 
linear property. 


FACTS 


F30: [Be83], [Be90] There exist constants c and c’ such that for any tree T,, of order 
n and maximum degree A and for n sufficiently large, 

(i) * (Pr, Pn) < en, 

(ii) = F(Ch, Cn) < cn, and 


(iii) (Tn, Tn) < An(log n)?2. 


F31: [HxKo95] For any tree T,, with maximum degree A, there is a constant c such 
that 7(Tn,Tn) <c-A-n. 


REMARK 
R20: The previous result was conjectured in [Be90], along with a stronger conjecture 


dealing with the bipartite structure of the tree T,,. The previous results led Beck and 
Erdos to make the following conjecture. 


CONJECTURE 


C6: Beck Conjecture [Be90| For a graph G of order n and bounded degree A, there 
is a constant c = c(A) such that 7(G,G) < cn. 
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Bipartite Graphs 


For the complete bipartite graph K,,,,, upper bounds were proved by Erdés et al. 
[EFRS78a] and by NeSetiil and Rédl [NeR678], and lower bounds were proved by Erdés 
and Rousseau [ErRo93], but none are asymptotically sharp. 


DEFINITION 
D10: A star forest with s components each being a star with n edges will be denoted 
by sKy.n.- 
FACTS 
F32: [EFRS78a], [NeR678], [ErRo93] For n > 6, 
(1/60)n?2” < #(Knn; Knn) < (3/2)n32”. 
F33: [BEFRS78] For positive integers m,n, s and t, 


? (skim, thin) =(m+n—1)(s+t-1). 


REMARKS 


R21: Only a limited number of precise values of size Ramsey numbers are known, 
since they are much more difficult to calculate than generalized Ramsey numbers. Star 
forests with all components equal is one class of graphs for which many numbers are 
known. 


R22: The precise value of the size Ramsey numbers for general families of star forests is 
still open, but results in special cases support the following conjecture from [BEFRS78]. 


CONJECTURE 


C7: Star Forest Conjecture [BEFRS78] Let s and t be positive integers with m; > 
Mg:++Ms > land ny > ng >---n > 1, and let Fy = UF_, Kim, and Fo = U1 1,n,- 
Then, 

stt 


PF, Fa) = So Pk 
k=2 


where pz = max{m; +n; —-1:i4+j=k}. 


Small Order Graphs 


Exact size Ramsey numbers are known for some small graphs, but even for small graphs 
it is sometimes difficult to calculate the number. An easier number to calculate is the 
restricted size Ramsey number, which is determined by restricting the “arrowing” graphs 
to those whose order is the Ramsey number. 
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DEFINITION 


D11: For graphs G and H, the restricted size Ramsey number 1r*(G,H) is the 
minimum size graph F' of order r(G, H) such that F —> (G, H). 


REMARK 


R23: An “arrowing” graph corresponding to the size Ramsey numbers appears below 
the diagonal in Figure 8.3.6. These graphs are not, in general, unique. The subscript 
is a reference to the paper with this result. In this case [1] refers to [Bu79], [2] refers to 
[BEFRS78], [3] refers to [EFRS78a], [4] refers to [FaSh83b], [5] refers to [HaMi83], and 
[6] refers to [FaRoSh84]. 


; Py Ks 2K» Pt Kia C1 Ki—Ps | Ka-e| Ka 
Ps 32) 84) 4p 5 (2) 66) 7 9 | 184 
Py Ki3 Kz 1519) 6 15 18 20 19 20 3619) 
Kz | K;—2k) | Kg 2K 3 5 6 8 8 10 12 
2K» Cs 2K 3K Ps 7s] 8 9 15 17 | 35° 
Py | Ka—K | Kr—P,; Cs Ks — Ps Kia 5p] 1246 18 19 | 42 
Kia Kia |Kr—-Ka| Kia Bye K2 Kis C1 1546 20 a1 | 45 
Cy K, K;-K» C4 K; —K» Kog Ke | Ka—Py | 195 20 | 43" 
Ky —P3 | Kj—2K, | K;— Py | 2(Ky—Ps) | K>—P; | K;—Ky | K;-Ky| K;-Py | Ky—e | 39%, | 55* 
Ky—e | Ks—Ky | K;—Ky | (Ky—e) | Kr7—Ps | Kr-Py | Ky | K7—Ky | Kio—Ka | Ka | 158, 
K4 K; —3Ko Kg 2K 4 Kio —2C; | Kio — Kg Kio Kio — Ps Ku Kis 


Figure 8.3.6: Size Ramsey numbers for small order graphs. 


8.3.6 Ramsey Minimal Graphs 


DEFINITIONS 


D12: For a pair (G, H) of graphs, R’'(G,H) = {F : F —> (G,H)}. The graphs F in 
R'(G, H) are the Ramsey graphs for the pair (G, H). 


D13: A graph F is (G, H)-minimal if F € R’(G, H), but no proper subgraph of F 
is in R’. 


D14: The Ramsey minimal graphs in R’(G, H) will be denoted by R(G, H). 


D15: The pair (G, H) is Ramsey-infinite if the number of nonisomorphic graphs in 
R(G, H) is infinite. Otherwise, the pair (G, H) is Ramsey-finite. 


D16: A graph is 2.5-connected if it is 2-connected and any cutset with two vertices is 
independent. 


1018 Chapter 8. Analytic Graph Theory 


EXAMPLES 


E6: For any graph G, clearly G — (G, Ko), and if F —> (G, K2), then F must have 
G as a subgraph. Hence, R(G, Ky) = {G}, and the pair (G, Ka) is Ramsey-finite. 

E7: Observe that any 2-edge-coloring of an odd cycle C’, will have consecutive edges 
with the same color because of the parity of n. Thus, C,;, —> (P3, P3) for n odd, and no 
proper subgraph of C,, will “arrow” (P3, P3). Thus, the pair (P3, P3) is Ramsey-infinite, 
and it is easy to show that R(P3, P3)={C, : n odd}U Ky 3. 


FACTS 


NeSetiil and Rédl [NeR678] initiated the Ramsey-infinite and Ramsey-finite line of in- 
vestigation. Their work on forests was extended in [BEFRS81] and [BEFRS82a], but 
there is still no complete characterization of Ramsey-finite graphs for forests. 


F34: (NeSetril and Rédl [NeR678]) The pair (G,G) is Ramsey-infinite if 
(i) = x(G@) 23, 

(ii) G is 2.5-connected, or 

(iii) Gis a forest containing a P,. 

F35: [BEFRS81], [BEFRS82a] Let G and H be forests. 


(i) The pair (G, H) is Ramsey-infinite if either G or H has a component that is not a 
star. 


(ii) If G and 4H are star forests without isolated edges, then (G, H) is Ramsey-finite if 
and only if each of G and 4 is a single star with an odd number of edges. 


(iii) There are both Ramsey-finite and Ramsey-infinite pairs of star forests (G, H) 
when G and H have isolated edges. 


F36: [BEFS78] For m a positive integer and G an arbitrary graph, the pair (G,mK2) 
is Ramsey-finite. 


F37: [Lu94] If G is a forest that is not a matching and H contains a cycle, then the 
pair (G, H) is Ramsey-infinite. 


REMARKS 


R24: One consequence of Fact F37 is that a matching G = mKo is the only graph 
that can be paired with any graph H to yield a Ramsey-finite pair. 


R25: Fact F37 also answers a question posed in [BEFRS80b] by showing that the pair 
(P3, H), and in fact the pair (K12n,H) for n > 1, is Ramsey-infinite for any graph H 
that is not a matching. 


R26: A complete characterization of the pairs of forests that are Ramsey-finite is not 
known, and much less is known about pairs of graphs in general. 


CONJECTURE 


C8: Ramsey-Finite Conjecture The pair (G,H) is Ramsey-finite if and only if 
either 

(i) Gor H is a matching, or 

(ii) Gand Z are appropriate star forests. 
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8.3.7 Generalizations and Variations 


There are an endless number of generalizations to classical Ramsey numbers and 
only a few of them have been considered in this short survey. We end with a brief 
mention of some of the directions that have been explored. 


Graphs 


The induced Ramsey number leads to a stronger “arrowing” result, since the monochro- 
matic graph must be induced. The existence of r*(G, H) was verified by Rédl in his 
doctoral thesis [Ro73], and was also verified independently by other mathematicians. 
Some general upper bounds on r*(G, H) for various graphs G and H can be found in 
[KoPrRo98}. 


DEFINITIONS 


D17: The induced Ramsey number r*(G,H) is the least positive integer n such 
that there exists a graph F of order n such that any 2-edge-coloring (Red and Blue) of 
F yields an induced copy of G in Red or an induced copy of H in Blue. 


D18: For bipartite graphs G and H the bipartite Ramsey number r,(G, H) is the 
smallest order of a bipartite graph F' such that F’ —> (G, H). 


D19: The connected Ramsey number r,.(G, H) is the order of the smallest graph 
F such that F —> (G,#H) and the graph induced by each color is also connected. 


D20: For a graph G, the Ramsey multiplicity R(G,G) is the minimum number 
of monochromatic copies of G in any 2-edge-coloring of the Ramsey graph K,, where 
n=r(G,G). 


REMARKS 


R27: The existence of the bipartite Ramsey number r,(G,H) was verified by Erdés 
and Rado [ErRa56] and has been calculated for some basic graphs as paths, stars, and 
some small complete bipartite graphs. This bipartite definition can be extended to any 
chromatic number, not just chromatic number two. 


R28: Sumner [Su78] showed that r.(G, H) = r(G, A) if neither G nor H has a bridge 
and each has order at least four. However, r-(G,H) < r(G,H) for some graphs with 
bridges, such as paths [Su78] and paths with other graphs [FaSc78]. 


R29: Harary and Prins [HaPr74] calculated R(G,G) for small order graphs and stars, 
but very few Ramsey multiplicities are known. 


Hypergraphs 


The discussion of Ramsey theory to this point has been restricted to graphs, and nearly 
exclusively to 2-colorings of graphs. However, the original Ramsey theorem applied to 
k-uniform hypergraphs as well. 
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DEFINITIONS 


D21: A hypergraph consists of a set of vertices V and a set of edges, each of which 
is a subset of V. A hypergraph is k-uniform if its edges all have cardinality k. 


D22: For k-uniform hypergraphs (G),G2,--- ,Gm), the k-hypergraph Ramsey 
number rp(Gi,Go,--- ,G@m) is the smallest integer n such that if the k-sets of a set of 
order n are colored with m colors, there will be for some 7 an isomorphic copy of G; in 
color 7. 


FACT 


F38: (McKay, Radzisowski [McRa91]) r3(4,4) = 13. 


REMARKS 
R30: The only classical hypergraph Ramsey number known is 13(4, 4). 


R31: Fact F38 says that if the triples of a set with 13 elements are 2-colored, then 
there will be a set of order 4 with all of its subsets of order 3 having the same color, 
and it is not true for all colorings of the triples on a set with 12 elements. 
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INTRODUCTION 


Even in purely deterministic settings, randomness can be artificially introduced as 
a powerful proof technique. This is now known as the Probabilistic Method, and it 
emerged in the middle of the last century. Around that time, several important results 
were proved by such arguments, including the myriad applications to Combinatorics 
that were popularized by Paul Erdés. The basic application follows the same general 
lines. First, one specifies a random procedure which generates a random object or 
identifies a random substructure according to some probability distribution. Then, one 
shows that the result has the desired properties with some positive probability, typically 
using inequalities instead of exact enumeration. 


8.4.1 The First Moment Method 


The first moment method has been used numerous times to prove results which are 
more difficult or perhaps impossible to prove using constructive methods. 


DEFINITION 


D1: The first moment method involves defining a random variable, the knowledge 
of whose expected value (i.e., first moment) can resolve the question of the existence of 
a particular structure. 
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EXAMPLES 


El: To prove that there is a tournament T with n vertices and at least n!2~"-) 
Hamilton paths, choose a random tournament and compute the expected number of 
Hamilton paths. Since there are n! paths in the undirected graph, each aligned in 
orientation with probability 2-“—"), the expected number is n!2~("—)), This implies 
that at least one tournament has this many! 


E2: (Bohman, Frieze, Martin, Ruszinko, and Smyth [BoFrMaRuSm]) A coloring of 
the edges of a graph G is said to be k-bounded if no color is used more than & times. 
A subgraph H of G is said to be multi-colored if every edge has a different color. To 
prove that every k-bounded coloring of the edges of the hypercube graph Q,, contains a 
multi-colored copy of the hypercube graph Q,,, for m > kn?2”, simply choose a random 
sub-cube and estimate the expected number of colors that appear twice or more. This 
is less than one for the given parameters. It follows that some Q,, does not have a color 
that appears twice. 


E3: A hypergraph H = (V,{E;, : 7 = 1,2,...,m}) is 2-colorable if there exists a 
partition of its vertex set V into two color classes RU B such that LE; R #4 0 and 
E;,QNB #90 for 1<i<m. In general it is NP-hard to tell whether or not a hypergraph 
is 2-colorable. However, Erdés [Er63] showed that if |E;| > k for 1 < i < m and 
m < 2*-! then H is 2-colorable. Simply partition V at random and show that the 
expected number of edges which are mono-colored is less than one. 


E4: (Spencer [Sp94]) A tournament T is said to have property 5; if for every set S of k 
vertices (players), there is a vertex v = v(S') ¢ S such that all the edges of T which join 
v to S are directed toward v, i.e., player v beats everyone in S. Do finite tournaments 
exist with property S;? The answer is yes. To prove this, simply choose an n which 
satisfies 


C) (i GREP 1 (8.4.1) 


and then randomly orient the edges of K,. Let Z be the number of sets of k vertices 
for which v(S) does not exist. The left hand side of (8.4.1) is the expected value of Z, 
and so the first moment method proves the existence of a tournament with property 
S,. For large k this gives 

n > 2*k? (In 2)(1 + o(1)) 


and this is close to being best possible, since Szekeres has proved that if f(k) is the 
smallest number of vertices in a tournament with property $;, then f(k) > ck2* for 
some constant c > 0 (see Moon [Mo79]). 


One can find other results like this in the book by Alon and Spencer [AISp00]. 


Ramsey Numbers 


DEFINITIONS 


D2: A red-blue edge-coloring of a graph is an edge coloring in which every edge is 
colored either red or blue. 
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D3: The Ramsey number R(k,k) is the smallest integer such that for n > R(k,k), 
every red-blue edge-coloring of the complete graph K,, contains either an all-red Ky, or 
an all-blue Ky. 


Determining the precise values of R(k,k) has proven to be extremely difficult, and 
R(k,k) is not known exactly for any k > 5. All we have are bounds. See Section 8.4.3. 
One of the earliest bounds was proved by Erdos. It should be stated right away that 
Paul Erdés was a pioneer in the use of the probabilistic method, proving many beautiful 
results, as well as inspiring numerous researchers to follow in his footsteps. 


FACT 


F1: (Erdés [Er47]) For sufficiently large values of k, 


R(k,k) > (1- o(1) aah, (8.4.2) 


REMARKS 


R1: The proof of the previous fact is quite elementary. One wants to show that if n is 
smaller than the right hand side of (8.4.2), then one can find an edge coloring without a 
mono-chromatic copy of K,. It has proven very difficult to produce an explicit coloring 
that will give this result. So we proceed as follows: we randomly color the edges 
of K, and show that with positive probability this coloring will have the 
property we want, which is that no K, will be monochromatic. This proves the 
existence of such a coloring without actually explicitly constructing one. This is the 
essence of the probabilistic method. 


R2: In the random construction above one concentrates on the random variable Z 
which counts the number of monochromatic K;, in the coloring. A simple calculation 
shows that the expected value E(Z) < 1 and then one can use the fact that 


Pr(Z > 1) < E(Z) <1 
to show that Pr(Z = 0) > 0. 


R3: The previous fact is one of the basic results in a deep and difficult theory. See 
Section 8.4.3 and also, for example, the books by Graham, Rothschild, and Spencer 
[GrRoSp90] or Nesetril and Rédl [NeRo90]. 


8.4.2 Alterations 


Our main example of another probabilistic proof technique concerns the possible 
relationship between chromatic number and girth. It would be reasonable to conjecture 
that graphs with large girth have small chromatic number, i.e., there is some function f 
such every graph with no cycle of length less than g can be properly colored with f(g) 
colors. In spite of its appeal, it just is not true. 
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DEFINITION 


D4: The alteration method is first to generate a random object and then to alter it 
to obtain a property we desire. 


EXAMPLES 


E5: Erdés [Er59] proved that for any pair of integers k, @ there exists a graph with girth 
at least k and chromatic number at least @. For a probabilistic proof, let Gy, denote the 
random graph with vertex set [n] = {1,2,...,n} and in which each of the (4) possible 
edges occurs with probability p, and start with a careful choice of p = O(1/n) and n 
sufficiently large. Erdés showed that one can delete edges and vertices to create a graph 
G’ with n’ vertices, girth at least k and no independent set of size n’/€. A moment’s 
thought will convince the reader that G’ has the required property. 


E6: By altering the randomly colored complete graph used to prove (8.4.2) one can 
show that 


R(k,k) > (1- o(1)) Sah? 


E7: A similar, but more sophisticated alteration was used by Beck [Be78] to replace the 
inequality m < 2*—! in the question of 2-colorability of hypergraphs by m = 0(2*k1/8). 
Beck’s proof was modified and improved so that the current best value for m is m = 
0(2*(k/Ink)'/?). This was done by Radhakrishnan and Srinivasan [RaSr00]. 


8.4.3 The Lovasz Local Lemma 


After the first moment method, perhaps the next most useful tool is the Local 
Lemma, which is the following fact. 


FACT 


F2: (Lovadsz [ErLo75]) Symmetric version of the Local Lemma: Given a collection of 
bad events A, Ao,..., Am, we wish to prove that there is some point in our probability 
space for which none of the A; occurs. Let I be the corresponding dependency graph 
on the vertex set {1,...,m}, defined such that for each 1 < i < m, the event A; is 
independent of the collection {A; : ij € E(I)}. If all Pr(A;) are upper bounded by 
some real number p which satisfies e(A + 1)p < 1, where e © 2.718 is the natural base 
and A is the maximum degree of the dependency graph I, then Pr(()j", Ai) > 0. In 
particular, the desired point in the probability space exists. 


EXAMPLES 


E8: The local lemma yields a slight improvement of the lower bound on R(k,k). 
We once again randomly color, and the bad events are that a particular k-clique gets 
mono-colored. The computations lead to a slight improvement, 
kV? 94) 2 

e 9 


R(k,k) > (1 —o(1)) (8.4.3) 


which only doubles the bound of (8.4.2). 
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E9: Let A = (V,{E;: i =1,2,...,m}) be a hypergraph in which every edge has at 
least k elements and suppose that each edge of H intersects at most d other edges. If 
e(d+1) < 24! then H is 2-colorable. We randomly 2-color V and event A; is defined 
to be {edge FE; is monochromatic}. Given the set of events A = {A;,i = 1,2,...,mb}, 
we define a dependency graph I with vertex set A such that event A; is independent 
of the events which are not adjacent to A; in the graph IT. In the present context of 
hypergraph coloring, p = 2*~! and A = d and so, the local lemma proves the existence 
of a coloring, i.e., proves that the hypergraph is 2-colorable. 


E10: This example concerns list coloring. Here we have a graph G = (V, E) and each 
v € V has a list of allowable colors L, and the question is can one choose a color cy € Ly 
for each v € V so that the coloring is proper. G is k-list colorable if |£,| > k for all 
v € V implies that a proper coloring exists. The list chromatic number of a graph is the 
minimum & such that G is k-list colorable. Here is a simple result that can be proven 
by one simple application of the local lemma: It is taken from Graph Coloring and the 
Probabilistic Method, by Molloy and Reed [MoRe02]. Suppose that every v € V has 
the following properties: 


(i) [Zy| > 4 and 
(ii) each c € L, appears at most ‘ times on the color list of a neighbor of v. 


The random experiment is to choose a random color c, independently from L, for each 
v €V. The collection of events is A.,-, which denotes {color c is chosen at both ends of 
edge e}. The local lemma immediately implies that there is a positive probability that 
none of these events will occur and so a proper coloring exists under these circumstances. 


E11: Here is another simple application. Suppose G = (V, FE), |V| = n and r divides n. 
Let A denote maximum degree. Suppose that r > 8A; then we can show that for any 
partition of V into sets Vi, V2,...,Vm, m = n/r of size r, there is an independent set 
of G of size m which contains exactly one member of each V;. Simply choose a random 
member of each V; and use the local lemma to show that it is an independent set with 
positive probability. The events are defined by the edges of G. Event A, will denote 
both endpoints of e are chosen. 


REMARKS 


R4: Sometimes a modification of the local lemma can be used, even when all the events 
are dependent. What is needed is some notion of being only weakly dependent. This has 
been called the lopsided local lemma and has been used by Erdés and Spencer [ErSp91] 
and by Albert, Frieze and Reed [AIFrRe95] to show the existence of multi-colored perfect 
matchings and Hamilton cycles. 


R5: As described, the local lemma is non-constructive and does not yield polynomial- 
time algorithms for finding the objects of interest. Starting with a breakthrough by 
Beck [Be91], algorithmic versions have been developed by Alon [A191], Molloy and Reed 
[MoRe98a], Czumaj and Scheideler [CzSc00], Salavatipour [Sa04], Moser [Mo09], and 
Srinivasan [Sr08]. Ultimately, Moser and Tardos [MoTal0] discovered a constructive 
proof producing a polynomial-time algorithm for the most common setting, where there 
is an underlying collection of mutually independent random variables, the events A; 
are determined by various subsets of that collection, and the edge 17 appears in the 
dependency graph I’ if and only if the subsets of variables that determine A; and A; 
overlap. 
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8.4.4 The Rodl Nibble 


The alteration method proceeds by altering the results of a random experiment. 
The Rédl nibble takes this a step further. It was first used by Rédl [Ro85] to affirm 
a conjecture of Erdés and Hanani. This nibbling approach has become a powerful but 
technically demanding tool. 


DEFINITION 


D5: The Rodl nibble considers a random process that builds the required object of 
interest a little piece at a time. 


FACT 


F3: (Rédl [Ro85]) Let M(n,k, 2) denote the minimum size of a family of k-subsets of 
[n] which contain every ¢-subset of [n] at least once. Then as n — oo, we have 


M(n, kyl) = (1+ 0(1)) 2. 
(7) 
This was generalized by Pippenger and Spencer [PiSp89] to general hypergraphs with 
small co-degree. 


EXAMPLES 


E12: Johansson [Jo96] used the nibble to show that the chromatic number of a triangle 
free graph is O( 4): The main idea is to randomly color a small fraction of the 
vertex set, update the lists of colors available at each vertex and repeat. The proof is 
complicated by the need to choose colors non-uniformly. Also, one needs to use the local 
lemma to show that with positive probability the vertex coloring has some regularity 


properties. 


E13: Kim [Ki95] used the nibble to show that R(3,t) = Q(;5;) where R(3,t) is the 
minimum n such that every Red-Blue coloring of the complete graph K,, contains either 
a Red triangle or a Blue copy of Ky. This coincides with the upper bound of OG) 
proved earlier by Ajtai, Komlés and Szemerédi [AjKoSz80]. 


E14: Kahn [Ka96] used the nibble to prove that the list chromatic index of a graph 
G is A+ o(A). Here we properly color the edges of a graph G, using lists of colors for 
each edge. 


E15: Molloy and Reed [MoRe98b] used the nibble to show that the total chromatic 
number of a graph is at most A+ O(1). The total chromatic number is the minimum 
number of colors needed to color the edges and vertices of a graph so that no edge or 
vertex is incident /adjacent to an edge/vertex of the same color. 
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8.4.5 Bounds on Tails of Distributions 


The probabilistic method often deals with events of low probability and has to use 
estimates for the probability of a large deviation of some random variable. The following 
two inequalities are widely used in probabilistic combinatorics. 


FACTS 


F4: (Corollary to the Azuma—Hoeffding inequality — e.g., see [AlSp00]): Let Z = 
Z(Y1, Yo,---; Ym) be a random variable, with Y,, Y2,..., Yj, independent. Suppose also 
that changing the value of one variable Y; only changes the value of Z by at most one. 
Then for any t > 0 we have 


Pr(|Z — E(Z)| >t) < exp {> (8.4.4) 


F5: Suppose that we choose a random subset S of some set X, such that each x € X 
is chosen independently with probability p,. For a collection A,, A2,..., Am of subsets 
of X, we want an estimate of the probability II that S does not contain any of the Aj. 
Janson [Ja90] proved an upper bound on II which is the meat of the inequality. The 
lower bound 


I> Ila — Pr(A; € X)) (8.4.5) 


| 
ma 


a 


follows directly from the FKG inequality of Fortuin, Kasteleyn and Ginibre [FoKaGi71]. 


EXAMPLES 


E16: Inequality (8.4.4) was used by Frieze, Gould, Karoriski, and Pfender [FrGoKaPf02] 
in their paper on graph irregularity strength. Suppose that we weight the edges of a graph 
G with integers from {1,2,...,m}. The weight of a vertex is the weight of all its incident 
edges. A weighting is proper if every vertex has a different weight. The strength o(G) 
of a graph G is the minimum m for which a proper weighting exists. One result from 
[FrGoKaPf02] is that if G is r-regular and r < (n/Inn)!/4 then s(G) < 1410n/r. Part 
of the proof involves randomly weighting each edge with a one or a two and then using 
(8.4.4) to bound the probability that some vertex weighting is repeated much more than 
its expected number of times. 


E17: Inequality (8.4.5) was used in [BoFrMaRuSm] to give a simple proof of the 
following result. Suppose we have k-bounded proper coloring of the edges of K,, and 
m > 2k'/2n3/2, Then there must be a multi-colored copy of K,,. We simply choose a 
random set, where each vertex of K,, is chosen with probability p = 2n/m. Then we 
use (8.4.5) to bound the probability that we do not choose two edges from the same 
color class. 


REMARKS 


R6: Sometimes a related inequality due to Talagrand [Ta96] can be used in place of 
Inequality (8.4.4). 


R7: The interested reader can learn more about this subject and the related subject 
of random graphs from [AlSp00], [MoRe02], [Bo01], and [JaLuRu]. 
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8.4.6 Dependent Random Choice 


The most straightforward way to select a subset U of vertices in a graph is to 
independently accept each vertex at random with the same probability. Sometimes, one 
wishes to use U as a base for finding a desired substructure in the graph. In applications 
related to subgraph embedding, it can be very beneficial to find a set U for which all (or 
at least, many) of its r-subsets have many common neighbors, for some r of interest. It 
turns out that such “rich” sets can actually be constructed randomly, although not via 
independent choices. This robust technique, called dependent random choice, has now 
seen numerous applications, many of which are outlined in the recent survey of Fox and 
Sudakov [FoSu11]. 


DEFINITION 


D6: In this section, the common neighborhood N(S) of a collection of vertices S is the 
set of all vertices which are simultaneously adjacent to every single vertex of S. 


The following result is the key lemma that appears in, or serves as a prototype for 
applications of dependent random choice. 


FACT 


F6: (As formulated in [FoSull1].) For every n, d, s, and k, every n-vertex graph with 
average degree d contains a subset U of at least 


dt ‘ 

max ¢——_ — (” : :tEZr 
not s} \n 

vertices, such that every subset S C U of size s has |N(S)| > k. 


REMARK 


R8: The proof of this main technical lemma is short and flexible, and uses the alter- 
ations method introduced earlier in this chapter. Indeed, one first selects an auxiliary 
subset T of vertices by independently picking exactly ¢ uniformly random vertices, with 
replacement, for a certain optimal t. Then, one takes N(T), and removes one vertex 
from every problematic s-subset, producing U as the result. 


EXAMPLES 


E18: The Turdn number of a graph F, denoted ex(n,F'), is the maximum num- 
ber of edges in any n-vertex graph with no subgraph isomorphic to F’. Erdés, Stone, 
and Simonovits [ErSi66, ErSt46] determined that for every fixed graph F, ex(n, F) = 
(1- aca) ce +o(n?), where x(F) is the chromatic number of F’, resolving this asymp- 
totically for non-bipartite F'. Erdés conjectured [Er67] that for bipartite graphs F' with 
degeneracy r, ex(n, F) = O(n?-/"). Alon, Krivelevich, and Sudakov [AlKrSu03] dis- 
covered a short proof using dependent random choice that ex(n, F) = O(n?-'/4), where 
A is the maximum degree of F’. In the same paper, they extended the technique to prove 
that ex(n, F) = O(n?-V@r)), 
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E19: In additive combinatorics, one often considers the sumset A+ A of a subset 
of integers with itself, which is defined to be the collection of all possible sums of two 
not necessarily distinct elements of A. The sumset can have size which is linear in 
the size of A when, for example, A is an arithmetic progression, and much research 
has investigated the relationship between small sumsets and progression-like behavior. 
See, e.g., the book [TaVu] by Tao and Vu. The Balog-Szemerédi-Gowers theorem (see 
[BaSz94, Go98, SuSzVu05]) considers partial sumsets. Given two sets A and B of, say, 
n integers each, and a 2n-vertex bipartite graph G with one part corresponding to A 
and the other corresponding to B, one can define A+g B to be the set of all sums a+ 8, 
where a € A and b € B form an edge in G. (The full sumset A+ B corresponds to the 
case when G is the complete bipartite graph.) The result states that for any 6 and C, 
there are « and C’ (independent of n) such that whenever A and B are two n-sets, G 
has at least 6n? edges, and |A +q B| < Cn; then there exist A’ C A and B’ Cc B with 
|A’|, |B’| > en, and the full sumset |A’ + B’| < C’n. The original proof by Balog and 
Szemerédi produced tower-type dependencies, due to its use of the Regularity Lemma. 
The approach of Gowers used the philosophy of dependent random choice to establish 
the technical core of the proof, which was a purely graph theoretic statement. 


E20: A graph homomorphism from H to G is a not necessarily injective map from the 
vertices of H to the vertices of G, such that whenever wv is an edge of H, the images of 
u and v are adjacent in G. The homomorphism density t~(G) is the probability that a 
uniformly random map from V(H) to V(G) is a homomorphism. Erdés and Simonovits 
[Si84] and Sidorenko [Si93] conjectured that for every m-edge bipartite graph H and 
any graph G, 
tu(G) 2 tK,(G)”. 

Since tx, (G) is essentially the edge density of G, this conjecture can be interpreted as 
saying that the Erdés—Rényi random graph minimizes the number of copies of H, given 
a fixed number of vertices and edges. Recently, Conlon, Fox, and Sudakov [CoF0Su10] 
used dependent random choice to prove this conjecture for the case when H has a 
vertex that is adjacent to every vertex in the other part of the bipartition. From this, 
an approximate version of the conjecture follows for all H, in which the exponent m on 
the right hand side can, for example, be replaced by m+n, where n is the number of 
vertices in H. 
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MOTIVATION 


When taking limits of rational numbers, we find out the realm of all real numbers. 
They provide better understanding of the variety of rational numbers and turn out to 
be a natural notion which enables us to simplify many arguments and develop more 
sophisticated mathematical tools. 


Discrete mathematics is traditionally about all but taking limits. However, Lovasz 
et al. [BCLt08, BCL10, BCL*06, LS06] introduced a natural notion and developed a 
powerful theory of graph limits. In this theory, finite graphs are viewed as elements of a 
certain metric space (see Section 8.5.4). The completion of this metric space, obtained 
by adding all limits of “convergent” graph sequences, provides a similar boost as the 
real numbers provide for the rationals. What we obtain is a complete metric space and 
the limits themselves can be viewed as generalizations of graphs; thus they are called 
graphons. Their main use is in understanding, exploring, and manipulating very large 
graphs. Recent advances in mathematics, computer science, bioinformatics, life sciences 
and social sciences show the need to analyze large combinatorial objects, graphs being 
the most important among these; see [Lov09]. Emergence of the probabilistic method 
[AS08] and success of Szemerédi’s Regularity Lemma (cf. Section 8.5.5) boosted interest 
in (very) large graphs as well. 

The theory of graph limits emerged from three seemingly unrelated areas: 


e Random graphs (see Section 8.2 or [AS08, Bol01]) and quasirandom graphs [CGW89]. 

e Szemerédi’s Regularity Lemma [Sze78] (see also [KS96, KSSS02, SS91, Tao06]) 
and its algorithmic version [ADLt 94, FK99b, FK99al. 

e Computational complexity ([AFNS06, LS10, BCLT06]). 
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Proper understanding of these areas is fundamental for understanding of graph limits 
and their applications. 


8.5.1 Graphs, Weighted Graphs, and Graphons 


Graphons are generalizations of finite graphs. These objects can be used to describe 
large dense graphs and their limits in a similar manner as the adjacency matrices are 
used to represent finite graphs. 


DEFINITIONS 


D1: <A graphon is a symmetric measurable function W : [0,1] x [0,1] — R. By 
measurable function we refer to the Lebesgue measure on the unit interval [0,1], and 
by symmetric function we mean that W(x, y) = W(y,«) for every x,y € [0,1]. The 
graphon W is a simple graphon if 0 < W(a,y) <1 for every x,y € [0,1]. 


|] is measure-preserving if for every measurable set 


D2: A map ¢: [0,1] > [0,1 
(A) is measurable and has the same measure as A. 


A C [0,1], the preimage ¢~+ 


D3: Two graphons W and U are isomorphic graphons if there exist measure- 
preserving maps ¢ : [0,1] > [0,1] and y : [0,1] > [0,1] such that W(¢(x), d(y)) = 
U(a(x), ~(y)) for every x,y € [0,1]. The pair (¢,~) is said to be an isomorphism 
between graphons W and U. 


D4: If G is a graph with vertices v1,..., Un, then we define the graphon correspond- 
ing to the graph G as the graphon Wg defined as follows. If x € (0,1], we define 
i = [na] and set uz = v; to be the ith vertex of G. For x = 0 we set uz = v;. Then we 


define 
_ fj 1, ifuz and u, are adjacent; 
Wola) = { 0, otherwise. 


D5: A weighted graph is a graph G together with vertex-weights a, = a,(G) € Rt 
(v € V(G)) and edge-weights 8. = 6.(G) (e € E(G)). The total weight of the 
weighted graph G is ag = vev(a) Qy(G). 


NOTATION: If S C V(G), then we write a(S) = )0,¢gQ(G) and a(G) = ag = 
a(V(G)). 


D6: Let G be a weighted graph with vertices v1,...,Un, vertex weights a, (v € V(G)), 
and edge-weights 6. (e € E(G)). Set z = 0 and define z; = 2-1 + Qy,/a(G) for 
i=1,...,n. Finally, define the graphon We corresponding to the weighted graph 
G as follows. If x,y 40 and z-1 <a < % and zj_-1 < y < z;, then 


_ Jf Bojyo;, if; and v; are adjacent; 
Wale.) = { 0, otherwise. 


If x = 0 or y = 0, we define We(z, y) to be 0. 
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D7: If G and Z are weighted graphs, a mapping ¢ : V(G) > V(#) is an isomor- 
phism of weighted graphs if it is bijective and for any vertices u,v € V(G), we have 
Qa(v)(H) = a,(G) and Bgiu)4(v)(1) = Buv(G). An automorphism of a weighted 
graph H is an isomorphism of H with itself. The automorphisms of H form a group 
acting on V(#), and they partition V(H) into orbits of this action. 


D8: Two vertices v,v’ of a weighted graph G are twins if for every u € V(G), we have 
Boul(G) = Boru(G). If v and v’ are twins, let H be the weighted graph obtained from 
G by replacing the vertices v and v’ with a new vertex w whose weight is a,,(H) = 
a,(G)+a,/(G) and the edge-weights 3,,,,(H) are equal to 6,,,(G) for u € V(G)\ {v, v’}. 
Other vertex and edge-weights in H are the same as in G. The graph H is said to be 
obtained from G by merging the twins v and v’, and G is obtained from H by 
splitting the vertex w into vertices v and v’. 


REMARKS 


R1: Similarly as for the adjacency matrix of a graph, the graphon Wg is only defined 
up to isomorphism of graphons as different permutations of the vertex-set give different 
(but isomorphic) outcomes. 


R2: The definition of the isomorphism of graphons involves two measure-preserving 
maps, ¢ and w. Both of them are needed since a measure-preserving function does not 
always have a measure-preserving inverse. (An example of such a function ¢ : [0,1] > 
[0, 1] is given by the rule $(x) = |1 — 2z|, which is a measure-preserving map and is 2-1 
almost everywhere.) 


R3: It makes sense to extend the definition of edge-weights to all pairs of vertices by 
setting By,(G) = 0 if uw ¢ E(G). This convention will be used throughout the whole 
section. 


FACTS 


F1: For a (simple or weighted) graph G and a positive integer k, let G[k] be the graph 
obtained from G by replacing each vertex by k twin vertices and replacing each edge 
uv by the complete bipartite graph K;,,, joining the copies of v with copies of u. Then 
both graphs have isomorphic graphons, We = Weyx). 


F2: A weighted graph and any weighted graph obtained from it by merging twins or 
splitting vertices into twins have isomorphic graphons. 


F3: If vertex and edge-weights of a weighted graph G are positive integers, then there 
is an unweighted graph H such that Wg = Wy. The graph H can be obtained from 
G by replacing each vertex v with a, twin vertices and replacing each edge e with (. 
multiple edges joining the same pair of vertices. 
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8.5.2 Homomorphism Density 


DEFINITIONS 


D9: If F and G are simple graphs, then we denote by hom(F,G) the number of 
homomorphisms F -—> G, i.e., the number of maps ¢: V(F) + V(G) such that for 
every edge uv € E(F), $(u)d(v) is an edge in G. 


D10: The homomorphism density for two simple graphs F' and G, denoted by 
t(F,G), is the normalized value of hom(F, G), 


t(F,G) = 


D11: Homomorphism density between weighted graphs: Let 6: V(F) > V(G) 
be a mapping between weighted graphs F' and G. Let us set 


ag = II (agi (G))e™ and bg = Il (Byuyo(v)(G))Pur 
veEV(F) uve E(F) 


and then define 
hm(F,G@)= SY) ag By 


¢:V(F)3V(G) 


and 
hom(F,, G) 


a(G)o) 
where a(G) and a(H) denote the total weight of vertices of the graphs. 


t(F,G) = (8.5.1) 


REMARKS 


R4: The homomorphism density tells us what fraction of all maps V(F) > V(G) 
are homomorphisms; it can be viewed as the probability that a randomly chosen map 
V(F) > V(G) is a homomorphism. 


R5: If ¢: V(F) > V(G) is a mapping between weighted graphs F' and G, then 
the number of ways twins of v can be mapped to different twins of $(v) is equal to 
(av) (G))*“). This explains the definition of ag in Definition D11. 


R6: If F and G are multigraphs (multiple edges and loops allowed), then homomor- 
phisms also specify which edges joining u and v are mapped to which edges between 
o(u) and ¢(v). If Bu, (F’) is the multiplicity of the edge wv in F and Bg(x)4(v)(G@) is the 
edge multiplicity in G, then the number of ways to map these edges onto each other 
iS (B4(u)o(v) (G))P«”) if d(u) and ¢(v) are fixed. This explains the definition of By in 
Definition D11. 


EXAMPLES 


El: hom(G, K;) is the number of k-colorings of the graph G. 
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E2: hom(K;,G)/k! is the number of k-cliques in G. 


E3: If B is the graph with two adjacent vertices and a loop at one of them, then 
each homomorphism of a graph G into B is determined by the set of vertices that are 
mapped to the vertex without the loop. The set of these vertices is independent in G, 
and any independent set can arise in this way. Thus, hom(G, B) is equal to the number 
of independent sets in G. 


FACTS 


F4: Let H be a simple graph. Then the homomorphism densities into H satisfy the 
following submodularity property: If F and F’ are simple graphs on the same vertex-set, 
then 

t((PUF’,H)+t(FPO F’,H) >t(F,H)+t(F"’, A). 


F5: In the setting of weighted graphs, the densities and homomorphisms can be treated 
in the same way. Namely, if G is the weighted graph obtained from G by dividing all 


vertex weights by a(G), then a(G) = 1 and thus 


t(F,G) = t(F,G) = hom(F,G). 


F6: Let F bea simple graph with vertex set {1,2,...,k}. The homomorphism density 
t(F,G) of F into a simple graph G can be expressed as follows: 


1: =p 1 
(FG) = f i of II Wea(ai, 47) dridr2...dxp. 
SS 
k 


ij€E(F) 


NOTATION: One can use the following shorter notation for multiple integrals over the 
k-dimensional cube [0, 1]* like the one appearing in Fact F6: 


[ff fooex= [0 foo fl fen a andes... din 


DEFINITIONS 


D12: Let F be a simple graph with vertex set {1,2,...,k} and let W be a graphon. 
The homomorphism density for F and W is the number 


«ewy= fff I] Wi 2;) 4x. 


ij€E(F) 


FACTS 


F7: t(k1,W) =1 (we say that the homomorphism density is normalized). 
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F8: Homomorphism densities are multiplicative: For arbitrary disjoint simple graphs 
FF’ and every graphon W, we have 


(FU F’,W) =t(F,W)t(F’,W). 


F9: If G is a (simple or weighted) graph, then t(F,G) = t(F,Wa), where Wg is the 
graphon corresponding to G. 


F10: If two graphons W and W’ are isomorphic, then they have the same homomor- 
phism densities, i.e., for every simple graph F’, we have t(F,W) = t(F, W’). 


8.5.3. Convergent Sequences of Graphs and Graphons 


Graphons can be considered as limiting objects of sequences of finite graphs. In this 
section we reveal this relationship. 


NOTATION: We write (X;,,) or (Xn)n>1 for a sequence of objects, X), Xo, X3,.... 
DEFINITIONS 


D13: Let (G,,) be a sequence of (simple or weighted) graphs. We say that the sequence 
is convergent if for every simple graph F’, the sequence of homomorphism densities 
t(F,G,) converges. If there exists a graphon W such that lim, t(F,Gn) = t(F, W) 
for every simple graph F’, then we say that W is the limit of the convergent graph 
sequence (G,), and we write W = limyp+o Gn. 


D14: A sequence (W,,) of graphons is convergent if for every simple graph F, 
limn+o t(F,W,) exists. A graphon W is the limit of the convergent sequence 
(W,,) of graphons if lim, ..t(f,G,) = t(F,W) for every F’; in this case we write 
W = limpn—oo Wn. 


FACTS 
F11: Every simple graphon is a limit of a convergent sequence of simple graphs. 
F12: Every graphon is a limit of a convergent sequence of weighted graphs. 


F13: If (G,,) is a convergent sequence of (simple or weighted) graphs, then there exists 
a graphon W that is the limit of the sequence, W = limyn_.. Gn. 


F14: If (W,,) is a convergent sequence of graphons, then there exists a graphon W 
that is the limit of the sequence, W = lim,_,., Wn. 


F15: The limit of a convergent sequence of graphs or graphons is determined uniquely 
up to isomorphisms of graphons. 
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8.5.4 Metric Space Topology on Graphs and Graphons 


The set of graphs and the larger set of graphons can be viewed as a metric space 
whose topology gives the same convergent sequences as defined via homomorphism 
densities. This setup enables one to use analytic techniques when studying large graphs. 
The space of graphons turns out to be a complete metric space in the sense of Cauchy 
convergence. 


DEFINITIONS (METRIC SPACE) 


D15: A metric on a set X is a function d: X x X > R satisfying the following 
conditions: 


(a) d(x,y) = d(y, x) for every x,y € X. 

(b) d(x, y) > 0 for every x,y € X. 

(c) d(x, y) = 0 if and only if « = y. 

(d) d(a,y) < d(x, z)+d(z,y) for every x,y,z © X. 


D16: A metric space is a pair (X,d), where X is a set and d is a metric on X. 


D17: A sequence (x,,) of elements of X is convergent in the metric space (X,d) if 
there exists « € X such that d(z,,7) + 0 as n > oo. In such a case we say that x is 
the limit of the sequence (x). 


D18: A sequence (x,,) of elements of X is a Cauchy sequence in the metric space 
(X,d) if for every « > 0 there exists an integer mp such that for every m,n > no, 
d(an,Lm) <€. 


D19: A metric space (X,d) is complete if every Cauchy sequence is convergent. 


D20: Let (X,d) be a metric space. If (x,) and (yn) are Cauchy sequences of elements 
of X, then we define d((an), (Yn)) = limn+co d(an, yn). The two Cauchy sequences (2p) 
and (yy) are said to be equivalent sequences if d((an),(Yn)) = 0. This induces a 
metric d on the set X of equivalence classes of all Cauchy sequences. The resulting 
metric space (X,d) is called the completion of the metric space (X,d). It turns out 
that every metric space has a completion and that the completion is uniquely defined 
as the smallest complete metric space that contains (X,d) as a metric subspace. Note 
that every « € X can be identified with the equivalence class of the constant sequence 


(Goes k) 


DEFINITIONS (CUT DISTANCE) 


D21: Cut distance of labeled graphs: If G and H are two graphs of order n on 
the same (labeled) vertex set V = V(G) = V(#), then we define the cut distance of 
G and H as 


1 
d (G, H) = = gnax,|ec(5,T) _ ex(S,T) 


; 
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where eg(S,T) (and similarly ey($,T)) denotes the number of edges between S and T 
with edges whose both ends are in SMT counted twice, i.e., 


ecg(S,T) = |{(s, t) |sES,teT, ste E(G)}\. 


D22: Cut distance of weighted labeled graphs: The definition of the cut distance 
extends to weighted graphs on the same set of vertices if the vertex weights are the 
same in both graphs. In that case, the normalization factor 1/n? is replaced by 1/a(G)? 
(where a(G) is the total vertex weight of G) and define 


=S°S¢a.(G) G)Bsi(G ). 


seESteT 


The edge count e77(S, 7) in H is defined in the same way. Then we set a = a(G) = a(H) 
and define 


1 
do(G, H) = a max lec( (S,T) — ex(S,T)|. 


D23: Cut distance of unlabeled graphs: If two graphs, G and H, have the same 
number of vertices that are not labeled, then one can consider all possible labelings and 
select those whose cut distance is the smallest: 


ba(G, H) = ming q do(G, A) 


where the minimum runs over all labelings e3 H of graphs G and H. 


Fact F16 motivates the way to define the cut distance also for graphs whose numbers 
of vertices are different: we first blow up both graphs so that the resulting graphs have 
the same number of vertices, and then use Definition D23. To deal also with the 
weighted case and to involve a corresponding generalization of relabelings, the notion 
of a fractional bijection from V(G) to V(#) is needed. 


D24: Let G and G’ be weighted graphs with normalized vertex-weights, i.c., a(G) = 
a(G’) = 1. Let V = V(G) and V’ = V(G’). A function X : V x V’ > [0,1] isa 
fractional bijection V > V' if 


S- X(v,v') = ay(G) for every v € V, and 
v'eV'’ 

ye X(v,v') = ay(G’) for every v' € V’. 
vEeV 


The transpose X7 : V’x V — (0, 1] of a fractional bijection X, defined by X7(v’,v) = 
X(v,v’), is a fractional bijection V’ > V. 


D25: Having a fractional bijection X from V(G) to V(G’), we define a weighted 
graph G[X] on the vertex set V x V’. The vertex-weights of GLX] are a(»7)(G[X]) = 
X(v,v’) and the edge-weights are Biy.w)(u, (uw (G [X]) = Bou(G). One can view this as 
splitting each vertex vu € V into es vv’ (v' € V’). Similarly we define G’[X7]. 
There is obvious bijection between their vertex sets, so we can define their cut distance 
dg(G[X], G'[X7]) by Definition D22. 
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D26: Cut distance of arbitrary unlabeled weighted graphs: The cut distance 
6o0(G, G’) is defined as 


bo(G,G') = infd (G[X], G@’[X7]) 
= tee ys X(v,v')X(u, u’) (Bou(G) — Buu (G’)) , 


vv'ES uw’ ET 


where the infimum is taken over all fractional bijections X € [0,1]”*Y’. 


D27: The cut norm of a graphon W is defined as: 


ap W (x,y) da dy 


where the supremum runs over all measurable subsets A, B of the unit interval. 


|Wlo = sup 
A,BC[0,1] 


9 


D28: Cut distance of graphons: For graphons U and W, we define dq(U,W) = 
||U —W || and 60(U, W) = inf da(U, W), where the infimum is taken over all graphons 
W and over all graphons U that are isomorphic to W and U, respectively. 


FACTS 


F16: For a graph G and a positive integer k, let G[k] be the graph obtained from 
G by replacing each vertex by & twin vertices (or equivalently, multiplying all vertex 
weights by k) and replacing each edge uv by the complete bipartite graph Ky, joining 
the copies of v with copies of u. Then dg(G, G[k]) = 0. 


F17: For graphons U and W, we have 69(U,W) = infdg(U, W) = inf d (U,W), 
where the infima are taken over all graphons W and over all graphons U, respectively, 
that are isomorphic to W and U, respectively. 


F18: If G and H are (simple or weighted) graphs, then their cut distance is the same 
as the cut distance between the corresponding graphons, 69(G, H) = dg(We, Wa). 


F19: The cut distance dq(U,W) defines a metric on the space W of all graphons. In 
this metric, (W, dq) becomes a complete metric space. 


F20: A sequence (G,,) of graphs (respectively, a sequence (W,,) of graphons) is conver- 
gent if and only if it is a Cauchy sequence in the metric space (W, dp), i.e., whenever 
n,m — co, we have dg (Gn, Gm) > 0 (respectively, d5(Wn, Wm) > 0). 


F21: The cut norm of a graphon can be expressed as 


|Wllo = sup 
0<f.g<1 


[ ih W (x,y) f(a)g(y) dx dy) , 


where the supremum runs over all measurable functions f,g : [0,1] — [0,1]. 


F22: For every simple graph F and arbitrary graphons U and W, we have: 
Jt(F, U) —t(F,W)| < |E(F)|- d0(U, W). 
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8.5.5 Regularity Lemma and Approximation 
of Graphons by Weighted Graphs 


Every graphon can be approximated by a large weighted graph. An important fact 
is that for a given “error of approximation”, the number of vertices of such a graph 
is bounded, independent of the graphon to be approximated. This is made possible 
by applying the Szemerédi Regularity Lemma [Sze78]. This important result states 
that the vertices of every large enough graph can be divided into subsets of about the 
same size so that the edges between different subsets behave almost randomly. The 
Regularity Lemma has numerous applications in graph theory (see [KS96, KSSS02]). 
Additionally, it became an invaluable tool in number theory after it was used in the 
celebrated result of Green and Tao [GT08] that prime numbers contain arbitrarily long 
arithmetic progressions. 


The basic setup for the Regularity Lemma is to partition the vertex set of a graph G 
into a bounded number of parts V; U--- U Vz such that the edges between (almost 
all pairs of) different parts behave almost randomly. The formulation involves a fixed 
(small) positive real number ¢ > 0 and uses a notion of a random-like property given 
in Definition D30. 


DEFINITIONS (REGULAR PARTITION) 


D29: Let X,Y be disjoint sets of vertices in G. The density of edges between X 


and Y is the number 
eg(X, Y) 


|X| |¥] 
where eg(X,Y) denotes the number of edges of G between X and Y. 


d(X,Y) = 


D30: Let ¢ > 0 be areal number. A pair (X,Y) of disjoint nonempty vertex-sets of 
G is an e-regular pair if for every X’ C X and every Y’ C Y with |X’| > e|X| and 
|Y’| > e/Y|, we have 

\d(X’, Y’) —d(X,Y)| <e. 
Observe that € plays a dual role; it is used in the lower bound on the size of vertex 
sets X’ and Y’ and is used again as the upper bound on the discrepancy between the 
densities. 


D31: A partition V = V, U---U Vz of a set V is said to be a partition into k parts. 
Such a partition is denoted as P = {Vj,..., Vi}. 


D32: A partition P = {Vj,...,Vi} of a set V is balanced if | |V;| — |Vj|| <1 for all 
i,j © {1,...,k}. 
D33: A partition P = {V,...,V,} of the vertex-set V of a graph G is ¢-regular if 


e P is balanced, and 


e all but at most ek? pairs (Vi, V;) with 1 <i <j < k are e-regular. 


D34: Given a partition P = {Vi,..., Ve} of the vertex-set V of a graph G, we define 
a weighted graph Gp with vertex-set [k] = {1,...,k} as follows: 
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e The weight of the vertex i € [k] is a; = |V;|/|V(G)|, and 
e the weight of the edge ij is Bi; = ea(Vi, V3) /(IVil Vj) = d(Vi, Vj). 


We let Wp denote the corresponding graphon of Gp. The same definition can be made 
if the graph G is weighted by replacing cardinalities of vertex- sets by their total weight 


and by defining eg(V;,V;) as in Definition D22. 


FACTS 


F23: (Regularity Lemma [Sze78]) For every « > 0 and every integer L, there is an 
integer M such that every graph G with at least DL vertices has an ¢-regular partition 
P with L<|P| <M. 


F24: (Weak Regularity Lemma [FK99a]) For every « > 0, every graph G has a partition 
P into at most 4!/©" classes such that dg(G, Gp) <e. 


F25: (Weak Regularity Lemma for weighted graphs [FK99a]) For every € > 0, every 
weighted graph G has a partition P into at most 22/© classes such that d (G,Gp) < 
a (ea(V, yy where V = V(G) and eg(V,V) is defined as in Definition D22. 


F26: For every c > 0, q > 229/©" and every weighted graph G with edge-weights in 
(0, 1], there exists a simple graph H of order q such that d9(G, H) < e. 


REMARKS 


R7: The constant M = M(e, L) involved in the Szemerédi Regularity Lemma (Fact F23) 
is extremely large, and hence the Weak Regularity Lemma of Frieze and Kannan 
(Fact F24) is more appropriate for applications like parameter testing [BCL*06]. 


R8: The Regularity Lemma is trivial for graphs having at most M vertices since the 
partition into singletons is always e-regular. 


R9: Ifthe graph is sparse, e.g., if the number of edges is less than $€2n?/L?, then every 
balanced partition into L parts of size |n/L| is e-regular since the densities appearing 
in Definition D30 are smaller than e. 


R10: Numerous applications of the Regularity Lemma are presented in [KS96, KSSS02, 
SS91, Tao06]. Algorithmic versions of the Regularity Lemma and the algorithmically 
more tractable Weak Regularity Lemma appear in [AN06, ADLT94, FK99b, FK99a]J. 
Additional applications are considered in [AFdIVKK03, AFNS06, LS10, BCLT 06]. 


8.5.6 W-Random Graphs 


The notion of W-random graphs, which generalize traditional notions of random 
graphs (cf. Section 8.2), has a close relationship with graph limits. 
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DEFINITIONS 


D35: If W is a graphon and n > 1 is an integer, one can define the probability space 
G(n,W) of W-random graphs of order n as follows. To obtain a random element of 
G(n, W), select n points v1,...,Un from the unit interval [0,1] independently at random 
(with the uniform probability distribution on [0,1]). Then add the edge u,v; with 
probability W(v;,v;) independently for every 1 <i < j <n. Although it is possible 
that some selected points v; are equal to each other, this occurs with probability 0, so 
U1,-++,Un are almost surely pairwise distinct; hence almost all graphs in G(n, W) are of 
order n. 


A W-random graph drawn from G(n, W) is denoted by G,,,w. For an arbitrary fixed 
simple graph F’, the homomorphism density t(F, G,,w) can be considered as a random 
variable. 


FACTS 


F27: For every simple graph F' and n > |F|, the following statements hold: 
(a) |EU(F, Ga.w)] - HF, W)| < 5 ('5'). 
(b) Var[t(F, Gnw)| < 3|F/?. 
(c) Ifk =|F| and 0 <e <1, then 7 


Pr[ |t(F, Gnw) —t(F,W)| >e] < 2exp(— 55 yi 


F28: The probability that a sequence of W-random graphs (G',,w) is convergent and 
has limit W is equal to 1. 


F29: Every graphon W is a limit of some convergent sequence of graphs. Such a 
sequence can be obtained, with probability 1, by taking a sequence (G,,w) of W-random 
graphs. 


8.5.7 Graph Parameters and Connection Matrices 


An important aspect in understanding graph limits is through the study of connec- 
tion matrices of homomorphism densities. They give rise to the use of algebraic tools 
and have application in extremal graph theory and theoretical computer science. 


DEFINITIONS (GRAPH PARAMETERS) 


D36: A graph parameter is a function defined on all (finite) graphs that is invariant 
under graph isomorphisms. We may allow multiple edges and loops. If a parameter 
is defined only for simple graphs, we can extend its range to allow parallel edges by 
defining its value in such a way that we first replace multiple edges by single edges (and 
delete all loops). Such a parameter is invariant under multiplying edges, and is said to 
be a simple graph parameter. 


D37: A graph parameter p is multiplicative if p(GU H) = p(G) p(H) whenever G 
and H are disjoint graphs. 


D38: A graph parameter p is normalized if p(k1) = 1. 
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EXAMPLES 


E4: Homomorphism counting function hom(-, H) into a fixed weighted graph H deter- 
mines a graph parameter homy that is multiplicative: If FU F’ is the disjoint union of 
two graphs, then hom(F U F’, H) = hom(F’, H) hom(F’, H). 


E5: The density function ty = t(-,H) can be interpreted as the homomorphism 
function into the normalized weighted graph H and is therefore also multiplicative. 
Moreover, the homomorphism density parameter ty is normalized. 


FACT 


F30: (Lovasz [Lov06]) Let G and H be weighted graphs without twin vertices and 
with the same total weight a(G) = a(H). Then G and H have the same densities, i.e., 
t(F,G) = t(F, H) for every simple graph F’, if and only if they are isomorphic. 


DEFINITIONS (&-LABELED GRAPHS) 


D39: Let k > 0 be an integer. A k-label on a graph G is a sequence L = (Ly,..., Ly) 
of & distinct vertices of G. Having such a k-label, we say that the vertex L; is the ith 
labeled vertex, or the vertex with label i (1 <i<k). 


D40: A k-labeled graph is a pair (G, L), where G is a graph of order at least k and L 
is a k-label on G. We usually omit the reference to the labeling L and refer to G itself 
as a k-labeled graph. 


D41: Two k-labeled graphs (G,L) and (G’,L’) are isomorphic if there is a graph 
isomorphism 1 : G — G’ that preserves the labels, i.e., o(L;) = Li for every i=1,...,k. 


D42: Let Lz be the set of all k-labeled graphs up to isomorphism. For G, H € Lx, we 
define the k-sum G-H as the labeled graph obtained from the disjoint union of G and 
HA and then, for 1 = 1,...,k, identify the 7th labeled vertex of G with the zth labeled 
vertex of H to get the 7th labeled vertex of the sum. 


G A 1 G-.H 
I K & 
2 2 2 
Figure 8.5.1: The 2-sum of 2-labeled graphs. 


DEFINITIONS (CONNECTION MATRICES) 


D43: If pis a graph parameter and k > 0, we define the kth connection matria for 
the parameter p as the infinite matrix M = M(p,k) € R&**** whose entries correspond 
to pairs of k-labeled graphs with their values being the values of the parameter on the 
k-sum, Mg.n = p(G- H). 
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EXAMPLE 


E6: The 2-sum G'- H of 2-labeled graphs G and Z is illustrated in Figure 8.5.1. The 
black vertices are labeled as indicated. As an illustration of connection matrices, we 
show a part of the connection matrix M(y,2) for the chromatic number and k = 2 in 
Figure 8.5.2. 


Figure 8.5.2: Part of the connection matrix M(y,2) for y and k = 2. 


NOTATION: When speaking of vectors and matrices, we will employ functional notation. 
Recall that B4 denotes the set of all functions A + B. In this manner, a (real) matrix 
indexed by a (possibly infinite) set L is an clement of R’*"”, and a vector with 
entries indexed by L is an element in R’. As usual, we use index notation to denote 
specific entries of matrices and vectors, and we apply usual notation from linear algebra 
to denote the matrix-matrix or matrix-vector multiplication, the transpose notation, 
etc. 


y] 


D44: The rank of an infinite matriz is the supremum of ranks of its finite (prin- 
cipal) submatrices. 


D45: An infinite matrix M is positive semidefinite if every finite principal submatrix 
of M is positive semidefinite. 


D46: A graph parameter p is reflection positive if its connection matrices M (p, k) 
are positive semidefinite for each k > 0. 


FACTS 


F31: A matrix M € R&*** has rank at most r < oo if (and only if) there are 2r 


vectors %1,...,@, and y1,..., yy in R** such that 
e 
M=Soui-y, (8.5.2) 
i=1 


where the product F = x-y? € R&**£« denotes the matrix with values Fo. = tcyun.- 
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F32: A matrix M € R&**** is positive semidefinite if and only if it can be written 
in the form M = Oe, %i° x}, where x; € R** for i € I. If the matrix M is positive 
semidefinite and has finite rank r, then it is possible to choose I so that |I| =r. 


F33: (Freedman, Lovasz, and Welsh [FLW]) If rank(M(p, k)) is finite for some k > 0, 
then the parameter p can be computed in polynomial time for graphs of tree-width at 
most k. 


F34: Every graph parameter p can be considered as an element in R“*. If p is 
multiplicative, then M(p,0) = p-p’, so M(p,0) is positive semidefinite with rank at 
most 1. For larger values of k, multiplicative parameters satisfy the following relation: 


rank(M(p,k+1)) > rank(M(p, k)) - rank(M(p, 1)). 


F35: Connection matrices M(homy,k) and M(ty,k) corresponding to homomor- 
phism counting are positive semidefinite. Thus, the parameters homy and ty counting 
homomorphisms into a fixed weighted graph H are reflection positive. 


F36: For every simple graph H and every k-labeled graph F’, we have 
tu(G-G) > ta(F)’. 
F37: For every integer k > 0, the rank of the connection matrices M(homy,k) and 
M(ty,k) is exponentially bounded: 
rank(M(homy,k)) < |H|*. 
F38: Let g =rank(M(homy,1)). Then rank(M(homy,k)) > q*. 
F39: For every weighted graph H and k > 0, rank(M(homy, k)) = rank(M(ty, k)). 


F40: (Lovasz [Lov06]) If a weighted graph H has no twins and the automorphism 
group of H has q orbits on V(#), then 
rank(M(homy,k)) = q”. 
In particular, if H has no twins and has no nontrivial automorphisms, then 
rank(M(hom;;,k)) = |V(H)|*. 
F41: (Freedman, Lovasz, and Schrijver [FLS07]) Let p be a reflection positive graph 
parameter for which there exists a positive integer q such that rank(M(p,k)) < q* 


for every k > 0. Then there exists a weighted graph H with |V(H)| < q such that 
p = homg, ie., p(G) = hom(G, H) for every graph G. 


EXAMPLES 


E7: Let the parameter e be the graph parameter that counts the number of edges, 
ie., e(G) = ||G||. Since e(G- H) = e(G) + e(H), it follows that 

M(e,k) =e-17+1-e?7, 
where we interpret e as a vector in R&*, and 1 is the constant 1 vector in R“*. Thus, 
the rank of M(e,k) is at most 2. By considering the submatrix of M on the empty 


graph and a graph with one edge, we see that its rank is 2, so the rank of M(e,k) is 2 
for every k. 
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E8: Suppose that for G € Lx, e'(G) denotes the number of pairs of vertices that are 
adjacent in G. That is, we count the number of edges of the underlying simple graph. 
Then e’(G: H) = e'(G)+e'(H) —r, where r denotes the number of pairs of the labeled 
vertices that are simultaneously adjacent in G and in H. For every pair of distinct labels 
i,j (1<i<j<hk), define the vector e;; € R&* that has value 1 for a k-labeled graph 
G if its ith and jth labeled vertices are adjacent, and value 0 otherwise. Then we have 


M(e’,k) =e -17 +1-(e)? + S- Cij * Er. 


ij 
1<i<j<k 


Consequently, rank(M(e’,k)) < 2+ (5). It can be shown that the equality holds by 
looking at the principal submatrix of M corresponding to the k-labeled graphs having 
precisely the edge ij (there are (c) of these) and two additional graphs with edges 


outside of the labeled part. 


E9: Let m(G) denote the number of perfect matchings of the k-labeled graph G. Every 
perfect matching of G- H can be decomposed into a matching in G and a matching in 
H. Suppose that the set of labeled vertices in G and H is [k] = {1,...,k}. Let S C [k] 
be the subset of labeled vertices that are covered by the edges in G, and let S = [k] \ S 
be the vertices covered by the edges of H. Clearly, all perfect matchings, in which S 
is covered by the edges in G and S$ is covered by the edges of H, can be obtained by 
taking a perfect matching in G— S and a perfect matching in H — S. Thus, if mg is 
the parameter that counts the number of perfect matchings in G — S, then 


m(G-H) = S~ ms(G)ms(H). 
SCIk] 


This shows that 


M(m,k) = SS mg-ms 
SCIk] 


and hence rank(M(m,k)) < 2”. It can be shown that the equality holds. 


E10: Let r be a positive integer, and let h,(G) = hom(G, K,) denote the number of 
r-colorings of a graph G. The r-colorings of G'- H can be partitioned into the following 
classes. For every partition 7 of the labeled set [k] into |7| < r parts, we consider all 
r-colorings such that the colors on the labeled set induce the partition 7. Let us denote 
the number of such colorings of a graph G by c,(G). Then 


he(G- H) = = ( ‘: ) en(Ghex(H) 


Iz 


This shows that M(h,,k) is a positive semidefinite matrix whose rank is at most the 
number of partitions of [k] into at most r parts. 


1054 Chapter 8. Analytic Graph Theory 


8.5.8 Extremal Graph Theory 
and the Algebra of Quantum Graphs 


The study of graph parameters can be extended to a formal setting of quantum 
graphs and graph algebras. This approach has applications in extremal graph theory 
(see Section 8.1). It is essentially the same as the theory of Razborov flag algebras 
that was independently introduced in [Raz07, Raz08}. 


EXAMPLE 


E11: The special case of Turdn’s Theorem (see Section 8.1) states that every triangle- 
free graph H of order n contains at most n?/4 edges. This can be expressed with 
homomorphism densities as follows: If ty (43) = 0, then ty(K2) < 1/2. A more general 
statement (proved by Goodman [Goo59]) shows that H contains many triangles if the 
number of edges is larger than n?/4. It can be expressed as follows: 


ty(K3) 2 ty(Ke)(2ty(K2) — 1) (8.5.3) 


and can be derived using multiplicativity and reflection positivity of the homomorphism 
density parameter ty as follows. For i = 1,2, let F; be the 3-labeled graph on vertex 
set {1,2,3} isomorphic to the path of length 2 in which the vertex i has degree 2. Then 
ty (Fy U Fy) = ty (Ks) and ta (Fi ia Fy) = ty (Ko U ky) = ty (Ko). By using Fact F4 
(submodularity) on Fy and F2, we conclude that 


ty (K3) + ty(Ko) = ty(F\) + ty (Fo) = 2t 4 (P3). 


Note that P3 can be viewed as the 1-sum K 2 - K2 of two copies of 1-labeled graph Ko. 
By Fact F36, we conclude that ty(P3) > ty(K2)?; thus, ty(K3) > 2tw(P3) —tu(Ke2) > 
2t7(K2)? — ty (Ke), which gives Inequality (8.5.3). 


NOTATION: For every k > 0, let £; denote the set of all k-labeled graphs, whose labeled 
vertices are [kK] = {1,..., k}. 


DEFINITIONS 


D47: A k-labeled quantum graph G is a formal linear combination of labeled graphs, 


Le., 
G= S- vy H, 


HEL: 


where xy € R for every H € Ly, and only finitely many of the coefficients xy are 
non-zero. The set of all k-labeled quantum graphs is a real vector space, isomorphic to 
a subspace of R**. 


D48: Recall that the k-sum of two k-labeled graphs G and H in L, is the labeled 
graph G- H that is obtained from the disjoint union of G and H by identifying pairs of 
labeled vertices that have the same label. The sum extends to quantum graphs by the 


= wad See) (Sema) teenie 


GEL, HEL: GEL, 
HEL, 
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With this notation the space of quantum graphs R“* becomes a commutative algebra 
(where the k-sum has the role of the product operation in the algebra) which is called 
the algebra of k-labeled quantum graphs. The edgeless graph on labeled vertex set 
[k] is the identity in this algebra. 


D49: If pis a graph parameter, we define its values on quantum graphs by the following 


_ v(G) = »( SS tn Ht) = >> tH0(H). 


HELE HELE 


EXAMPLE 


E12: With the notation introduced above, Goodman’s Theorem (8.5.3) can be stated 
in the form 
ty(K3 — 2( Ko U Ko) + Ko) >0 


where we view K3 — 2(K9U K2)+ Ko as a 0-labeled quantum graph and U denotes the 
disjoint union. 


FACTS 


F42: A graph parameter p is reflection positive if and only if the parameter p is non- 
negative on squares of quantum graphs. Specifically, the connection matrix M(p,k) 
is positive semidefinite if and only if the parameter p is non-negative on squares of 
k-labeled quantum graphs. 


At the time of preparing this section, the topic of exploring uses of analytic tech- 
niques for homomorphism densities is very active. One of the most interesting outstand- 
ing conjectures in this area is the Sidorenko Conjecture, which claims that for every 
bipartite graph H with m edges and every graph G, 


ta(G) > tk, (G)”. 


The Sidorenko Conjecture has been proved for many bipartite graphs H, and the reader 
is referred to [CFS10, Hat10, Lov11] for some of the known results. 
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Glossary for Chapter 8 


alteration method — a probabilistic method: first generate a random object, and 
then alter it to obtain a property we desire. 

Andrasfai graph Aj: for i > 2, the complement of the (i — 1) power of the cycle 
graph C3;_1. 

c-arrangeable graph: a graph G whose vertices can be ordered in such a way that 


for any vertex v, each neighbor of v that succeeds v in the order has at most c 
adjacencies that precede v in the order. 


arrowing graph F' for (G),G2,--- ,G,) — a concept in Ramsey theory: we write F —> 
(G1, Go,-+- ,Gp) if for any k-edge-coloring of F' there is for some 7 a monochromatic 
copy of G; in the i*” color. 


asymptotically normal sequence {X,,} — of random variables: a sequence such that 
for all fixed a, 


‘A 1 a 2 
P(X <a) =o(1) + = | e* /2 dz asn 4 0 
(X <a) =o(1)+ ef 


where X = (X — EX)/VVarX. 
asymptotically Poisson sequence {X,,} — of random variables: a sequence such that 
P(Xn = k) = e7*A*/k! + o(1) 
for all fixed integers k > 0, as n > oo. [Valid in the case that » is any bounded 
function of n.| 
book: a collection of k3-graphs sharing a common edge. 
__, r-book: a collection of (r + 1)-cliques sharing a common K,. 


booksize — of a graph G: the number of K3-subgraphs in the largest book subgraph 
of G. 


chromatic surplus: the largest number s = s(G) such that in every vertex coloring of 
a graph G with y(G) colors, every color class has at least s vertices. 


circumference — of a graph: the length of the longest cycle. 


clique, — in a graph: a maximal complete subgraph; note that there are two different 
definitions. 


clique — in a graph: any complete subgraph, not necessarily maximal under inclusion; 
note that there are two different definitions. 


clique number w(G) — of a graph G: the number of vertices of its largest clique. 


closure — of a graph: the graph obtained from G(n) by successively joining all nonad- 
jacent vertices u and v with deg(u) + deg(v) > n. 
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complex graph: a connected graph G with |E(G)| > |V(G)|. 
concentration inequality: a bound on the probability that a random variable differs 
from its mean by a large amount; a technique within the probabilistic method. 


2.5-connected graph: a 2-connected graph such that any cutset with two vertices is 
independent. 


convex graph property: a property P that a graph G must have, if a subgraph and 
a supergraph of G on the same vertex set both have P. 


k-core — of a graph: the largest subgraph with all vertex degrees at least k. 
decreasing graph property: a property that cannot be lost by deleting edges alone. 
density, — of a graph G: the ratio |E(G)|/|V(G)|. 

__, maximum - of a graph G: the maximum ratio of the number of edges divided 

by the number of vertices, taken over all non-null subgraphs of G. 

density 2 — of a graph G: the ratio |E(G)| OO) : 
excess — of a graph G: the difference |E(G)| — |V(G)]. 
extremal function ex(n, F) — for an integer n and a family F of forbidden graphs: the 


function whose value is the largest number of edges of a simple n-vertex graph that 
contains none of the forbidden subgraphs. 


extremal graph — for an integer n and a family F of forbidden graphs: a graph with 
no forbidden subgraphs and with the largest possible number of edges. 


H-factor — of a graph G: a collection of disjoint copies of the graph H that covers the 
vertices of G. 


first moment method: using the expected value of a random variable to bound the 
probability; a technique within the probabilistic method. 


H-free graph — where 4 is a graph: a graph that has no subgraph isomorphic to H. 
girth — of a graph: the length of its shortest cycle. 
graph parameter: a function defined on all (finite) graphs that is invariant under 
graph isomorphisms. 
__, multiplicative: a graph parameter p such that p(GU HA) = p(G) p(#), whenever 
G and H are disjoint graphs. 
___, normalized: a graph parameter p such that p(K,) = 1. 
graphon: a symmetric, measurable function W : [0,1] x [0,1] > R. 
__, simple: a gryphon W such that 0 < W(a2,y) < 1 for every x,y € [0, 1]. 
hereditary property P — of graphs: a property that is shared with every induced 
subgraph of a graph having P. 
homomorphism from G to H: for simple graphs, a mapping f : V(G) — V(H) such 
that uv € E(G) implies f(u) f(v) € E(#); for general graphs, there is also an edge 
mapping f : E(G) > E(#) such that if u and v are the endpoints of edge e € E(G), 
then f(u) and f(v) are the endpoints of f(e) € E(#). 
hypergraph: a set of vertices V and a set of edges, each of which is a subset of V. 
__, k-uniform: a hypergraph whose edges are all of size k. 


k-hypergraph Ramsey number r;(Gi,G2,---,Gm): the smallest integer n such 
that if the k-sets of a set of order n are colored with m colors, there will be for some 
i an isomorphic copy of G; in color 7. 
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increasing graph property: a property of a simple graph that cannot be lost by 
adding edges alone. 


independence number a(G) — of a graph G: the largest possible number of vertices 
in an independent set. 


independent set — of vertices in a graph G: a set of mutually nonadjacent vertices. 

Lovasz Local Lemma: a tool within the probabilistic method for dealing with weakly 
dependent events. 

minor of a graph G: any graph formed from G by a sequence of edge deletions and 
contractions. 

monotone graph property: a property that is either increasing or decreasing. 

Mycielski graphs: any of the graphs MM), Mo,... in a particular sequence constructed 
inductively by Mycielski. 

pancyclic graph: a Hamiltonian graph having cycles of all possible lengths up to its 
number of vertices. 

phase transition — of the random graph: loosely, the range of density during which 
the largest component grows from very small to very large. 

k* power of a cycle: a graph G with V(G) = {1,...,n} and (i,j) € E(G) if and only 
ifi—jy =+1,+2,...,4k mod n. 

probabilistic method: proving the existence of an object by showing that it exists 
with a positive probability. 

Ramsey graph for the pair (G, H): a graph F’ such that F —> (G, H). 

Ramsey multiplicity R(G,G): the minimum number of monochromatic copies of the 
graph G in any 2-edge-coloring of the Ramsey graph K, where n = r(G,G). 


Ramsey number 7(G,,G2,---,G,): the least positive integer n such that for any 
k-edge-coloring of K,, there is for some 7 a monochromatic copy of the graph G; in 
color 7. 


__, bipartite r,(G, H): smallest number of vertices of a bipartite graph F' such that 
F — (G,4H). 


__, classical — r(G1, Go,--- ,Gx) where each G; is a complete graph Ky,. 


___, connected, denoted by r.(G, H): the least number of vertices in a graph F’ such 
that F —> (G, H) and such that the graph induced by each color is connected. 


__, diagonal: a Ramsey number r(G, H) with G = H. 


__, induced, denoted by r*(G, H): the least positive integer n such that there exists 
a graph F of order n such that any 2-edge-coloring (Red and Blue) of F yields 
an induced copy of G in Red or an induced copy of A in Blue. 


Ramsey-finite pair (G,H): a pair of graphs such that there are only finitely many 
nonisomorphic minimal graphs F' with F' —> (G, H). 

Ramsey-infinite pair (G,H): a pair of graphs such that there are infinitely many 
nonisomorphic minimal graphs F' with F' —> (G, H). 

(Ramsey) minimal graph — for (G,,G2,---,G,): a graph F such that F —> 
(G1, Gz,-+- ,Gp), but no proper subgraph of F' also arrows (G1, Go,--- ,Gz). 
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random graph: a probability space whose domain is the set of n-vertex simple (la- 
beled) graphs on the vertex-set [n] = {1,...,n}; the probability function is deter- 
mined by specifying that the edges occur independently with probability $ each. 

__, binomial G(n, p): a probability space whose domain is the set of n-vertex simple 
(labeled) graphs on the vertex-set [n] = {1,...,n}; the probability function is 
determined by specifying that the edges occur independently with probability 
p = p(n) each; also called the Bernoulli random graph. 

__, uniform G(n,m): the uniform probability space on those simple (labeled) graphs 
on the vertex-set [n] = {1,...,n} with exactly m edges; also called the Erdés- 
Rényi random graph; we observe that the probability of each edge is p(n) = 
m/ (2). 

Rédl1 nibble: a random process that builds the required object a little piece at a time. 

sharp threshold function — regarding the binomial random graph G(n, p), for a graph 
property Q: a function f(n) such that for all « > 0, G(n,p) a.a.s. does not have Q 
if cae <1-—e, and G(n,p) a.as. has Q if ae > 1+ (or the same with “does not 
have” and “has” interchanged). 

size Ramsey number: smallest number of edges of a graph F' such that F —> 
(Gi, Go, as ,Gy). 

__, restricted: the minimum size, i.e., number of edges, of a graph F with r(G, H) 

vertices such that F' —> (G, H). 

square of a cycle: the second power of a cycle. 

star forest: a graph in which each component is a star. 

star: any bipartite graph Kym. 

strictly balanced graph: a graph whose maximum density is strictly greater than 
the maximum density of any of its proper subgraphs. 

threshold function — for a graph property Q: regarding G(n,p), a function f(n) such 
that G(n,p) a.a.s. does not have Q if p = o(f), and G(n,p) a.a.s. has Q if f = o(p) 
(or the same with “does not have” and “has” interchanged). Regarding G(n,m), the 
definition is the same but with p replaced by m. 

Turan graph T,.(n): the complete r-partite graph on n vertices whose classes (partite 
sets) are as nearly equal as possible. 

Turdn number ¢,(n): the number of edges of the Turdn graph of n vertices and 
chromatic number r. 

Turan-type problem: the problem of finding the largest possible number of edges of 
a graph having no graphs is a specified family of forbidden subgraphs. 

weakly pancyclic graph: a graph having cycles of all possible lengths up to its 
circumference. 
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INTRODUCTION 


How far two objects (or sets of objects) are apart in a discrete structure is of interest, 
both theoretically and for its applications. Since discrete structures are naturally mod- 
eled by graphs, this leads us to studying distance in graphs. A book entirely devoted 
to this subject has been written (see [BuHa90]). 


9.1.1 Standard Distance in Graphs 


Although there is not a unique way to define the distance between two vertices in a 
graph, there is one definition of distance that has been used most often and is commonly 
accepted as the standard definition of distance. 


Distance and Eccentricity 


Many of the distance concepts that have been studied have their origins in maximizing 
distance from a vertex. 


DEFINITIONS 


D1: For two vertices u and v in a graph G, the distance d(u,v) from u to v is the 
length (number of edges) of a shortest u—v path in G. 


D2: A u-—v path of length d(u, v) is called a u— v geodesic. 


D3: For a vertex v in a connected graph G, the eccentricity e(v) of v is the distance 
from v to a vertex farthest from v. That is, e(v) = max{d(v, xz): x2 €V(G)}. 
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EXAMPLE 


E1: Each vertex in the graph G of Figure 9.1.1 is labeled with its distance from the 
vertex uv. The distance from v to a vertex farthest from v is 3 and so e(v) = 3. 


Figure 9.1.1: Distances from the vertex v to the vertices of G. 


FACTS 


F1: For vertices u and v in a connected graph G, d(u, v) > 2 if and only if u and v are 
distinct and nonadjacent. 


F2: The distance d defined above is a metric, that is, d satisfies the following four 
properties: 


(u,v) > 0 for all u,v € V(G). 
2. d(u,v) = 0 if and only if u =v. 

(u,v) = d(v, u) for all u,v € V(G) [the symmetric property]. 

(u,w) < d(u,v) + d(v, w) for all u,v, w € V(G) [the triangle inequality]. 
Radius and Diameter 


Two major distance parameters associated with a connected graph G are obtained by 
minimizing and maximizing the eccentricities of the vertices of G. 


DEFINITION 


D4: The minimum eccentricity among the vertices of a connected graph G is the radius 
of G, denoted by rad(G), and the maximum eccentricity is its diameter diam(G). 


REMARK 


R1: The diameter of a connected graph G also equals max{d(z,y): 2,y € V(G)}. 


EXAMPLE 
E2: Each vertex in the graph G of Figure 9.1.2 is labeled with its eccentricity. So 
rad(G) = 2 and diam(G) = 4. ieee cd 


G: 


Oo O 
3 2 3 4 


Figure 9.1.2: Eccentricities, radius, and diameter. 
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FACTS 


F3: For every nontrivial connected graph G, rad(G) < diam(G) < 2rad(G). More 
generally, if a “distance” defined on V(G) is a metric and eccentricity, radius, and 
diameter are defined as expected, then the inequalities rad(G) < diam(G) < 2rad(G) 
always hold. 


F4: [Os73] For every pair r,d of positive integers with r < d < 2r, there exists a 
connected graph G with rad(G) = r and diam(G) = d. Furthermore, the minimum 
order (number of vertices) of such a graph is r + d. 


Center and Periphery 


The radius and diameter of a connected graph G give rise to two subgraphs of G. 


DEFINITIONS 


D5: A vertex v in a connected graph G is a central vertex if e(v) = rad(G), while a 
vertex uv in G is a peripheral vertex if e(v) = diam(G). 


D6: The subgraph induced by the central vertices of a connected graph G is the center 
C(G) of G and the subgraph of G induced by its peripheral vertices is its periphery 
P(G). 


EXAMPLE 


E3: Each vertex in the graph G of Figure 9.1.3 is labeled with its eccentricity. So 
rad(G) = 3 and diam(G) = 5. The center and periphery of G are also shown in 
Figure 9.1.3. 


y 
Z C(G): P(G): e e 
5 


Ww +) h w 


x Ww x Ww 


Figure 9.1.3: The center and periphery of a graph. 


FACTS 
F5: [BuMiS181] Every graph is (isomorphic to) the center of some graph. 
F6: [HaNi53] The center of every connected graph G lies in a single block of G. 


F7: [Jo69] The center of every tree either consists of a single vertex or is isomorphic 
to K2. Furthermore, the center of a tree T consists of a single vertex if and only if 
diam(T’) = 2rad(T). 


F8: [BiSy83] A nontrivial graph G is (isomorphic to) the periphery of some graph if 
and only if every vertex of G has eccentricity 1 or no vertex of G has eccentricity 1. 
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Self-Centered Graphs 


DEFINITIONS 
D7: A graph G is self-centered if C(G) = G. 


D8: An automorphism of a graph G is an isomorphism between G' and itself. The 
set of all automorphisms of a graph G under the operation of composition forms a group 
called the automorphism group of G. 


EXAMPLE 


E4: The vertices of each of the graphs in Figure 9.1.4 are labeled with their eccentric- 
ities. Thus the graphs K2 3 and Cs are self-centered, while P, is not. 


3 
2 2 
3 2 4 & & 4 
: 5 >: OM OOO 
K, , Coo ce BS 
2 2 2 3 


self-centered self-centered non-self—contered 
Figure 9.1.4: Self-centered graphs and a non-self-centered graph. 
FACTS 


F9: [Bu79] Let n > 5 and r > 2 be integers such that n > 2r. Then there exists a 
self-centered graph of order n, size m, and radius r if and only if 


[(mr — 2r —1)/(r —1)] < m < (n? — 4nr + 5n + 4r? — 6r)/2. 
If n = 2r = 4, then m = 4. 


F10: [Bu79] If G is a self-centered connected graph of order n, size m, and radius 2, 
then m > 2n — 5. 


F11: [LeWa90] For a given graph H that is not self-centered, there exists a self- 
centered graph G whose order exceeds the order of H by 3 and such that (1) G contains 
Hf as an induced subgraph and (2) the automorphism group of G is isomorphic to the 
automorphism group of H. 


F12: [LeWa90] For every finite group I, there exists a self-centered graph whose 
automorphism group is isomorphic to I. 
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9.1.2 Geodetic Parameters 
Geodetic Sets and Geodetic Numbers 


For every connected graph G, there are sets S of vertices of G such that every vertex 
of G lies on a geodesic connecting two vertices of S. 


DEFINITIONS 
D9: A vertex w is said to lie in a u—v path P if w is a vertex of P but w 4 u,v. 


D10: For two vertices u and v in a connected graph G, the closed interval I[u, v| 
consists of u, v, and all vertices lying in some u — v geodesic of G; for S C V(G), the 
closed interval I[5S] of S is the union of all sets I[u, v] for u,v € S. 


D11: A set S of vertices of a connected graph G is called a geodetic set in G if 
I[S] = V(G). A geodetic set of minimum cardinality is a minimum geodetic set. 
The cardinality of a minimum geodetic set is called the geodetic number g(G). 


D12: A graph F is a minimum geodetic subgraph if there exists a graph G con- 
taining F’ as an induced subgraph such that V(F’) is a minimum geodetic set for G. 


REMARK 


R2: Each vertex of a connected graph G whose neighborhood induces a complete 
subgraph in G belongs to every geodetic set of G. In particular, each end-vertex of G 
belongs to every geodetic set of G. 


EXAMPLES 


E5: The set S; = {x,y,z} is a geodetic set of the graph G, in Figure 9.1.5. Since 
there is no 2-element geodetic set in G1, it follows that 5; is a minimum geodetic set of 
G, and so g(G,) = 3. In Go, the set Sy = {u,v, w,t} is a minimum geodetic set of Go, 
so g(G2) = 4. Notice that every vertex in 52 has the property that its neighborhood 
induces a complete subgraph in Go. 


Figure 9.1.5: Minimum geodetic sets in graphs. 


E6: The set S = {u,v,w,x} is a minimum geodetic set of the graph G of Figure 9.1.6. 
Since the subgraph (S$) of G induced by S' is (isomorphic to) C4, it follows that C4 is a 
minimum geodetic subgraph. 
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C4: G: 


Figure 9.1.6: A minimum geodetic subgraph. 


FACTS 
F13: [HaLoTs93] Determining the geodetic number of a graph is an NP-hard problem. 


F14: [ChHaZh02] If G is a connected graph of order n > 2 and diameter d, then 
g(G) <n—d-+1 and this bound is sharp. 


F15: [BuHaQu] Let G be a nontrivial connected graph of order n. Then (a) g(G) = nif 
and only if G = K,, and (b) g(G) = n—1 if and only if G = (K,, UK,,U:--UK,,)+ 5, 
where r > 2 and nj, 79,:--: ,n, satisfy ny +ng+-:- +n, =n-1. 


F16: [ChHaZh02] For every three positive integers r, d, and k > 2 with r < d < 2r, 
there exists a connected graph G with rad(G) = r, diam(G) = d and g(G) =k. 


F17: [ChHaZh02] A nontrivial graph F is a minimum geodetic subgraph if and only if 
every vertex of F' has eccentricity 1 or no vertex of F' has eccentricity 1. 


F18: [ChHaZh02] A nontrivial graph F' is a minimum geodetic subgraph of a connected 
graph G if and only if F is the periphery of some connected graph H. 


Convex Sets and Hull Sets 


There are sets S of vertices in a connected graph with the property that every geodesic 
connecting two vertices of S contains only vertices of S. 


DEFINITIONS 


D13: A set S of vertices of a connected graph G is convex if I[S] = S and the convex 
hull [S] is the smallest convex set containing S. 


D14: For a set S of vertices in a connected graph G, let [°[S] = S$, I’[S] = JS], 
and I*[S] = I[I*~1[S]] for k > 2. From some term on, this sequence is constant. The 
smallest nonnegative integer p for which I?[S] = I?*1[S] is the geodetic iteration 
number gin(S). The set I?[S] is, in fact, the convex hull [5] of S. The geodetic 
iteration number of G, denoted by gin(G), is given by 


in(G) = in(S')}. 
GN eee) 
D15: Let S be a set of vertices of a connected graph G. If [S] = V(G), then S is 
called a hull set in G. A hull set of minimum cardinality is a minimum hull set. 
The cardinality of a minimum hull set in G is called the hull number h(G). 
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D16: A graph F isa minimum hull subgraph if there exists a graph G containing 
F as an induced subgraph such that V(F’) is a minimum hull set for G. 


EXAMPLES 


E7: In the graph G of Figure 9.1.7, let S; = {u,v,w} and S = {u,v,w,x}. Since 
[Si] = So AS; and [So] = Sb, it follows that $1 is not a convex set in G; while So is a 
convex set in G. Furthermore, 52 is the convex hull of Sj. 


u 


Si ={u, v,w} S2={u, v,w, x} 
Figure 9.1.7: Convex sets in a graph. 
E8: In the graph G of Figure 9.1.8, let S = {s,t,y}. Since I[S] = {s,t,u,v,w, ax, y} 


and I?[S] = V(G), it follows that S is a hull set of G. In fact, S is a minimum hull set 
and so h(G) = 3. Furthermore, gin(S) = 2. 


Figure 9.1.8: A minimum hull set in a graph. 


REMARK 


R3: Every geodetic set in a connected graph G is a hull set of G. The converse is not 
true in general. Thus h(G) < g(G) for every connected graph G. 


FACTS 


F19: [HaNi81] Let n be the minimum order of a graph G for which gin(G) = k. Then 
n=1lifk=0,n=3ifk=landn=k+3ifk>2. 


F20: [EvSe85] If G is a connected graph of order n > 2 and diameter d, then h(G) < 
n—-d+l. 


F21: [EvSe85] If G is a k-connected graph of order n and diameter d, then h(G) < 
n—k|d/2|. 


F22: [ChHaZh00] For every pair a, b of integers with 2 < a < b, there exists a connected 
graph G such that h(G) = a and g(G) = b. 
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F23: [ChHaZh00] For every nontrivial connected graph G, h(G) = h(G x Ko). 
F24: [ChHaZh00] A connected graph G of order n > 3 has hull number n — 1 if and 
only if G = (K,, U Ky, U-::U Ky.) + Ky where r (> 2), 1, n2,-++ nM, are positive 


integers with nj +ng+-::-+n,=n-1. 


F25: [ChHaZh00] A nontrivial graph F is a minimum hull subgraph of some connected 
graph if and only if every component of F' is complete. 


9.1.3. Total Distance and Medians of Graphs 


Total Distance of a Vertex 


DEFINITION 


D17: The total distance td(u) (or distance or status) of a vertex u in a connected 
graph G is defined by 


td(u)= S> d(u,v). 


veEV(G) 
EXAMPLE 


E9: Each vertex in the graph G of Figure 9.1.9 is labeled with its distance from the 
vertex u. Thus, the total distance of u is 


td(u)= SY) d(u,v)=0+14+14+24+24+24+344444+54+6+7=37. 
veEV(G) 


Figure 9.1.9: The total distance of a vertex u. 


FACT 


F26: [EnJaSn76] Let G be a connected graph of order n and size m and v € V(G). 
Then 


n—1<td(v) < (n-1)(n+2)/2—m 


and these bounds can be attained for each m with n-1<m< eae 
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The Median of a Connected Graph 


The center of a connected graph is not the only subgraph that’s been used to describe 
the middle of a connected graph. 


DEFINITIONS 


D18: A vertex v in a connected graph G is a median vertex if v has the minimum 
total distance among the vertices of G. 


D19: The median M(G) of a connected graph G is the subgraph of G induced by its 
median vertices. 
EXAMPLE 


E10: Each vertex in the graph G of Figure 9.1.10 is labeled with its total distance. 
Therefore, u and v are the two median vertices of G. The median of G is also shown in 
Figure 9.1.10. 


M(G): oe 
u 


Figure 9.1.10: A graph and its median. 


FACTS 
F27: [S180] Every graph is (isomorphic to) the median of some graph. 


F28: [Tr85] The median of every connected graph G lies in a single block of G. 


Centers and Medians of a Connected Graph 


There is no restriction on the relative locations of the center and median of a connected 
graph. 


DEFINITION 


D20: For two subgraphs F and H in a connected graph G, the distance between F 
and H is 


d(F,H) = min{d(u,v): we V(F),v € V(A)}. 
FACTS 


F29: [Ho89] For every two graphs G; and G» and positive integer k, there exists a 
connected graph G such that C(G) is isomorphic to G;, M(G) is isomorphic to G2, and 
d(C(G), M(G)) =k. 
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F30: [{NoTi91] For every three graphs G;, G2, and G3, where G3 is isomorphic to an 
induced subgraph of both G; and G2, there exists a connected graph G such that C(G) 
is isomorphic to G,, M(G) is isomorphic to Gz, and C(G)M M(G) is isomorphic to G3. 


9.1.4 Steiner Distance in Graphs 


There is a generalization of the distance between two vertices in a connected graph 
G for any set of vertices in G that is an analogue of the Euclidean Steiner Problem 
which asks, for a given set S of points in the plane, the smallest network connecting the 
points of S. 


Steiner Radius and Steiner Diameter 


All of the basic distance parameters can be extended to Steiner distance. 
DEFINITIONS 


D21: For a nonempty set W of vertices in a connected graph G, the Steiner dis- 
tance sd(W) of W is the minimum size of a connected subgraph of G containing W. 
Necessarily, each such subgraph is a tree, called a Steiner tree with respect to W. 
In particular, if W = {u,v}, then sd(W) = d(u,v), the ordinary distance between u 
and v. 


D22: Let G be a connected graph of order n. For an integer k with 1 < k < n, the 
k-eccentricity e,(v) of a vertex v in G is the maximum Steiner distance among all 
k-element sets of vertices of G containing v. 


D23: The minimum k-eccentricity of a connected G' is the k-radius rady(G) of G and 
the maximum k-eccentricity is its k-diameter diam,(G). 
EXAMPLES 


Ell: Let S = {u,v,z} in the graph G of Figure 9.1.11. Here sd(S) = 4. There are 
several trees of size 4 containing S, one of which is the Steiner tree T of Figure 9.1.11. 


4 Xx 


Figure 9.1.11: A graph G and a Steiner tree T. 
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E12: Each vertex in the graph G of Figure 9.1.12 is labeled with its 3-eccentricity so 
that rad3(G) = 4 and diam3(G) = 6. 


Figure 9.1.12: The 3-eccentricities of the vertices of a graph. 


FACTS 


F31: [ChOeTiZo89] Let k and n be integers with 3 < k <n. For every tree T of order 
n, diam,_1(T') = rad; (T). 


F32: [ChOeTiZo89] For every integer k > 3 and every tree T of order at least k, 


diam,(T) < (<5) diam,_1(T). 
This fact is not true for graphs in general. [HeOeSw90] 
F33: [HeOeSw90] If G is a connected graph of order n, then 
(a) diam3(G) < (2) rad3(G) if n > 3. 
(b) diam4(G) < (42) rad4(G) if n > 4. 
F34: [HeOeSw91] For every connected graph G and every integer k > 3, 


k+1 


) diam,_1(G). 


Steiner Centers 


There are a number of centers associated with Steiner distance. 


DEFINITION 


D24: For k > 2, a vertex v in a connected graph G is a k-central vertex if ex(v) = 
rad; (G); the subgraph induced by the k-central vertices of G is the Steiner k-center 
of G. 


FACTS 


F35: [OeTi90] Let k > 2 be an integer. Every graph is (isomorphic to) the Steiner 
k-center of some graph. 


F36: [OeTi90] Let k > 3 be an integer and T a tree. Then T is (isomorphic to) the 
Steiner k-center of some tree if and only if T has at most & — 1 end-vertices. 
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9.1.5 Distance in Digraphs 


There is a natural definition of distance from one vertex to another in digraphs as 
well. 


DEFINITIONS 


D25: Let u and v be vertices in a digraph D. If D contains one or more directed u—v 


> 


paths, then the directed distance d(u,v) is the length of a shortest directed u—v path 
in D. 


D26: A digraph D is strong if D contains both a directed u— v path and a directed 
v—u path for every pair u,v of distinct vertices of D. 


Radius and Diameter in Strong Digraphs 


The definitions of eccentricity, radius, and diameter in a digraph are analogous to those 
in an undirected graph (see Definitions D3 and D4). 


DEFINITION 


D27: The eccentricity e(v) of v ina strong digraph D is the greatest directed distance 
from v to a vertex of D. The minimum eccentricity among the vertices of D is its radius, 
rad(D), and the maximum eccentricity is its diameter, diam(D). 


EXAMPLES 


E13: There are three directed u—v paths in the digraph D of Figure 9.1.13. A shortest 
directed u — v path has length 2 and so d (u,v) = 2. On the other hand, there is no 
directed v — u path in D. In fact, there is no directed x — u path in D for any vertex x 
(~ u) of D since the indegree of u is 0. Therefore, D is not a strong digraph. 


Vv 


Figure 9.1.13: A digraph that is not strong. 
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E14: The vertices of the strong digraph D of Figure 9.1.14 are labeled by their eccen- 
tricities. Observe that rad(D) = 2 and diam(D) = 5. So, in general, it is not true that 
diam(D) < 2rad(D). 


3 2 


Figure 9.1.14: The eccentricities of the vertices of a strong digraph. 


FACT 


F37: [ChJoTi92] For every two positive integers a and b with a < b, there exists a 
strong digraph D with rad(D) = a and diam(D) = b. 


The Center of a Strong Digraph 


DEFINITION 


D28: The center C(D) of a strong digraph D is the subdigraph induced by those 
vertices v with e(v) = rad(D). 


EXAMPLE 


E15: The strong digraph D of Figure 9.1.14 is repeated in Figure 9.1.15, where its 
center is also shown. 


D: 4 4 C(D): 


Figure 9.1.15: The center of a strong digraph. 


FACT 


F38: [ChJoTi92] For every oriented digraph D, there exists a strong oriented digraph 
whose center is (isomorphic to) D. 
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Strong Distance in Strong Digraphs 


There is yet another reasonable way to define distance in strong digraphs, and this 
definition is analogous to Steiner distance in an undirected graph (see Definition D21). 


DEFINITIONS 


D29: For a strong oriented graph D, the strong distance sd(u,v) from u to v is the 
minimum size of a strong subdigraph of D containing u and v. 


D30: The strong eccentricity se(v) of a vertex v in a strong oriented graph D is the 
largest strong distance from v to a vertex in D. 


D31: The minimum strong eccentricity among the vertices of a strong oriented graph 
Dis the strong radius srad(D) of D and the maximum strong eccentricity is its strong 
diameter sdiam(D). 


EXAMPLE 


E16: In the strong digraph D, of Figure 9.1.16, sd(v,w) = 3, sd(u,y) = 4, and 
sd(u,z) = 5. The vertices of the digraph D2 of Figure 9.1.16 are labeled with their 
strong eccentricities. Therefore, srad(D2) = 6 and sdiam(D2) = 10. 


10 10 
Di: u x D>: 6 6 
10 10 


y 10 


Figure 9.1.16: Strong eccentricities, strong radius, and strong diameter. 
FACTS 


F39: |ChErRaZh99-a] Strong distance is a metric on the vertex set of a strong oriented 
graph. Thus srad(D) < sdiam(D) < 2srad(D) for every strong oriented graph D. 


F40: [ChErRaZh99-a] For every pair r, d of integers with 3 < r < d< 2r, there exists 
a strong oriented graph D with srad(D) =r and sdiam(D) = d. 


F41: [ChErRaZh99-a] If D is a strong oriented graph of order n > 3, then sdiam(D) < 
[5(nm — 1)/3] and this bound is sharp. 


Strong Centers of Strong Digraphs 


DEFINITIONS 


D32: A vertex v in a strong oriented graph D is called a strong central vertex if 
se(v) = srad(G), while the subgraph induced by the strong central vertices of D is the 
strong center SC(D) of D. 
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D33: A strong oriented graph D is called strongly self-centered if srad(D) = 
sdiam(D), that is, if D is its own strong center. 


EXAMPLE 


E17: The strong center SC(D) of a digraph D is shown in Figure 9.1.17. 


Figure 9.1.17: The strong center of a digraph. 


FACTS 


F42: [ChErRaZh99-b] Every oriented graph is (isomorphic to) the strong center of 
some strong oriented graph. 


F43: [ChErRaZh99-b] For every integer r > 3, there exist infinitely many strongly 
self-centered oriented graphs of strong radius r. 
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INTRODUCTION 


We consider sets of vertices that “are near” (dominate) all the vertices of a graph. 
The idea of domination is an area of research in graph theory that is experiencing 
significant growth. Its application in design and analysis of communication networks, 
social sciences, optimization, bioinformatics, computational complexity, and algorithm 
design may explain in part the increased interest. 

The books by Haynes, Hedetniemi, and Slater [HaHeS198, HaHeS198b] deal exclu- 
sively with domination in graphs. Recent survey articles on domination in graphs can 
be found in [He96] and [HeLa90]. For a comprehensive bibliography of papers on dom- 
inating sets in graphs, see the reference list compiled in [HaHeS198] that contains over 


1200 entries. 


9.2.1 Dominating Sets in Graphs 


DEFINITIONS 


D1: Aset SCV isa dominating set of a graph G = (V, E) if each vertex in V is in 
S or is adjacent to a vertex in S. 


D2: The domination number 7(G) is the minimum cardinality of a dominating set 
of G. We refer to a minimum dominating set of a graph G as a 7(G)-set. 
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REMARK 


R1: A vertex is said to dominate itself and all its neighbors. 


EXAMPLE 


El: The set S = {2,5,7} is a dominating set in the Petersen graph G shown in 
Figure 9.2.1. Since the Petersen graph is a cubic graph, each vertex dominates four 
vertices. Therefore no set of two vertices dominates all ten vertices in the graph, and 
so the set S is a dominating set of minimum cardinality (a y(G)-set) and y(G) = 3. 


Wes 


4 3 
Figure 9.2.1: The Petersen graph G. 


REMARK 


R2: The open neighborhood of a vertex v, denoted N(v), is the set of vertices which 
are adjacent to v. One can in fact show that the Petersen graph in Figure 9.2.1 has 
exactly ten distinct ~(G)-sets, namely, the ten open neighborhoods N(v) corresponding 
to the ten vertices v in the graph. 


Equivalent Definitions of a Dominating Set 


The domination number appears in many different mathematical contexts or frame- 
works. We mention a few of the equivalent definitions of a dominating set: 


DEFINITIONS 
D3: Vertex Set Covering Problem. 


Set S C V is a dominating set of a graph G if each vertex in V \ S has at least one 
neighbor (is covered by a vertex) in S. 


D4: Set Intersection. 


The closed neighborhood of a vertex v is the set N[v] = N(v) U{v}. Set SCV isa 
dominating set if for every vertex v € V, |N[v] N S| > 1. 


D5: Union of Neighborhoods. 


Set S C V is a dominating set if U Niv] = V. 
ves 
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D6: Dominating Function. 


Let f be the function f : V > {0,1} such that for each v € V, 


y IOSt 


ue N{v] 
The vertices with the value 1 under f form a dominating set. 
D7: Distance from the Set. 


Set S C V is a dominating set if for every vertex v € V \ S, d(v,x) < 1 for some 
vertex 2 € S. 


D8: Integer Programming Problem. 


For a graph G of order n, the closed neighborhood matrix, denoted N, is the n x n 
matrix formed from the adjacency matrix of G by adding 1’s on the diagonal. For a 
set S of G, we let the n-vector Xs = [x;] be the characteristic vector of S, that is, 
x; —1if x; € S and x; = 0, otherwise. Let I, denote the column vector of all 1’s. We 
say that S is a dominating set of G if and only if N- Xg >1,. This leads to the integer 
programming formulation for the domination number 7(G) given by: 


7(G) = min )> Li 
i=1 


subject to N-X >1, 
with a; € {0,1}. 


Applications of Domination 


The applications of domination in a wide variety of fields have surely added to its esca- 
lating popularity. For a sample of its applications, consider communication networks, 
facility and guard location problems, surveillance systems, and coding theory. 


EXAMPLES 


E2: Berge [Be73] mentions the problem of keeping a number of strategic locations 
under surveillance by a set of radar stations. The minimum number of radar stations 
needed to survey all the locations is the domination number of the associated graph. 


E3: Liu [Li68] discusses the application of dominance to communications in a network, 
where a dominating set represents a set of cities which, acting as transmission stations, 
can transmit messages to every city in the network. 


E4: The notion of domination is a standard one in coding theory. If one defines 
a graph whose vertices are the n-dimensional vectors with co-ordinates chosen from 
(1,...,p) and two vertices are adjacent if they differ in one co-ordinate, then sets of 
vectors which are (n, p)-covering sets, single error correcting codes, or perfect covering 
sets are all dominating sets of the graph with certain additional properties. See, for 
example, Kalfleisch, Stanton, and Horton [KaStHo71]. 


E5: A desirable property for a committee from a collection of people might be that every 
nonmember know at least one member of the committee, for ease of communication. A 
committee with this property is a dominating set of the acquaintance graph of the set 
of people. 
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9.2.2 Minimality Conditions 


Notice that if S is a dominating set of a graph G,, then so too is every superset of 
S. However, not every subset of S is necessarily a dominating set. 


DEFINITION 


D9: A minimal dominating set in a graph G is a dominating set that contains no 
dominating set as a proper subset. 


EXAMPLE 


E6: For the Petersen graph G of Figure 9.2.1, the sets S; = N(1) = {2,5,7}, So = 
{1,3,6, 10}, and $3 = {1,2,3, 4,5} are all minimal dominating sets. Hence, the Petersen 
graph contains minimal dominating sets of cardinalities 3, 4, and 5. 


Early work on the topic of domination focused on properties of minimal dominating 
sets. We begin with two classical results of Ore [Or62]. 
FACTS 
F1: (Ore’s Theorem) [Or62] Let D be a dominating set of a graph G = (V, EZ). Then 
D is a minimal dominating set of G if and only if each v € D has at least one of the 


following two properties: [P;]: there exists a vertex w € V\D such that N(w)ND = {v}; 
[P2]: the vertex v is adjacent to no other vertex of D. 


F2: [Or62] If G = (V,E£) is a graph with no isolated vertex and D is a minimal 
dominating set of G, then V \ D is a dominating set of G. 


F3: [BoCo79] If G is a graph with no isolated vertex, then there exists a y(G)-set in 
which every vertex has property P,. 


The Domination Chain 

Here we discuss a domination inequality chain. 

DEFINITION 

D10: A set S of vertices is said to be independent if no two vertices in S are adjacent. 


D1l1: The vertex independence number a(G) is the maximum cardinality of an 
independent set in G. 


D12: The independent domination number i(G) is the minimum cardinality of a 
maximal independent set of G. 


D13: While the domination number y(G) is the smallest cardinality of a minimal 
dominating set in a graph G, the upper domination number I(G) is the maximum 
cardinality of a minimal dominating set in G. 


D14: For any set S CV, the open neighborhood N(S) is defined as UvesN(v) and 
the closed neighborhood N[S] = N(S)US. A set S of vertices is irredundant if for 
every vertex v € S, N[v] \ N[S — {v}] 40. 
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D15: The minimum cardinality of a maximal irredundant set in G' is called the irre- 
dundance number of G, and is denoted ir(G). 


D16: The maximum cardinality of an irredundant set in G is called the upper irre- 
dundance number of G, and is denoted IR(G). 


EXAMPLES 


E7: The tree T in Figure 9.2.2 has maximal independent sets of two sizes: {1, 2,3, 6, 7,8} 
and {1,2,3,5}. Thus, 7(T) = 4 and a(T) = 6. 


Ie p88 
se Ns 


Figure 9.2.2: A tree T with i(T) = 4 and a(T) = 6. 


E8: The tree T in Figure 9.2.3 has maximal irredundant sets of two different sizes: 
{2,3,8,9} and {2,4,6,8,10}. Thus, ir(T) = 4 and IR(T) = 5. Note that for this tree 
T, y(T) =5. 
7 8 9 1011 
6 


123 4 5 
Figure 9.2.3: A tree T with ir(T) = 4 and IR(T) =5. 
FACTS 


F4: [Be62] An independent set is maximal independent if and only if it is independent 
and dominating. 


F5: [Be62] Every maximal independent set in a graph is a minimal dominating set of 
the graph. 


F6: [CoHeMi78] A dominating set is a minimal dominating set if and only if it is 
dominating and irredundant. 


F7: [BoCo79] Every minimal dominating set in a graph is a maximal irredundant set 
of the graph. 


Since every maximal independent set is a dominating set, and every minimal domi- 
nating set is a maximal irredundant set, we have the following inequality chain, which 
was first observed by Cockayne, Hedetniemi, and Miller in 1978. 

F8: [CoHeMi78] For every graph G, 


ir (G) < ¥(G) < i(G) < a(G) < T(G) < IR(G). 


F9: [CoFaPaTh81] If G is a bipartite graph, then a(G) =I'(G) = IR(G). 
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REMARK 


R3: The inequality chain, known as the domination chain, in Fact F8 has become 
one of the strongest focal points for research in domination theory; approximately 100 
research papers have been published on various aspects of this sequence of inequalities. 
For example, Cockayne, Favaron, Mynhardt, and Puech [CoFaMyPu00] characterized 
trees T with y(T) = i(T) in terms of the sets of vertices of T which are contained in 
all its minimum dominating and minimum independent dominating sets. These sets 
were characterized by Mynhardt [My99], who used a tree pruning procedure. A simple 
constructive characterization of such trees is given in [DoGoHeMy06]. 


9.2.3. Domination Perfect Graphs 


Motivated by the concept of perfect graphs in the chromatic sense, Sumner and 
Moore [SuMo79] defined a graph to be domination perfect as follows. 


DEFINITION 


D17: A graph G is domination perfect if y(H) = i(H) for every induced subgraph 
Hf of G. 


Building on several other results, including those by Zverovich and Zverovich [ZvZv91], 
Fulman [Fu93], and Topp and Volkmann [ToVo91], Zverovich and Zverovich [ZvZv95| 
finally provided a forbidden induced subgraph characterization of domination perfect 
graphs in terms of seventeen forbidden induced subgraphs. 


FACT 


F10: [ZvZv95] A graph is domination perfect if and only if it does not contain one of 
seventeen graphs G1—G'17 shown in Figure 9.2.4 as an induced subgraph. 


Gi3 
Figure 9.2.4: Minimal domination imperfect graphs G1—Gj7. 
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DEFINITION 


D18: A graph G is called minimal domination imperfect if G is not domination 
perfect and y(H) = i(H), for every proper induced subgraph H of G. 


REMARK 


R4: The 17 subgraphs presented in the characterization of domination perfect graphs 
given in Fact F10 are all minimal domination imperfect graphs. Thus Fact F10 states 
that there are exactly 17 minimal domination imperfect graphs. 


9.2.4 Bounds on the Domination Number 


Since determining whether a graph has domination number at most k is NP-complete 
(see [GaJo79] and Chapter 1 of [HaHeS198]), it is of interest to find bounds for this 
parameter. 


REMARK 


R5: Obviously if G is a graph of order n, then 1 < 7(G) < n. Equality of the lower 
bound is attained if and only if A(G) = n — 1, and equality holds for the upper bound 


if and only if A(G) =0, i.e, G= Kp. 


Bounds in Terms of Order and Minimum Degree 


Restricting ourselves to graphs without isolated vertices, we have the following general 
upper bound on the domination number of a graph. 


FACT 


F11: [BuHeTul2] Let G be a graph of order n and size m having no isolated vertices. 
Then, the bound y(G) < an+ bm is valid if and only if both 2a+b > 1 and b > 0 hold. 


REMARK 
R6: Taking (a,b) = (5, 0) in Fact F11, we have that the upper bound on the domination 
number can be improved from its order to one-half its order if we impose the condition 


that the graph is isolate-free. This result was first observed by Ore [Or62] and is a 
consequence of Facts F2 and F3. 


FACT 


F12: [Or62] If G is a graph of order n with no isolated vertex, then y(G) < n/2. 
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DEFINITION 


D19: The corona of two graphs G; and G2 is the graph G = G, o Gg formed from 
one copy of G; and |V(G)| copies of Gz where the ith vertex of G, is adjacent to every 
vertex in the ith copy of Gz. The corona H o Kj, in particular, is the graph constructed 
from a copy of H and for each vertex v € V(H), a new vertex v’ and the pendant edge 
vu’ are added. 


FACT 


F13:  [FiJaKiRo85, PaXu82] If G is a graph of order n with no isolated vertex, then 
9(G) = n/2 if and only if the components of G are the cycle C, or the corona H 0 ky 
for any connected graph H. 


REMARK 


R7: The graphs G for which y(G) = |n/2| were characterized independently in [Ba- 
CoHaHeSh00] and [RaVo98]. 


Minimum Degree Two 


If we restrict the minimum degree 6(G) of G to be at least two, then the upper bound in 
Fact F12 due to Ore on the domination number can be improved from one-half its order 
to two-fifths its order except for seven exceptional graphs (one of order four and six of 
order seven). More precisely, McCuaig and Shepherd [McSh89] defined a collection 6 
of “bad” graphs, shown in Figure 9.2.5. 


a> Oa OEP, 
D> LT xe OO 


Figure 9.2.5: The family B of “bad” graphs. 


FACTS 


F14: [McSh89] If G is a connected graph of order n with 6(G) > 2 and G ¢ B, where 
B is the collection of seven graphs shown in Figure 9.2.5, then y(G) < 2n/5. 


F15: [McSh89] If G is a connected graph of order n > 8 with 6(G) > 2, then 7(G) < 
2n/5. 


DEFINITIONS 


D20: A key L4) is the graph obtained from a 4-cycle Cy by adding a pendant edge 
to one of its vertices. We define a unit to be a graph that is isomorphic to a 5-cycle C5 
or to a key L41. We call a unit a cycle unit or a key unit according to whether it 
is a cycle or a key, respectively. In a cycle unit, we select two vertices at distance two 
apart in the unit, and we call these two vertices the link vertices of the unit; in a key 
unit we call the vertex of degree one the link vertex of the unit. 
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D21: To show that the bound of Fact F15 is sharp, McCuaig and Shepherd [McSh89] 
introduced a family F of graphs constructed as follows. Let F denote the family of 
all graphs G that are obtained from the disjoint union of at least two units, each of 
which is a cycle unit or a key unit, by adding edges in such a way that G is connected 
and every added edge joins two link vertices. 


EXAMPLE 


E9: A graph in the family F with three cycle units and two key units is shown in 
Figure 9.2.6 with the link vertices indicated by the darkened vertices. 


Figure 9.2.6: A graph in the family F. 


FACT 
F16:  [McSh89] If G € F has order n, then G is a connected graph with 6(G) > 2 
satisfying y(G) = 2n/5. 


Minimum Degree Three 


If we restrict the minimum degree to be at least three, then the upper bound in Fact 
F15 due to McCuaig and Shepherd on the domination number can be improved from 
two-fifths its order to three-eighths its order. 


FACT 


F17: [Re96] If G is a graph of order n and 6(G) > 3, then 7(G) < 3n/8. 


EXAMPLE 


E10: The two non-planar cubic graphs of order n = 8 (shown in Figure 9.2.7) both 
have domination number 3 = 3n/8. We remark that the graph Mg in Figure 9.2.7(a) is 
called the Mobius ladder on eight vertices. 


(a) Ms (b) Fs 
Figure 9.2.7: The two non-planar cubic graphs of order eight. 


Section 9.2. Domination in Graphs 1089 


REMARK 


R8: The ratio 3/8 in Reed’s Fact F17 is best possible. Gamble (see [McSh89], [Re96]) 
gave infinitely many connected graphs of minimum degree at least three with domination 
number exactly three-eighths their order. 


EXAMPLE 


E11: Let H’ be any connected graph. For each vertex v of H’, add a (disjoint) copy 
of the cubic graph Fg shown in Figure 9.2.7(b) and identify any one of its vertices that 
is in a triangle with v. Let H denote the resulting graph, and let H denote the family 
of all such graphs H. When H’ = P3, the resulting graph H is shown in Figure 9.2.8. 


Figure 9.2.8: A graph H € H. 


REMARK 


R9: Reed’s Fact F17 has been generalized in several ways. 


FACTS 


F18: [ShSoYuHe09] If G is a graph of order n and 6(G) > 2, then y(G) < (3n+|Va|)/8, 
where V2 denotes the set of vertices of degree 2 in G. 


F19: [LoRa08] If G is a graph of order n and 6(G) > 2 with girth g > 5, then 
y¥<(g+H)n 
DEFINITIONS 


D22: A vertex x is called a bad-cut-vertex of G if G— x contains a component, Cz, 
which is an induced 4-cycle and « is adjacent to at least one but at most three vertices 
on C,. We let bc(G) denote the number of bad-cut-vertices in G. 


D23: A cycle C is called a special-cycle if C is a 5-cycle in G such that if wu and v 
are consecutive vertices on C’, then at least one of u and v has degree 2 in G. We let 
sc(G) denote the maximum number of vertex disjoint special-cycles in G that contain 
no bad-cut-vertices. 


D24: A graph is (C4, Cs)-free if it has no induced 4-cycle or 5-cycle. 


FACTS 


F20: [HeScYel1] If G is a connected graph of order n > 14 with 6(G) > 2, then 
1(G) < $(3n + se(G) + be(G)). 


F21: [HeScYel1] If G is a connected graph of order n > 14 with 6(G) > 2 that contains 
no special cycle and no bad-cut-vertex, then y(G) < 3n/8. 
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F22: [HeScYel1] If G is a (C4, Cs)-free connected graph of order n > 14 with 6(G) > 2, 
then y(G) < 3n/8. 


F23: [HeScYel1] If G is a 2-connected graph of order n > 14 and dg(u) + dg(v) > 5 
for every two adjacent vertices u and v, then 7(G) < 3n/8. 


REMARKS 


R10: Fact F23 can be restated as follows: If G is a 2-connected graph of order n > 14 
such that the set of degree-2 vertices in G form an independent set, then 7(G) < 3n/8. 


R11: The ratio 3/8 in Fact F22 is best possible. Henning, Schiermeyer, and Yeo 
[HeScYel1] gave infinitely many (C4, Cs)-free connected graphs of minimum degree two 
with domination number exactly three-eighths their order. 


R12: That the bound of Fact F23 is sharp may be seen as follows. Let k > 2 be an 
integer, and let H be the family of all graphs that can be obtained from a 2-connected 
graph F of order 2k that contains a perfect matching M as follows. Replace each edge 
e = uv in the matching M by an 8-cycle uavbcdefu with two added edges, namely, 
be and cf. Let H denote the resulting 2-connected graph of order n = 8k. Then, 
7(H) = 3k = 3n/8 and the set of degree-2 vertices in H form an independent set. 


EXAMPLE 


E12: A graph in the family H with k& = 4 that is obtained from an 8-cycle F' is shown 
in Figure 9.2.9. 


~ 


Figure 9.2.9: A graph in the family H. 


Cubic Graphs 


REMARK 


R13: As a special case of Fact F17, every connected cubic graph G on n vertices has 
domination number 7(G) < 3n/8. Kostochka and Stodolsky [KoSt09] proved that the 
two non-planar cubic graphs Fg and Gg shown in Figure 9.2.7 are the only connected 
cubic graphs that achieve the three-eighths bound in Fact F17. 


DEFINITION 


D25: Let G”,.. denote the family of all connected cubic graphs of order n. 


cubic 
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FACT 


F24: [KoSt09] If GEG and n > 10, then 7(G) < 4n/11. 


eaie 
REMARK 


R14: Reed [Re96] conjectured that the upper bound in Fact F17 can be improved to 
(G) < [n/3| UGG Goubic* 


FACTS 


F25: Kostochka and Stodolsky [KoSt09] disproved Reed’s conjecture by constructing 
a connected cubic graph G on 60 vertices with y(G) = 21 and presented a sequence 
{Gx }f2, of connected cubic graphs with 


ge GR SO sd 
1 a 
kre |V(G)| — 23 3 69 


F26: Kelmans [Ke06] constructed a smaller (with 54 vertices) counter-example to 
Reed’s conjecture and an infinite series of 2-connected cubic graphs Hy, with 


i (Ar) ne 1 1 


im . 
k>00 |V(H;)| ~ 3 60 


F27: [Ke06], [KoSt09] The following holds. 


1 1 G 4 
0.35=-+—< sup ( lim UG) < 4 . 9.363636. 
3 60 ~ Gegn,,, \n>% on 11 


More Bounds Involving Minimum Degree 


FACTS 
F28: [SoXu09] If G is a graph of order n and 6(G) > 4, then y(G) < 4n/11. 
F29: [XiSuCh06] If G is a graph of order n and 6(G) > 5, then 7(G) < 5n/14. 


F30: [CaRo85, CaRo90] For any graph G of order n and minimum degree 6, 


Oren c = Ga) . 


F31: [AlSp92, Ar74, Pa75] For any graph G of order n and minimum degree 0, 


V(G) <n (AS) ; 


F32: [Ar74, Pa75] For a graph G of order n and minimum degree 4, 


[ee 


5 6+1 
tone ; 
19) s Fy j 


F33:  [We81] Let & = |log,n — 2 log,(log, n) + log,(log, e)|. Then for almost every 
graph G of order n,k +1<7(G) <k+4+2. 


w& 
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Bounds in Terms of Size and Degree 


FACTS 

F34: If G is a connected graph of size m, then 7(G) < (m+ 1)/2. 

F35: [Sa97] If G is a connected graph of size m with 6(G) > 2, then 7(G) < (m+2)/3 
with equality if and only if G is a cycle of length n where n = 1(mod3). 

DEFINITION 

D26: A graph G with m edges is called an 4 -graph if 6(G) > 2, G is connected, and 
7(G) > m/3. 


EXAMPLE 


m 


E13: An example of an %-graph is shown in Figure 9.2.10. 


eal 


Figure 9.2.10: A graph G of size m with 7(G) = (m+ 1)/ 


REMARK 


R15: A characterization of '-graphs can be found in [He99]. 
Bounds in Terms of Order and Maximum Degree 


FACTS 
F36: [Be73, WaAcSa79] For any graph G of order n with maximum degree A, 


tals V(G)<n-A. 


F37: [FIVo90] For any graph G of order n with no isolated vertex, minimum degree 6, 


and maximum degree A, 
1 A 
< ; 
@) <3 (n+1-6-05 


The following fact is an immediate consequence of Fact F37. 


F38: [Pa75] For a graph G of order n with no isolated vertex and minimum degree 6, 


1G) <5 (n+2-8). 
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Bounds in Terms of Order and Size 


Vizing [Vi65] bounded the size of a graph having a given order and domination num- 
ber. It follows that if a graph of a given order has sufficiently many edges, then it is 
guaranteed to have a dominating set of some specified order. 


DEFINITION 


D27: A minimum edge cover in a graph is a minimum number of edges required 
to cover all the vertices of the graph. 


FACT 


F39:  [Vi65] If G is a graph of order n and size m with domination number 7 > 2, then 
m < |3(n—7)(n—7+2)|. Furthermore, the maximum size is attained by taking the 
complete graph on n — y + 2 vertices and removing a minimum edge cover and then 
adding y — 2 isolated vertices. 


REMARK 

R16: The graphs constructed in Fact F39 also achieve the upper bound of Fact F36; 
that is, such a graph G has A(G) = n— 7(G). 

FACTS 


F40: [Sa91] If G is a graph of order n and size m with domination number y > 2 and 
with A(G) <n-—y-1, then m < $(n— 7) (n-y+1). 


F41: [Be62,Vi65] For any graph G of order n and size m, 
n—-m<7(G)<n+1—V1+ 2m. 


Furthermore, 7(G) = n — m if and only if every component of G is a star. 
Bounds in Terms of Packing 


DEFINITIONS 


D28: Aset SCV is called a packing in a graph G = (V, E) if the vertices in S are 
pairwise at distance at least 3 apart in G, ie., if u,v € S, then dg(u, v) > 3. 


D29: The packing number p(G) of G is the maximum cardinality of a packing in G. 


REMARK 


R17: Note that if S is a packing in G, then for each pair of vertices u,v € S$, N[u] 
N{v] =. Hence, the packing number provides a lower bound on 7(G). 


FACTS 
F42: For any graph G, p(G) < 7(G). 


F43: [MeMo75] For a tree T, p(T) = 7(T). 
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Bounds in Terms of Radius 


FACT 


F44:  [DePeWal0] If G is a connected graph on at least two vertices with radius r, 
then y(G) > 2r/3. 


REMARK 


R18: Equality holds in the bound of Fact F44 for cycles with orders congruent to 
0 modulo 6. 


9.2.5 Nordhaus—Gaddum-Type Results 


The original paper by Nordhaus and Gaddum [NoGa56] in 1956 gave sharp bounds 
on the sum and product of the chromatic numbers of a graph and its complement. Since 
then such results have been given for several parameters. Fact F45 below is the first 
such result for domination. 


FACT 


F45: [JaPa72] For any graph G of order n > 2, 
(a) 3< 7(G)+7(G) <n+1, and 
(b) 2< y(G)7(G) <n. 


DEFINITIONS 


D30: For a pair of graphs G and H, the cartesian product GOH of graphs G and 
HZ is the graph with vertex set V(G) x V(H) and where two vertices are adjacent if and 
only if they are equal in one coordinate and adjacent in the other. 


D31: Let Gi 6 Ge 6 G3 denote an edge-disjoint factoring of the complete graph. 


FACTS 


F46:  [PaXu82] Let G be a graph of order n > 2. Then, y(G)y(G) = n if and only if 
G is one or the complement of one of the following graphs: K,,, disjoint union of cycles 
of length 4 and the corona H o Ky for any graph H, and K30 Ks3. 


F47: — [JoAr95] If G is a graph of order n > 2 such that G and G have no isolated 


vertices, then 7(G) + 7(G) < (n+ 4)/2. 


F48: [GoHeSw92] Let G; 6 G2 @ G3 = K,,. Then, 
(a) (Gi) + (G2) + ¥(G3) < 2n + 1, and 
(b) the maximum value of the product 7(G1)y(G2)y(G3) is n3/27 + O(n?). 


REMARK 


R19: From Fact F48, there exist constants c; and cy such that the maximum triple 
product always lies between n?/27 + cin? and n?/27 + cn’. 
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9.2.6 Domination in Planar Graphs 


The Dominating Set decision problem remains NP-hard even when restricted to 
planar graphs of maximum degree 3 (see [GaJo79]). Hence it is of interest to determine 
upper bounds on the domination number of a planar graph. 


REMARK 


R20: A tree of radius 2 and diameter 4 can have arbitrarily large domination number. 
So the interesting question is what happens when the diameter of a planar graph is 2 
or 3. Bounding the diameter of a planar graph is a reasonable restriction to impose 
because planar graphs with small diameter are often important in applications (see 
[FeHeSe95]). 

FACT 

F49: [MaSe96] The domination number of a planar graph of diameter 2 is bounded 
above by 3. 

EXAMPLE 


E14: The graph of Figure 9.2.11, constructed by MacGillivray and Seyffarth [MaSe96], 
shows that the bound in Fact F49 is achievable. 


Figure 9.2.11: A planar graph of diameter 2 and domination number 3. 


FACTS 


F50: [GoHe02] Every planar graph of diameter 2 has domination number at most 2 
except for the graph of Figure 9.2.11, which has domination number 3. 


F51: [DoGoHe06] Every planar graph of diameter 3 and of radius 2 has domination 
number at most 5. 


F52: [DoGoHe06] Every planar graph of diameter 3 has domination number at most 9. 


F53: [DoGoHe06] Every sufficiently large planar graph of diameter 3 has domination 
number at most 6, and this bound is sharp. 


F54: [GoHe02] For each orientable surface, there are finitely many graphs with diam- 
eter 2 and domination number more than 2. 


F55: [GoHe02] For each orientable surface, there is a maximum domination number 
of graphs with diameter 3. 
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EXAMPLE 


E15: The sharpness of the bound in Fact F53 is shown by the graph of Figure 9.2.12, 
which can be made arbitrarily large by duplicating any of the vertices of degree 2. 
Furthermore, by adding edges joining vertices of degree 2, it is possible to construct 
such a planar graph with minimum degree equal to 3. 


Figure 9.2.12: A planar graph with diameter 3 and domination number 6. 


9.2.7 Vizing’s Conjecture 


One of the oldest unsolved problems in domination theory involves graph products. 
In 1963 Vizing [Vi63] suggested the problem of determining a lower bound on the 
domination number of a product graph in terms of the domination numbers of its 
factors G and H. Five years later he offered it as a conjecture, which remains one of 
the most famous open problems involving domination. 


CONJECTURE 
Vizing’s Conjecture [Vi68]: For any graphs G and H, y(GO A) > 7(G)7(#). 


DEFINITION 


D32: A graph G is said to be decomposable if y(G) = k and V(G) can be partitioned 
into k subsets C), C2,...,C, such that each of the induced subgraphs G[C;] is a complete 
subgraph of G. 


FACTS 


F56: [BaGe79] If F is a spanning subgraph of a decomposable graph G where y(F') = 
7(G), then for any graph H, y(F OFA) > 7(F)y(#). 


Clark and Suen [ClSu00] provided the following general result. 


F57: [ClSu00] For any graphs G and H, 7(GO A) > $4(G)7(#). 
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The best general result to date is due to Suen and Tarr [SuTal0,Tal0]. 


F58: {SuTal0,Tal0] For any graphs G and H, 


(GOH) > 59(G)y(H) + 5 min{o(@), 1H}. 


F59: [BrDoGoHaHeK]Ral] Let G be a claw-free graph. Then for any graph H without 
isolated vertices, y(GO H) > $7(G)(7(A) + 1). 


F60: [AhSz09] If G is a chordal graph, then for any graph H, y(GO H) > 7(G)y7(#). 


REMARKS 


R21: Since the family of graphs in Fact F56 includes trees and cycles, Vizing’s 
Conjecture is true if one of G and H is a tree (independently proved by Jacobson and 
Kinch [JaKi86]) or a cycle (independently proved in [ElPa91]). 


R22: For surveys on graph products and Vizing’s conjecture, see [HaRa91], [HaRa95}, 
[BrDoGoHaHek]Ral], and Hartnell and Rall’s Chapter 7 of [HaHeS198b]. 


9.2.8 Domination Critical Graphs 


In this section we consider graphs that are critical with respect to their domina- 
tion number. Criticality may be defined for different graph modifications, including 
vertex deletion and edge addition or deletion. For example, Brigham, Chinn, and Dut- 
ton [BrChDu88] studied graphs that are vertex domination critical in the sense that 
their domination number decreases when any vertex is deleted. Also, a graph may be 
critical in the sense that its domination number increases when any edge is deleted, 
as studied in [BaHaNiSu83] and [WaAc79]. However, the most attention has probably 
been directed to those graphs that are edge addition critical, that is, their domination 
number decreases when any missing edge is added (see [SuB183}]). 


REMARK 


R23: The addition of an edge to a graph cannot increase its domination number and 
can decrease it by at most one. 


DEFINITION 
D33: A graph G is domination critical if for every edge e ¢ E(G), y(G+e) = 


4(G) — 1. If G is a domination critical graph with y(G) = k, then the graph G is said 
to be k-critical. Thus G is k-critical if y(G) = k and 7(G +e) =k —1 for each edge 


e¢ E(G). 


FACTS 
F61: The l-critical graphs are (vacuously) K,, for n > 1. 


F62: [SuB183] A graph G is 2-critical if and only if each component of G is a star. 
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F63: [Su90] A disconnected graph G is 3-critical if and only if G = AU B where either 
A is trivial and B is any 2-critical graph or A is complete and B is a complete graph 
minus a 1-factor. 


F64: [SuBl83] The diameter of a connected 3-critical graph is at most 3. 
F65: [SuBl83] Every connected 3-critical graph of even order has a 1-factor. 


F66: [W090] Every connected 3-critical graph on more than six vertices has a hamil- 
tonian path. 


REMARKS 


R24: To date only the 1-critical and 2-critical have been characterized. For k > 2, the 
structure of the k-critical graphs is more complex. Most of the known results concentrate 
primarily on the concept of 3-critical graphs. 


R25: It remains an open problem to determine whether every connected 3-critical 
graph on more than six vertices is hamiltonian. 


R26: For a survey of edge domination critical graph results, see Sumner’s Chapter 16 
in [HaHeS198b]. 


R27: Graphs for which the domination number remains unchanged when a vertex 
is deleted, or an edge is deleted or added have also been studied. For a survey, see 
Chapter 5 of [HaHeS198]. Note that six classes of graphs result from the effect these 
three graph modifications have on the domination number, that is, the changing or 
unchanging of the domination number. For example, the k-critical graphs form a class 
for which the domination number changes when an arbitrary edge is added, and a second 
class for this graph modification consists of the graphs where the domination number 
remains the same upon the addition of an arbitrary edge. Relationships among these 
six classes of graphs are investigated in [HaHe03]. 


9.2.9 Domination Parameters 


As noted in [HaHa95], domination related parameters can be defined by combin- 
ing domination with another graph theoretical property. We consider the parameters 
defined by imposing an additional constraint on the dominating set. 


DEFINITION 
D34: For a given graph property P, the conditional domination number 7(G : P) 


is the smallest cardinality of a dominating set S C V such that the induced subgraph 
G([S] satisfies property P. 


EXAMPLE 
E16: Examples of conditional domination parameters. 


P1. G{[S] is an independent set (independent domination [CoHe77]). 
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P2; has no isolated vertices (total domination [CoDaHe80]). 


P3. is connected (connected domination [SaWa79]}). 


Pd. 


Gis 
Gis 

PA. G[S] is a complete graph (clique domination [Ke85],[KeCo88}). 
G[S] has a perfect matching (paired-domination [HaS198}). 
Gis 


P6. 


has a hamiltonian cycle (cycle domination [LeWi77]). 


REMARKS 


R28: By definition, y(G) < 7(G : P) for any property P. With the exception of 
independent domination, these conditional domination parameters do not necessarily 
exist for all graphs. However, graphs with no isolated vertices have total and paired- 
dominating sets. 


R29: In many cases the additional constraint on the dominating set is application 
driven. For example, as first suggested by Wu and Li [WuLi99], a virtual backbone in 
a wireless network can be represented by a connected dominating set in its underlying 
graph model. 


R30: The generic nature of this formalization provides a method for defining new 
invariants by considering different properties P. 


R31: Also, domination parameters have been defined by changing the method of 
dominating. For example, requiring that each vertex outside the dominating set has at 
least & neighbors in the dominating set is k-domination [FiJa85]. 
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INTRODUCTION 


The study of tolerance graphs arose from a natural extension of ideas that have 
grown out of the now classical notion of interval graph [Be73, Be85, Go80, LeBo62]. In 
an interval graph, each vertex is associated with an interval on the real line with two 
vertices adjacent if and only if their associated intervals intersect. In 1982, Golumbic 
and Monma [GoMo82] suggested associating “tolerances” with each interval so that now 
two vertices are joined iff the length of the intersection of their associated intervals is 
at least the minimum of the two tolerances. 

The first journal paper on tolerance graphs [GoMoTr84] set the stage for two decades 
of further research on global themes which focus on special families of tolerance graphs 
and their properties, directed graph versions, generalizations beyond intervals and re- 
stricted models. All these involve the notion of measured intersection known as toler- 
ance, culminating in the book by Golumbic and Trenk [GoTr04]. 

In this survey, we will explore several of these themes. After introducing intersection 
graphs in Section 9.3.1, we will move on to the original class of (interval) tolerance 
graphs and their variations in Section 9.3.2. We will then describe several tolerance 
graph models on trees in Section 9.3.6, and finally present a more general framework 
called rank-tolerance [GoJa06] in Section 9.3.5. 


9.3.1 Intersection Graphs and Their Applications 


We begin with some basic definitions and results. 
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DEFINITION 


D1: Let C = {5},...,5;,} be a collection of sets. The intersection graph of C is the 
graph G obtained by assigning a distinct vertex vu; to each set S; in C and joining two 
vertices by an edge precisely when their corresponding sets have a nonempty intersection, 
ie., ujv; € E(G) if and only ifi 4 j and S$; S; #0. 


FACT 


F1: Every graph is an intersection graph. (Marczewski’s Theorem [Ma45], see also 
[ErGoPo66, Ha94]) 


When the type of sets allowed in C is limited, interesting structured classes of graphs 
may result. Let F be a family of sets of a specified type, for example, intervals on a line, 
paths in a tree, circles in the plane, live areas of variables in a program flow diagram, 
senate subcommittee members, ideals in a commutative ring, patterns of consecutive 
elements in a DNA string, etc. 


DEFINITION 


D2: A graph G is an F-intersection graph if there exists an intersection represen- 
tation C = {S1,...,Sn} of G where each S; is in the family F. 


Among the well known classes of intersection graphs are the interval graphs, trape- 
zoid graphs, circular-arc graphs, parallelogram graphs, chordal graphs (subtrees of a 
tree), permutation graphs, string graphs, segment graphs, and many others. The com- 
plements of comparability graphs (cocomparability graphs) also have an intersection 
graph model, known as function diagrams, as shown by Golumbic, Rotem and Urrutia 
[GoRoUr83]. Later, in Section 9.3.6, we will describe several types of intersection graphs 
on trees. 

Most of the early uses of the intersection graph model have been described in the 
classical books [Be73, Go80, Ro76] and in some more recent books [BrLeSp99, MaPe95, 
McMCc99, Sp03]. Yet, the scope of research in this general area has expanded significantly 
both from the modeling and algorithmic points of view. 

Intersection graphs have become a necessary and important tool for solving real- 
world problems mathematically and algorithmically. Some of these applications in- 
clude mobile frequency assignment [OsRo81, OsRo83], pavement deterioration anal- 
ysis [GaNe81], relational databases [Go88], evolutionary trees [Wa95], physical map- 
ping of DNA [GoGoKaSh95, GoKaSh94], container ship stowage [AvPeSh00], VLSI 
circuit design [DaGoPi88] and temporal reasoning notions from artificial intelligence 
[GoSh93, Go98, Gol2]. 


Containment Graphs 


Besides the intersection graph model, researchers have also considered a graph model 
based on the relation of containment. 


DEFINITION 
D3: The containment graph G = (V, EF) of a collection C = {S;} of distinct subsets 


of a set S has vertex set V = {1,...,n} and edge set H = {ij| either S; C S$; or 
5S; C S;}. A graph with such a representation is called a containment graph. 
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FACTS 


F2: The class of containment graphs is equivalent to the class of comparability graphs 
or to the transitively orientable (TRO) graphs. 


Golumbic and Scheinerman [GoSc89] observed the following variation on this fact. 


F3: Every comparability graph can be represented as the containment graph of a 
collection of subtrees (substars) of a star. 


Dushnik and Miller [DuMi41] characterized the containment graphs of intervals on 
the line as precisely those having partial order dimension 2; these are also equivalent 
to the permutation graphs, see [Go80, Ro76, Sp03]. Generalizing interval containment, 
Golumbic and Scheinerman [GoSc89] also showed the following. 


F4: A graph G is the containment graph of rectilinear boxes! in d-space if and only 
if the partial order dimension of G is at most 2d. 


Using a different model, Golumbic, Rotem and Urrutia [GoRoUr83] have shown that 
the partial order dimension of G equals k if and only if the cocomparability graph G 
can be realized as the concatenation of k — 1 permutation diagrams.? 


9.3.2 The Classical Model of Interval Tolerance 


Tolerance graphs were introduced by Golumbic and Monma [GoMo82] and Golumbic, 
Monma and Trotter [GoMoTr84] to generalize some of the applications associated with 
interval graphs. Their motivation was the need to solve scheduling problems in which 
resources may be needed on an exclusive basis, such as rooms, vehicles, or support per- 
sonnel, but where a measure of flexibility or tolerance would be allowed for sharing or 
relinquishing the resource when total exclusivity prevented a solution. 


DEFINITIONS 


D4: An undirected graph G = (V, EF) is a tolerance graph if there exists a collection 
T = {Iy}vev of closed intervals on the real line and an assignment of positive numbers 
t = {ty}vev such that 


vw € BE = |L,N1,| > min{ty, tw}. (9.3.1) 


Here |I,,| denotes the length of the interval [,,. The positive number t, is called the 
tolerance of v, and the pair (Z,t) is called an interval tolerance representation of G. 


D5: A tolerance graph is said to be a bounded tolerance graph if it has a tolerance 
representation in which t, < |I,| for all vu € V. 


Tolerance graphs generalize both interval graphs and permutation graphs. 


1Boxes with sides parallel to the axes. 

2A permutation diagram consists of two parallel lines each labeled with a permutation of the numbers 
1,...,n and n line segments connecting the matched pairs of numbers. The intersection graph of the 
segments of a permutation diagram is called a permutation graph. 
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FACTS 


F5: [GoMo82] A graph G is an interval graph if and only if it has a tolerance graph 
representation where all tolerances are equal (to some positive constant). 


F6: [GoMo82] A graph G is a permutation graph if and only if it has a tolerance graph 
representation where the tolerance of every vertex equals the length of its interval. 


F7: If we restrict the tolerances to be 1 or oo, we obtain the class of non-partitioned 
interval probe graphs.? 


F8: [GoMo82] Every bounded tolerance graph is a cocomparability graph. 


F9: [La93] The bounded tolerance graphs are equivalent to the class of parallelogram 
graphs.* 


F10: [GoMoTr84] Tolerance graphs are perfect and are contained in the class of weakly 
chordal graphs. 


DEFINITION 


D6: A tolerance graph is said to be a proper tolerance graph if it has a tolerance 
representation in which no interval properly contains another interval. A tolerance 
graph is said to be a unit tolerance graph if it has a tolerance representation in 
which each interval I, has unit length for all v € V. 


For the case of intersection graphs, the proper interval graphs are equivalent to the 
unit interval graphs [R069]. This is not true for tolerance graphs. 


FACTS 

F11: Every interval graph is a unit tolerance graph, but not conversely. 

F12: Every unit tolerance graph is a proper tolerance graph, but not conversely. 
F13: Every proper tolerance graph is a bounded tolerance graph, but not conversely. 


A complete hierarchy of these and other classes of perfect graphs ordered by inclusion 
can be found in Golumbic and Trenk [GoTr04]. 

In their original paper, Golumbic, Monma and Trotter [GoMoTr84] made the fol- 
lowing conjecture: 


Conjecture: If a graph G is both a tolerance graph and a cocomparability graph, then 
it 1s a bounded tolerance graph. 


This conjecture remains open, and has become a significant challenge to the graph 
theory community. It became even more interesting after Langley [La93] proved that 
bounded tolerance graphs are equivalent to parallelogram graphs, and more recently 
when Mertzios, Sau and Zaks [MeSaZa09] showed a 3-dimensional geometric intersection 
model for tolerance graphs called the parallelepiped representation. 


8G= (V, E) is a non-partitioned interval probe graph if the vertex set can be partitioned V = PUN 
into a set P of probes and N of non-probes such that N is a stable set and there exists a (fill-in) 
FCNxN, where G’ = (V,EU F) is an interval graph (see [McWaZh98, GoLi01] and Chapter 4 of 
[GoTr04]). 

4A parallelogram graph is the intersection graph of a set of parallelograms whose upper and lower 
sides are on a given pair of horizontal lines. 
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9.3.3. The Algorithmics of Tolerance Graphs 


The computational complexity of recognizing tolerance graphs and bounded toler- 
ance graphs had remained open for 28 years. Hayward and Shamir [HaSh04] showed 
that the problem is in NP, and Mertzios, Sau and Zaks [MeSaZa10] proved that it is 
NP-hard, thus obtaining the following: 


FACTS 


F14: [MeSaZal0] Recognizing tolerance graphs and bounded tolerance graphs is NP- 
complete. 


The following related result answers Question 3.11 of [GoTr04]: 
F15: [Buls07] Recognizing bipartite tolerance graphs has linear time complexity. 


Narasimhan and Manber [NaMa92] presented a polynomial time algorithm to find a 
maximum weighted stable set of a tolerance graph, given a tolerance representation for 
the graph. 


F16: [NaMa92] A maximum weighted stable set of a tolerance representation can be 
found in time O(n? log n). 


Coloring bounded tolerance graphs in polynomial time is an immediate consequence 
of their being cocomparability graphs. Narasimhan and Manber [NaMa92] use this fact 
(as a subroutine) to find the chromatic number of any (unbounded) tolerance graph in 
polynomial time, but not the coloring itself. Subsequently, Golumbic and Siani [GoSi02] 
gave a different algorithm to find a coloring of a tolerance graph, given a tolerance 
representation for it (see also Golumbic and Trenk [GoTr04]). 


F17: [GoSi02] Finding a minimum coloring of a tolerance representation with at most 
q intervals having unbounded tolerance can be done in O(qn + nlogn) time. 


9.3.4 Variations of Tolerance on Intervals 


A variety of “variations on the theme of tolerance” in graphs have been defined and 
studied over the past years. By changing the function min in the tolerance definition 
(inequality (9.3.1)) with a different binary function ¢ (for example, max, sum, product, 
etc.), we obtain a class that is called ¢-tolerance graphs. 

By allowing a separate left tolerance and right tolerance for each interval, various 
bitolerance graph and poset models can be obtained. Directed graph analogues to several 
of these models have also been defined and studied (see [GoTr04]). 

By substituting a different “host” set instead of the real line and with a specified 
type for the subsets of that host, instead of intervals on a line, we obtain classes such as 
tolerance graphs of paths on a tree. Bibelnieks and Dearing [BiDe93] considered NeST 
graphs, a tolerance model on neighborhood subtrees of a continuous tree (in the plane), 
studied further in [HakeMa02]. 

By replacing the measure of the length of an interval by some other measure p of 
the intersection of the two subsets (for example, cardinality in the case of discrete sets, 
or number of branching nodes or maximum degree in the case of subtrees of trees), we 
could obtain yet other variations of tolerance graphs. 

We will now survey several of these variations. For further study of tolerance graphs 
and related topics, we refer the reader to Golumbic and Trenk [GoTr04]. 
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DEFINITIONS 


Let @ be a symmetric binary function, positive valued on positive arguments. 


D7: A graph G = (V, E) is a ¢-tolerance graph if there is an interval representation 
T = {Iy}vev with positive tolerances t = {ty},ev such that 


vwe E> |, 01,| > b{ty, tw}. (9.3.2) 
Ds: A ¢-tolerance chain graph is defined to be a ¢-tolerance graph which has a 


representation Z consisting of a nested family of intervals (i.e., a set of intervals totally 
ordered by inclusion). 


FACTS 
The following are due to Jacobson, McMorris and Mulder [JaMcMu91]. 


F18: The min-tolerance chain graphs are equivalent to the class of threshold graphs. 
F19: The maz-tolerance chain graphs are equivalent to the class of interval graphs. 
F20: The sum-tolerance chain graphs are equivalent to the class of coTT graphs. 


We note that ¢-tolerance chain graphs are an earlier special case of rank-tolerance 
graphs, which we will present in Section 9.3.5. 


DEFINITIONS 


D9: An Archimedean function has the property of tending to infinity whenever one 
of its arguments tends to infinity, that is, lim, (x, c) = co for every fixed c > 0. 


D10: A graph G is an Archimedean ¢-tolerance graph, or more simply, an 
Archimedean graph, if G is a ¢-tolerance graph for all Archimedean functions ¢. 


FACTS 


Generalizing a known result of Jacobson, McMorris and Scheinerman [JaMcSc91] 
for trees, Golumbic, Jamison and Trenk [GoJaTr02] prove the following: 


F21: All trees, cacti, chordless suns, and complete bipartite graphs kK, are Archimedean. 
In [GoJaTr02], they also prove the following: 


F22: Every graph G can be represented as a dg-tolerance graph for some Archimedean 
polynomial ¢g. Moreover, there is a “universal” Archimedean function ¢* such that 
every graph G is a ¢*-tolerance graph. 
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Threshold Tolerance and coTT Graphs 


DEFINITIONS 


D11: An undirected graph G = (V, E) is called a Threshold Tolerance (TT) graph 
if its vertices can be assigned positive weights {w, | v € V} and positive tolerances 
{ty |v € V} such that rye FE => w,+ wy, > min{t,, ty}. 


Monma, Reed and Trotter [MoReTr88] introduced this family as a generalization of 
the well-known class of threshold graphs [ChHa77, MaPe95], namely, the special case 
in which all tolerances are equal to some constant (called the threshold). 

The complement of a threshold tolerance graph is called a coTT graph. An alternate 
definition of coTT graphs is: 


D12: A graph G = (V,E) is a complement Threshold Tolerance (coTT) graph 
if for each vertex v there exist two positive numbers a, and b, such that for any pair of 
vertices x and y: cy e E => ay < by and ay < bz. (See also [GoTr04] page 187.) 


FACTS 


F23: Threshold graphs are threshold tolerance graphs, but threshold tolerance graphs 
are not (interval) tolerance. 


F24: coTT graphs are tolerance graphs, and coTT graphs are also strongly chordal. 


Bounded Bitolerance Graphs and Ordered Sets 


The bounded bitolerance graphs generalize bounded tolerance graphs by allowing sep- 
arate left and right tolerances for each interval in the representation. We define them 
below in terms of their complements, which are comparability graphs. 


DEFINITIONS 


D13: An ordered set P = (V, ~) is a bounded bitolerance order if each vertex v € V 
can be assigned a closed interval [, = [L,, Ry] and two “tolerant points” p,,q. € Ly 
satisfying Ly < py and q, < Ry, such that 


rx y => Ry < py and gy < Ly. 


D14: An undirected graph G = (V,E£) is a bounded bitolerance graph if it is the 
incomparability graph of a bounded bitolerance order. 


D15: A trapezoid graph is the intersection graph of a set of trapezoids whose upper 
and lower sides are on a given pair of horizontal lines. 


D16: From such a trapezoid representation {T,,}, a trapezoid order is obtained as 
x ~< y <=> T, lies totally to the left of Ty. 


FACTS 
The following are due to Langley [La93]: 


F25: The bounded bitolerance orders are equivalent to the class of trapezoid orders. 
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F26: The bounded bitolerance graphs are equivalent to the class of trapezoid graphs. 


Trapeziod graphs were originally introduced in the context of a circuit design appli- 
cation [DaGoPi88]. They noted the following: 

A trapezoid order can be viewed as the intersection of two interval orders P, = 
(V, ~1) and Py = (V, <2) in which x X y if e<,y and x~<2y, that is, the upper interval 
of x is disjoint and to the left of the upper interval of y, and the lower interval of x 
is disjoint and to the left of the lower interval of y. This property is known as having 
interval dimension idim 2. 


9.3.5 Rank-Tolerance Graphs 


Rank-tolerance graphs (to be defined formally below) were motivated by extending 
the notion of ¢-tolerance chain graphs. A rank-tolerance representation of a graph 
assigns to each vertex two parameters: a rank, which represents the size of that vertex, 
and a tolerance which represents an allowed extent of conflict with other vertices. Two 
vertices will be adjacent if and only if their joint rank exceeds (or equals) their joint 
tolerance. By varying the coupling functions used to obtain the joint rank or joint 
tolerance, a variety of graph classes arise, many of which have interesting structure. 


BACKGROUND 


In the investigation of ¢-tolerance graphs, as we have seen in Section 9.3.4, a par- 
ticular case appeared to be of recurring interest — namely, when all the intervals in 
the representation are nested to form a chain under inclusion (the ¢-tolerance chain 
graphs). In this case, the length of the intersection of two nested intervals is just the 
minimum of their lengths. So if we denote r, = |I,|, for chain representations we may 
substitute min(ry,7w) for |Z, Ly| in the inequality (9.3.2) defining ¢-tolerance graphs. 
Thus, here, only the lengths, and not the actual intervals, play a role in determining 
adjacency. 

Motivated by this observation, Golumbic and Jamison [GoJa06] extended and ex- 
plored a large number of classes that arise when the min of ry,ry is also allowed to be 
replaced by a more general function p. That paper laid the foundation for a general 
theory of rank-tolerance graphs. 


DEFINITION 


Let C denote the class of all commutative binary operations on the positive real 
numbers. The functions in C will be thought of as combining or coupling two input 
tolerances (or ranks) and giving a single joint tolerance (or rank) as output. 


D17: Given two coupling functions p and ¢, we say that a graph G = (V,£) isa 
[p; d]-graph if there are mappings v > r, and v > t, from the vertex set V of G into 
the positive real numbers Rt such that for all vertices v 4 w in V, 


vwe EB => ply, tw) > (tv, tw). (9.3.3) 


The r, are called the ranks and the ty, are called the tolerances. The class of all 
[p; d]-graphs is denoted simply by [p; 4], where p is called the rank function and ¢ is 
called the tolerance function. ° 


5Our notation here [p; ¢]-graphs differs from the original paper [GoJa06] which called them [4, p] 
and a more recent paper [Jal2] which called them N(p; ¢). 
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FACTS 
In our notation, we have 
F27: [min; ¢] = ¢-tolerance chain graphs. 


The three Facts in Section 9.3.4 on tolerance chain graphs due to Jacobson, McMorris 
and Mulder [JaMcMu91] can now be restated as follows: 


F28: [min; min] = threshold graphs. 
F29: [min; maz] = interval graphs. 
F30: [min; sum] = coTT graphs. 


The paper by Golumbic and Jamison [GoJa06] is the primary source for rank- 
tolerance graphs where they prove the following: 


F31: [min; prod] = coTT graphs. 
F32: [maz; sum] = TT graphs = [maz; prod]. 


Question: One interesting, unstudied class worthy of future investigation is the 
class sum; prod]. Jamison has shown a new, yet unpublished result that every Threshold 
Tolerance graph is a [sum; prod]-graph. See also [JaSp06]. 


F33: If at least one of the coupling functions p or ¢ is moderately nice (continu- 
ous and weakly increasing), then reversing the roles of tolerance and rank leads to a 
representation of the complement G of G, giving us a beautiful co-symmetry between 
classes: 


[p; o] = col¢; el. 
The core functions min, maz, sum, prod all satisfy nice properties. However, there 
are examples of discrete non-strict cases where this co-symmetry fails. 


DEFINITIONS 
D18: ¢is Archimedean if lim, ¢(x,c) = co for every fixed c > 0. 
D19: ¢is dual Archimedean if lim,—,9 d(x, c) = 0 for every fixed c > 0. 


The next result is one of several on reflexive classes: 


FACTS 


F34: If ¢ is nondecreasing, weakly increasing, and associative, then [¢; ¢] is the class 
of all threshold graphs. 


F35: Under any of the following three conditions, 
e pis both Archimedean and dual Archimedean, or 
e ¢ is both Archimedean and dual Archimedean, or 
e p and ¢ are both dual Archimedean, 

the class [p; ¢] contains all threshold graphs. 


The reader is referred to [GoJa06] for further results on rank-tolerance graphs. In 
[(Jal2], Jamison studies a more comprehensive mathematical theory of conflict-tolerance 
graphs. 
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9.3.6 Intersection and Tolerance Graphs on Trees 


DEFINITIONS 


D20: A chordal graph is one that contains no chordless cycle of size greater than or 
equal to 4. 


D21: A split graph is one whose vertex set can be partitioned into a clique and an 
independent set. 


FACTS 
F36: A graph is a split graph if and only if it is chordal and its complement is chordal. 


Let T be a tree and let {T;} be a collection of subtrees (connected subgraphs) of T. 
We may think of the host tree T either (1) as a continuous model of a tree embedded 
in the plane, thus generalizing the real line from the one-dimensional case, or (2) as a 
finite discrete model of a tree, namely, a connected graph of vertices and edges having 
no cycles, thus generalizing the path P, from the one-dimensional case. 

The distinction between these two models becomes important when measuring the 
size of the intersection of two subtrees. For example, in the continuous model (1), we 
might take the size of the intersection to be the length of a longest common path of the 
two subtrees measured along the host tree [BiDe93]. In the discrete model (2), we might 
count the number of common vertices or common edges [GoJa85a, GoJa85b, JaMu00, 
JaMu05]. We use the expressions “nonempty intersection” or “vertex intersection” to 
mean sharing a vertex or point of T, and “nontrivial intersection” or “edge intersection” 
to mean sharing an edge or otherwise measurable segment of JT. In this way, edge 
intersection is more tolerant than vertex intersection. 

Using this terminology, a classical result of [Bu74, Ga74, Wa78] stated the following. 


F37: A graph is the vertex intersection graph of a set of subtrees of a tree if and only 
if it is a chordal graph. 


McMorris and Shier [McSh83] give an analogous version for split graphs. 


F38: A graph G is the vertex intersection graph of distinct induced subtrees of a star 
iy» if and only if G is a split graph. 


In contrast to these results, Golumbic and Jamison [GoJa85a] observed that the 
family of edge intersection graphs of subtrees of a tree yield all possible graphs. In fact, 
they proved the following variation on Marczewski’s Theorem: 


F39: Every graph can be represented as the edge intersection graph of substars of a 
star. 


Two different classes of intersection graphs also arise when considering simple paths 
(instead of subtrees) of an arbitrary host tree T. The path graphs, which are the vertex 
intersection graphs of paths on a tree, also known as VPT graphs, are a subfamily of 
chordal graphs. However, the graphs obtained as the “edge intersection graphs of paths 
in a tree,” called EPT graphs, are not necessarily chordal. The class of EPT graphs are 
not perfect graphs, and the recognition problem for them is NP-complete [GoJa85al, 
whereas the VPT graphs are perfect and can be recognized efficiently [Ga78]. See also 
Monma and Wei [MoWe86] and Systo [Sy85]. 
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Type of Interaction | Objects | Host | Graph Class 

vertex intersection subtrees | tree chordal graphs 

vertex intersection subtrees | star split graphs 

edge intersection subtrees | star all graphs 

vertex intersection paths path | interval graphs 

vertex intersection paths tree path graphs or VPT graphs 
edge intersection paths tree EPT graphs 

containment intervals | line permutation graphs 
containment paths tree (open question) 
containment subtrees | star comparability graphs 


Table 9.1: Some graph classes involving trees. 


FACT 


F40: [GoJa85b] In the special case of the host tree having maximum vertex degree 3 
(binary trees), the VPT and EPT classes are the same, i.e., deg3-VPT = deg3-EPT. 
See Table 9.1. 


Thus, EPT graphs are a more tolerant model than VPT graphs, but they have a 
high algorithmic cost. Table 9.1 summarizes the subtree graph classes we have discussed 
here. A full treatment can be found in Chapter 11 of Golumbic and Trenk [GoTr04]. 


The (h,s,t) Graphs: Degree Constrained Representations 


Jamison and Mulder [JaMu00, JaMu05] introduced a constant tolerance model for sub- 
trees of a tree where degree restrictions are placed on the trees. This further generalizes 
VPT and EPT graphs. 


DEFINITIONS 

D22: An (h, s,t)-representation of a graph G consists of a collection of subtrees {.S, |v € 
V(G)} of a tree T, such that (i) the maximum degree of T is at most h, (ii) every subtree 
has maximum degree at most s, and (iii) there is an edge between two vertices in G if 


and only if the corresponding subtrees in T have at least t vertices in common. 


D23: A graph is weakly chordal if neither the graph nor its complement contains a 
chordless cycle of size greater than 4. 


REMARKS 


Using this notation, where co denotes that no restriction is imposed, we immediately 
have from their definitions: 


R1: interval graphs = (2, 2,1). 
R2: EPT graphs = (co, 2, 2). 


R3: VPT graphs or path graphs = (co, 2, 1). 
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The following results have been shown in the literature: 


FACTS 

F41: chordal graphs = (co, co, 1) [Bu74, Ga74, Wa78] 
= (33,1) [McSc91] 
= (3,3,2).  [JaMu05] 


F42: (3,2,1) = (8,2,2) =V PT N chordal = EPT NM chordal. [GoJa85b] 
F43: (4,2,2) = EPT 2M weakly chordal. [GoLiSt08b] 
F44: (4,4,2) = weakly chordal N{K2,3,4P2, Po U Pi, Ps, Hi, H2, H3}-free. [GoLiSt09] 


In the paper [CoGoLiSt08], the family of (4,3,2) is characterized and a polynomial 
time recognition is also provided. The class (3, 3,3) is studied in [JaMu00]. For further 
results in this area, see [GoLiSt08a], [GoLiSt08c] and [JaMu05]. 
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INTRODUCTION 


Harper [Ha64] discusses a coding problem in which the integers 1,2,...,2” form the 
code words, each assigned to a vertex of the n-dimensional hypercube. If code words 
i and j are assigned to adjacent vertices, then Aj; is defined to be |i — j|. The paper 
determines the minimum value of }> A;; over all possible assignments. In concluding 
remarks Harper says “Another problem, as yet not solved, is this: how to number the 
vertices of an n-cube so that maxAj; ... is minimized.” This latter problem is precisely 
that of determining the bandwidth of a hypercube, and this is the first known reference 
in graph theoretic terms. 

[ChChDeGi82] and [LaWi99], each with extensive bibliographies, provide compre- 
hensive surveys of bandwidth. Further results can be found in [Ch88] and [Mi91]. All 
graphs discussed in this section are assumed to be simple and finite. 


9.4.1 Fundamentals 
The Bandwidth Concept 


EXAMPLE 


E1: Figures 9.4.1 and 9.4.2 show different vertex labelings of the three-dimensional 
hypercube Q3 and the corresponding adjacency matrices. 
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00000111 
00001011 
00001101 
00001110 
01110000 
10110000 
11010000 
11100000 


Figure 9.4.1: Hypercube Q3 with labeling and associated adjacency matrix. 


01011000 
10100100 
01010010 
10100001 
10000101 
01001010 
00100101 
00011010 


Figure 9.4.2: A second labeling of Q3 and associated adjacency matrix. 


For the labeling of Figure 9.4.1, all ones in the adjacency matrix lie in the seven diagonals 
above and below the main diagonal. Since the matrix is symmetric, we can restrict our 
attention to the diagonals in the upper triangular portion. The matrix corresponding 
to the labeling of Figure 9.4.2 has ones in only the four (“upper”) diagonals closest to 
the main diagonal. The bandwidth of a graph corresponds to the minimum number of 
such diagonals, taken over all possible labelings. For Q3 this number is four. Thus the 
labeling of Figure 9.4.2 yields the minimum. This concept is formalized as follows. 


DEFINITIONS 

NOTATION: An edge with endpoints (end vertices) wu and v is denoted uv. 

D1: A proper numbering of G is a bijection f : V > {1,2,...,n}. 

D2: Let f be a proper numbering of a graph G. The bandwidth of f, denoted B;(G), 
is given by By(G) = max{|f(u) — f(v)| + uw € EF}. 

D3: The bandwidth of G is B(G) = min{By(G) : f is a proper numbering of G}. 


D4: A bandwidth numbering of G is a proper numbering f such that B(G) = By(G) 
(i.e., a proper numbering that achieves B(G)). 


Applications 


The introduction mentions an application of bandwidth related to coding theory. 
[LaWi99] includes a survey, along with references, of other applications. 


EFFICIENT MATRIX STORAGE 

Storing the entire upper triangular portion of the adjacency matrix is one of several 
computer representations of a graph. However, only the B(G) diagonals above the 
main diagonal need be stored. If B(G) is small, this can represent a significant savings. 
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VLSI LAYOUT 


The placement problem for modules of a VLSI design is the location of the modules on 
a two-dimensional grid so that certain criteria are met. Modules that must communicate 
with each other should be as close as possible. A simplified model of the geometry 
can be given by a graph G whose vertices correspond to modules and whose edges 
correspond to wires between modules. Then B(G) represents the maximum distance 
between communicating modules. 


INTERCONNECTION NETWORKS 


An interconnection or parallel computation network is a collection of processors 
with links between them. This can be modeled by a graph G where the vertices represent 
the processors and edges correspond to the links. Sometimes it is desirable to simulate 
the network represented by G on a second network modeled by graph H. This can be 
done by a one-to-one mapping f : V(G) + V(#), where processor u in G is simulated 
by processor f(u) in H, and link uv in G is simulated by a shortest path between f(u) 
and f(v) in H. Ift is the communication time for a link uv in G, then the corresponding 
time in H is dt where d is the distance between f(u) and f(v) in H. Ift =1 and A is 
a path, the greatest possible delay in the simulation is B(G). 


BINARY CONSTRAINT SATISFACTION PROBLEM 


A binary constraint satisfaction problem involves a collection of variables, a set 
of possible values for each, and constraints between them. The problem is to assign to 
each variable a permissible value such that all constraints are satisfied. The associated 
constraint graph G has vertices representing the variables, with an edge between two 
vertices if the corresponding variables share a nontrivial constraint. If B(G) is small, 
the problem may be more easily solvable than otherwise since then it might be possible 
to deal with only a small number of variables at a time. 


Algorithms 


DEFINITIONS 


D5: The bandwidth decision problem is the problem which accepts as input an 
arbitrary graph G and an arbitrary integer K and returns “YES” if B(G) < K and 
“NO” otherwise. 


D6: For a given fixed positive integer k, the bandwidth-k decision problem is the 
problem which accepts as input an arbitrary graph G and returns “YES” if B(G) <k 
and “NO” otherwise. 


D7: A polynomial algorithm for graphs is one whose execution time is bounded by 
a polynomial in some parameter of the problem, often the number of vertices. 


D8: An NP-complete problem is a problem having a “YES” or “NO” answer that 
can be solved nondeterministically in polynomial time, and all other such problems can 
be transformed to it in polynomial time. Such problems are generally accepted as being 
computationally difficult. 
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FACTS 
F1: ([Pa76] The bandwidth decision problem is NP-complete. 


F2:  [GaGrJoKn78] The bandwidth decision problem is NP-complete for trees with 
maximum degree three. 


F3: — [Sa80] The bandwidth-k problem is solvable in polynomial time for any fixed 
positive integer k. 


REMARKS 


R1: The important distinction between the bandwidth decision problem and the 
bandwidth-k decision problem is that the integer kK in the former is an input vari- 
able while the integer & in the latter is fixed for all graphs and does not appear as an 
input. 


R2: In view of Facts Fl and F2 above, it is highly unlikely that a polynomial algorithm 
can be found for computing the bandwidth of all graphs or even for trees with maximum 
degree three. Sometimes, though, it is possible to restate an NP-complete problem, by 
limiting its generality, so that the revised problem becomes polynomial. This has been 
done for bandwidth, as illustrated in Fact F3. 


R3: Many approximation algorithms have been developed, some dealing with a ma- 
trix equivalent of the bandwidth problem, and a listing of several of them is given 
in [LaWi99]. Further surveys and references are in [Ev79, HeGr79, Sm85]. While it 
certainly would be advantageous to determine bandwidth exactly, approximate values 
remain useful in practical applications, since any reduction in the number of nonzero 
diagonals of the adjacency matrix provides benefit. 


R4: Apparently the first attempt to develop an approximate algorithm was reported 
in [AIMa65], and its effective use was limited to small matrices. The first algorithm to 
receive wide acceptance is discussed in [R068]. [CuMc69] describes an algorithm which 
took center stage during the 1970’s, despite several limitations. [GiPoSt76] presents a 
greatly improved version. Details can be found in [ChChDeGi82]. 


R5: More recent work on both approximate algorithms for general graphs and exact 
algorithms for specific classes of graphs includes [GoOp90, KaSh96, HaMa97, Ya98, 
BlKoRaVe00, Fe00, Gu01, KITa01, CaMaPr02, KrSt02]. 


9.4.2 Elementary Results 

The Bandwidth of Some Common Families of Graphs 
DEFINITIONS 

D9: The complete k-partite graph Ky, .n5,....n, iS the graph whose vertex set is 


partitioned into sets A; of n; vertices, 1 <i < k, with two vertices adjacent if and only 
if they are in distinct sets. 
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D10: The n-dimensional hypercube Q,, is the graph having 2” vertices, each labeled 
with a distinct n-digit binary sequence, and two vertices are adjacent if and only if their 
labels differ in exactly one position. 


FACTS 
F4: B(P,,) = 1, where P,, is the path having n vertices. 
F5: B(C,,) = 2, where C;,, is the cycle having n vertices. 


F6: B(K,,) =n-—1, where K,, is the complete graph having n vertices. 
] 


F7:  [Ei79 
\V (Knz.ng,....me)| — [(r1 +1)/2]. Thus, B(Kn,n,) = [n1/2] +2 —1 [Ch70]. 


F8: [Ha66] B(Q,) = 725 (in/2))- 


Let ny > no > -:: > nx be positive integers. Then B(Kn,ino,....n,) = 


EXAMPLE 


E2: Figure 9.4.2 shows a bandwidth numbering of Q3 and Figure 9.4.3 presents band- 
width numberings for C7 and Ks, 3. 


3 2 fe ¢2° Gc, Ti 28 
5 4 
: a 
6 
K 
C, 53 


Figure 9.4.3: Bandwidth numberings of C7 and K;5,3. 


A Few Basic Relations 


NOTATION: The minimum degree and maximum degree of the vertices in a graph G are 
denoted Omin(G) and dmax(G), respectively. 

TERMINOLOGY NOTE: Several authors use 6(G) and A(G) instead. 

FACTS 


F9: [ChDeGikKo75] B(G) > [5max(G)/2. 


F10: If H is a subgraph of G, then B(H) < B(G). 


F11: If graph G has components Gj, G2,...,Gz, then B(G) = max{B(G), B(G2), 
...,B(Gx)}. 


F12: [ChDeGiko75] If G is a nonplanar graph, then B(G) > 4. 
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On the Bandwidth of Trees 


DEFINITIONS 


D11: An (edge) subdivision of edge e = uv in graph G is the graph obtained from G 
by replacing e by the path (u, w, v) where w is a new vertex of degree two. A refinement 
of G is a graph obtained from G by a finite number of subdivisions. 


D12: The complete k-ary tree T;,,q of depth d is the rooted tree in which all vertices 
at level d— 1 or less have exactly & children, and all vertices at level d are leaves. 


FACTS 


F13: [ChDeGiko75] For any tree T, B(T) < ||V(T)|/2]. Equality holds if and only if 
|V(T)| is even and T is the star Ky jy(ry\-1- 


F14: [WaYa95, AnKaGe96] Let T be a tree with k univalent vertices. Then 
B(T) < [k/2]. 


F15: [Sm95] Let Ti,q be the complete k-ary tree of depth d. Then we have B(Th,a) = 
[k(k* — 1)/(2d(k — 1))). 


F16: [Ch88] If tree T contains a refinement of the complete binary tree Tp 4, then 
B(T) > [4/2]. 


EXAMPLE 


E3: Figure 9.4.4 shows bandwidth numberings of two trees, one for which B(T) < |n/2| 
and Ky,7, for which equality occurs. 


2 
6 1 1 
7 5 3 
2 4 
8 5 
8 4 
7 6 
Ki, 


Figure 9.4.4: Bandwidth numberings of two trees. 
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Alternative Interpretations of Bandwidth 


Three alternative interpretations of bandwidth are shown below. Others are given in 
[Li00]. 

DEFINITIONS 

D13: The k‘’ power of graph G, denoted G*, is the graph having the same vertex 


set as G and an edge between two vertices if and only if the distance between them is 
at most / in G. 


D14: The complementary numbering f. of proper numbering f of G is defined 
by f-(v) =n +1 — f(v) for each vertex v of G. 


FACTS 


F17: [ChChDeGi82] For a real symmetric matrix M, let m;; be the value in position 
(i, 7). Consider the problem of finding a symmetric permutation of the rows and columns 
of M such that the maximum of |: — j|, taken over all pairs (i,7) for which m,,; is 
nonzero, is minimized. This problem’s equivalence to the bandwidth problem follows 
by replacing each nonzero entry of M by 1 and considering the resulting matrix as an 
adjacency matrix of a graph. 


F18: [ChChDeGi82] G has bandwidth k if and only if k is the smallest integer such 
that G can be embedded in P® where P* is the kth power of the path P,, on n vertices. 


F19: By.(G) = By(G) so the complementary numbering of a bandwidth numbering 
also is a bandwidth numbering. 


9.4.3. Bounds on Bandwidth 


Two General Bounds 


FACTS 


F20: [Ha66] For S C V, let OS be the subset of S with at least one neighbor outside 
of S. Then B(G) > max; min{|0S| : |S| = &}. 


F21: [Ch80-a] For S C V, let AS be the subset of edges of G with exactly one endpoint 


in S. Then B > max; min { C+e0is°— :|S| = ih. 


Subdivisions, Mergers, Contractions, and Edge Additions 


DEFINITION 


D15: The merger of two vertices u and v of graph G is the graph, denoted G|,,,v, 
obtained from G by identifying u and v and then eliminating any loops and duplicate 
edges. If e = uv, the merger G|,,,, is called a contraction of G along e and denoted 
Gle. 
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FACTS 


F22: [ChOp86] If H is obtained from G by a subdivision of an edge, then 


and this result is sharp. 
F23: [ChOp86] For any graph G and vertices u,v € V(G), 
B(G) -1< B(Gluy) < 2B(G) 
and both bounds are sharp. 
F24: [Ch80-a, ChOp86] For any graph G and edge e € E(G), 
B(G) -1< B(G|e) < [[8B(G) — 11/21 
and both bounds are sharp. 


F25: [WaWeYa95] Let B(G) = b and g(b,|V(G)|) be the maximum possible value of 
B(G +e). Then 


b+1 if |V(G)| < 3b+4 
gb, |V(G)I) = 4 [UV(G@)|—1)/3] if 38b+5 <|V(G)| < 66-2 
2b if |V(G)| > 6b—-1 


EXAMPLE 


E4: Figure 9.4.5 illustrates Fact F25 by showing a graph G having bandwidth 2 and a 
corresponding G' + e having bandwidth 4, both shown with bandwidth numberings. 


Figure 9.4.5: A graph whose bandwidth doubles when an edge is added. 


REMARK 


R6: Fact F25 gives a complete solution to a question originally posed by Erd6és: whether 
B(G+e)— B(G) < 1, where G'+e is a graph obtained from G by adding an edge e not 
originally in G. This was first shown not to be the case in [Ch80-a]. 
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Nordhaus—Gaddum Types of Bounds 


DEFINITIONS 


D16: The complement of graph G, denoted G, is the graph with the same vertex 
set as G and e € G if and only ife ¢ G. 


D17: A property P of a graph holds for almost all graphs if the ratio of the number 
of n-vertex graphs possessing P divided by the number of n-vertex graphs approaches 
one as n approaches infinity. 


FACTS 


F26: [ChErChGr81] For any graph G, |V(G)| — 2 < B(G) + B(G). 

F27: [ChErChGr81] There is a positive constant c, such that B(G)+.B(G) < 2|V(G)| 
—c; log |V(G)| for any graph. 

F28: [ChErChGr81] There is a positive constant cz such that 2|V(G)|—c1 log |V(G)| < 


B(G) + B(G) for almost all graphs. 


F29: [FuWe01]} Let f(n) = max{B(G) + B(G) : G an n-vertex graph}. Then 


In — [(4+2V2) logs n] < f(n) < 2n — 4logy n + o(logn) 
Other Bounds 


FACTS 
F30: [ChDeGiKo75] Let G be a graph and G* be its k'” power. Then B(G*) < kB(G). 
F31: [Ch80-a] For graph G, B(G) < |V(G)| — 3 if and only if G contains a P4. 


F32: [Ch80-a] Let G be a graph such that d(x, y) < 2 for every pair of vertices z, y. 
Then B(G) = |V(G)| — 2 if and only if every component of G is a vertex, a Ky», or a 
Ks. 


9.4.4 On the Bandwidth of Combinations of Graphs 


Two or more graphs can be combined in a variety of ways to form a new graph, 
and information about the bandwidth of the new graph often can be gleaned from the 
bandwidths of the original graphs. 


Cartesian Product 


DEFINITION 


D18: The Cartesian product of graphs G and H, denoted G x H, is the graph where 
V(G x H) = V(G) x V(A) and (91,h1)(g2,h2) € E(G x H) if and only if either (i) 
Gi = g2 and hyhe € E(#) or (ii) hy = he and 9ig2 € E(G). 
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FACTS 
F33: [Ch75, ChDeGiko75] For graphs G and H, 
B(G x H) < min{|V(H)|B(G), |V(@)|BUD}- 
F34: [Ch75] For paths P,, and P,,, where max{m,n} > 2, B(P» x Pp) = min{m,n}. 


F35: Ifm>2 and n> 3, then B(P,, x C,) = min{2m,n}. 


EXAMPLE 


E5: Figure 9.4.6 shows a bandwidth numbering of P3 x C’7 which, by Fact F35, has 
bandwidth 6. 


7 13 19 14 8 


Figure 9.4.6: A bandwidth numbering of P3 x C7. 


Sum of Two Graphs 


DEFINITION 

D19: The sum of graphs G and H, denoted G + H, is the graph with V(G + H) = 
V(G) UV(A) and E(G + A) = E(G)U E(A) U {uw : ue V(G) andv€ V(A)}. 
FACTS 


F36: [LiWiWa91] Let G and A be graphs such that |V(G)| > |V(#)| and B(G) < 
[|V(G)|/2]. Then B(G + H) = [|V(G)|/2] + |V(A)| - 1. 


F37: [LiWiWa91] Let G and A be graphs such that |V(G)| > |V(#)| and B(G) > 
[|V(G)|/2]. Then |V(G)|/2+|V(H)|-1 < B(G+H) < min{ B(G)+|V(4)|, max{ B(H) 
+|V(G)I, [IV (A)|/2] + |V(G)| — 1}}- 


F38: [LiWiWa91] For paths P, and P,, with n >m, B(P,+ Pn) = [n/2] +m-—1. 
F39: [LiWiWa91] For cycles C,, and C,, with n > m, 


[n/2] +m—1 ifn>5 
B(Cy,+ Cn) =< 5 ifn<4andm=3 


6 ifn=m=4. 
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EXAMPLE 


E6: Figure 9.4.7 shows a bandwidth numbering of Cs + C3 which, by Fact F39, has 
bandwidth 5. 


Figure 9.4.7: A bandwidth numbering of Cs + C3. 


Corona and Composition 


DEFINITIONS 


D20: The corona of graphs G and H, denoted Go H, is the graph constructed from 
one copy of G and |V(G)| copies of H, one associated with each vertex of G. If uv € V(G) 
and H, is the copy of H associated with v, there are the additional edges vh for every 
he V(A,). 


D21: The composition of graphs G and H, denoted G(H), is the graph where 
V(G(A)) = V(G) x V(A) and (91,h1)(g2,h2) € E(G(A)) if and only if either (i) 
gigz € E(G) or (ii) g1 = go and hyh2 € E(A). 


FACTS 


F40: [Ch80-b] For graphs G and H, B(Go H) < B(G)(|V(H#)| +1), and this bound is 
sharp. 


F41: [Ch80-b] For graphs G and H, B(G(A)) < (B(G)+1)|V(A)|-1, and this bound 


is sharp. 


Strong Product and Tensor Product 


DEFINITIONS 


D22: The strong product of graphs G and H, denoted G(Sp)H, is the graph with 
V(G(Sp)H) = V(G) x V(A) and (91,h1)(g2,h2) € E(G(Sp)H) if and only if either 
gig2 € E(G) and hyh2 € E(#f) or g1 = 92 and hyhg € E(#f) or hy = hg and 
gige € E(G). 
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D23: The tensor product of graphs G and H, denoted G(T p)H, is the graph with 
V(G(Tp)H) = V(G) x V(A) and (gq, h1)(g2, he) € E(G(Tp)H) if and only if gigo € 
E(G) and hyh2 € E(#f). 


FACTS 


F42: [LaWi95] (a) If m >n> 2, B(Pm(Sp)Pa) =n +1. 


m+2 ifn>|n/2|+1 


(b) If m > 3 and n > 2, B(Cm(Sp) Pa) = { 2n+1 otherwise. 


(c) im > n, B(C,,(Sp)C,) = 2n +2. 


F43: [LaWi97-b] 


n+1 ifm >n-> 4 and m,n even 
B(Cn(Tp)Cr) = ¢ min{n+1,2m+1} ifm odd, n even 
2n+1 ifm >n-> 3 and m,n odd 


9.4.5 Bandwidth and Its Relationship to Other In- 
variants 


Many bounds for bandwidth in terms of other graphical invariants have been found. 
Several are listed in the previously mentioned survey papers and in [BrDu85, BrDu91]. 


NOTATION: Throughout this subsection, G is a graph with V = V(G), E = E(G), 
B= B(G), Omax _ dmax(G), and Omin >= dmin(G). 


Vertex Degree 


DEFINITION 
D24: The degree sequence of graph G is a listing of the degrees of the vertices of G, 
usually in monotonic order. 
FACTS 
F44: [Ch70] If the graph G has the degree sequence dy < dp <--- < dy, then 
Be metas (9 > 1)/2]5 43/2: 
Setting 7 = 1 yields B > din. 


F45: [ChChDeGi82] If G contains no copies of K3, B > | (35min — 1)/2]. 


EXAMPLE 


E7: Two graphs that show Fact F45 is sharp are the 3-dimensional hypercube Q3 and 
the graph along with the bandwidth numbering shown in Figure 9.4.8. Both examples 
have no K3, dmin = 3, and bandwidth B = | (3(3) — 1)/2| = 4. 
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Figure 9.4.8: Bandwidth numbering of a graph with no Ks, dmin = 3, B = 4. 


Number of Vertices and Edges for Arbitrary Graphs 


FACTS 


F46: [DuBr89] B < (|E| 4+ 1)/2. 


F47: [BrDu85] B > [21v| = i-,/QV|= 1)? =s)E|| /2. 


F48: [LaWi97-a] If G is connected, then B > |V| — s, where s is the largest integer 
such that s(s— 1) < |V|(\V| — 1) — 2|E|, and the bound is sharp. 


F49: [DuBr89] If B > |V|/2, then |E| > |V|(\V| — 1)/[2(\V| — B)]. 
F50: [DuBr89] If B > |V|/2, then |E| > (2[|V|/2] — 1) (IVI/(V| — 2)/B- UY, 


F51: [AlLiMcEr92] Let B = [(1 —«)|V|] with 0 < « <1. Then there are positive con- 
stants c, (which depends on €) and cz such that c;|V|/e < m(|V|, B) < co(log (2/€)|V | /e 
where m(|V|, B) is the minimum possible number of edges in a graph with |V| vertices 
and bandwidth B. 


Number of Vertices and Edges for Graphs with no k3 


FACTS 


F52: [ChTr84] Let t(n, B) be the maximum number of edges that an n-vertex graph 
having no K3 and bandwidth at most B can have. Then (2 _ V2) nB < t(n,B) < 
5+V3nB . 


F53: [BrCaDuFiVi00] Let G be a bipartite graph with partite set sizes m and n, 
m <n, bandwidth B, and ([m/2]+1)B <n < (m+1)B-1. Then |E| < 2mB-2m-— 
3+ [(n+1)/B] + |(n+1)/B], and this bound is sharp. 


F54: [BrCaDuFiVi00] Let G be a bipartite graph with partite set sizes m and n, 
m<n = (t+1)B+ |aB| where a is a fixed constant such that 0 < a < 1, and 
B> (m+t+4|m/t] —5/2)/(1— a). Then |E] < 2mB — |m/t|(2m—t|m/t] — t), and 
this bound is sharp. 
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Radius and Diameter 


DEFINITIONS 


D25: The radius of graph G, denoted rad(G), is the smallest number r such that 
there is a vertex u of G with distance at most r from every other vertex of G. 


D26: The diameter of graph G, denoted diam(G), is the maximum distance between 
any two vertices of G. 


FACTS 

F55: [ChChDeGi82] For any graph G, B < bmax(dmax — 1)". 
F56: [Ch70, ChDeGiko75] For any graph G, 

[(V| — 1)/diam(G)| < B < |V| — diam(G) 


F57: [ChSe89] For any graph G, B > max{(|V(G"’)| — 1)/diam(G’)} where the maxi- 
mum is taken over all connected subgraphs G’ of G that have at least two vertices. 


REMARK 


R7: Paths and cycles achieve the lower bound of Fact F56. Figure 9.4.9 shows a 
bandwidth numbering of a graph having |V| = 9, diameter = 4, and B = 5, so the 
graph achieves the upper bound. 


Figure 9.4.9: Bandwidth numbering of a graph with |V| = 9,d=4, B=5. 


Vertex and Edge Chromatic Number 


DEFINITIONS 


D27: The (vertex) chromatic number of graph G, denoted y(G), is the smallest 
number & such that there is a function f : V(G) > {1,2,...,&} with the property that, 
if wv is an edge, then f(u) 4 f(v). 


D28: The edge-chromatic number of graph G, denoted x/(G), is the smallest number 
k such that there is a function f : E(G) > {1,2,...,&} with the property that, if edges 
e, and eg share a common vertex, then f(e1) 4 f(e2). 
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FACTS 
F58: [ChDeGiko75] For any graph G, B > x(G) - 1. 


F59: [BrDu85] For any graph G, B > x/(G)/2. 


Vertex Independence and Vertex Cover Numbers 


DEFINITIONS 


D29: The vertex independence number of graph G, denoted ind(G), is the largest 
cardinality of a set of vertices which induces a graph with no edges. 


D30: The vertex cover number of graph G, denoted ao(G), is the smallest cardinality 


of a set of vertices such that every edge is incident to at least one of the vertices in the 
set. 


FACTS 
F60: [Ch70, ChDeGiko75] For any graph G, 
[IV|/ind(@)] 1 < B<|V|—[ina(@)/2| -1 


F61: [De76] For any graph G, B > ao(G)/ind(G). 


Girth, Vertex Arboricity, and Thickness 


DEFINITIONS 


D31: The girth of graph G, denoted girth(G), is the size of a smallest induced cycle 
of G. 


D32: The vertex arboricity of graph G, denoted arbor(G), is the minimum number 
of subsets into which V(G) can be partitioned such that the vertices of each subset 


induce an acyclic subgraph. 


D33: The thickness of graph G, denoted thick(G), is the smallest number of planar 
subgraphs of G whose union is G. 


FACTS 
F62: [BrDu91] If G is not a forest, then B > (girth(G) — 1)(arbor(G) — 2) + 2. 
F63: [BrDu91] If G is not a forest, then 

B > [(girth(G) — 1)|V|/(2 - ind(G)] — girth(G) + 2 


F64: [BrDu85] For any graph G, thick(G) < max (B/2, 1). 
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9.4.6 Related Concepts 


The study of bandwidth has spawned investigations into a variety of related ideas. 


Bandsize 
Bandsize is discussed briefly in [LaWi99]. 
DEFINITION 


D34: Let f be a proper numbering of a graph G. The bandsize of f, denoted bs ;(G), 
is the number of distinct edge differences produced by f. The bandsize of G is given 
by bs(G) = min{bs;(G) : f is a proper numbering of G}. 

FACT 


F65: [ErHeWi89] For any graph G, B(G) > bs(G). 


Edgesum (Bandwidth Sum) 


The edgesum first appeared in [Ha64]. Edgesums are discussed in [Se70, Io74, Io76, 
ChChDeGi82, Ch88, YaWa95, YuHu95, YuHu96, LaWi99]. 


DEFINITION 

D35: Let f be a proper numbering of G. The edgesum generated by f is s;(G) = 
wera |f(u) — f(v)|. The edgesum of G is given by s(G) = min{ss(G) : f isa 
proper numbering of G}. 

FACTS 

F66: For the n-dimensional hypercube Qn, s(Qn) = 2"~1 (2” — 1). 


F67: Like bandwidth, the edgesum decision problem is NP-complete. 


Cyclic Bandwidth 
Cyclic bandwidth is discussed in [Li94, Li97, HakaRi99, LaShCh02]. 
DEFINITION 


D36: Let f be a proper numbering of G. The cyclic bandwidth of f is B.s(G) = 
max{|| f(u) — f(v)|le : w € E} wherellz||. = min{|z|,n — |x|}. The cyclic bandwidth 
of G is given by B.(G) = min{B.¢(G) : f is a proper numbering of G}. 


Edge-Bandwidth 
Edge-bandwidth is introduced and several results are presented in [JiMuShWe99]. 
DEFINITIONS 


D37: An edge-numbering f of a graph G is a bijection from E(G) to the set of 
integers. 
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D38: Let f be an edge-numbering of G. The edge-bandwidth of f is Bi(G) = 
max{|f(e1) — f(e2)| : edges e; and eg adjacent in G}. The edge-bandwidth of graph 
G is given by B'(G) = min{B;(G) : f an edge numbering of G}. 


D39: The line graph of a graph G is the graph L(G) such that V(L(G)) = E(G) and 
two vertices in L(G) are adjacent if and only if the corresponding edges are adjacent in 
G. 

FACTS 

F68: For any graph G, B’(G) = B(L(G)). 

F69: For any graph G, B(G) < B’(G) and, if G is a forest, B’(G) < 2B(G). 


EXAMPLE 


E8: Figure 9.4.10 shows an edge-bandwidth numbering of a graph G and a bandwidth 
numbering of the line graph L(G). Notice that the edge-bandwidth numbering of an 
edge of G is identical to the bandwidth numbering of the corresponding vertex in L(G). 
It is not difficult to see that B(L(G)) = 3. 


6 3 
7 1 
4 3 
G, B'G)=3 L(G), B(L(G))= 3 


Figure 9.4.10: B(L(G)) = B'(G) =3. 


Profile 
The profile of a graph is discussed in [LiYu94, LaWi99]. 
DEFINITIONS 


D40: Let f be a proper numbering of a graph G, and let vertex v € V(G). The profile 
width is wy(v) = maxzenjr(f(v) — f(x)) where N|v] is the closed neighborhood of v. 


D41: Let f be a proper numbering of a graph G. The profile of f is P;(G) = 
vev wr(v). The profile of G, denoted P(G), is given by P(G) = min{P;(G): f isa 
proper numbering of G}. 

FACTS 

F70: P(P,)=n-1. 

F71: P(C,) = 2n-3. 

F72: Ifm<n, then P( Km») =mn+m(m — 1)/2. 
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Cutwidth 
References for cutwidth include [GuSu82, Le82, Ch85, MaPaSu85, Ya85, Ch8g]. 
DEFINITION 


D42: Let f be a proper numbering of a graph G. The cutwidth of f is cs(G) = 
max; |{vw € E(G): f(v) <i< f(w)}|. The cutwidth of G is cutw(G) = min{c;(G) : 
f is a proper numbering of G}. 

FACTS 

F73: cutw(P,) = 

F74: cutw(C,) = 


( 

( 

F75: cutw(K,) = |n?/4). 

F76: cutw(Kin) = [n/2). 
( 


F77: cutw(Th,a) = [(d— 1)(kt — 1)/2] + 1ifd> 3. 


Topological Bandwidth 
Some references for topological bandwidth are [Ch80-a, MaPaSu85, Ch88}. 
DEFINITION 


D43: The topological bandwidth of graph G, denoted B*(G), is given by B*(G) = 
min{B(G") : G’ is a refinement of G}. 


FACTS 

F78: For any graph G, B*(G) < cutw(G). 

F79: For any tree T, B*(T) < cutw(T) < B*(T) + log, B*(T) + 2. 
F80: If Gis P,, Ch, or Kin, then B*(G) = cutw(G). 


F81: B*(K,) =n-1 < cutw(K,). 


Additive Bandwidth 


The proper numbering of Q3 given in Section 9.4.1 corresponds to an adjacency ma- 
trix with ones on all but one diagonal above the main diagonal, including the diagonal 
farthest away. Since B(Q3) = 4, this is far from a bandwidth numbering. However, 
consider the main contradiagonal (running from the lower left corner of the adjacency 
matrix to the upper right). All ones are on the main contradiagonal and the two contra- 
diagonals above and below it. This concept, recognized and elucidated in [BaRuS192], 
is the basis for a second type of bandwidth called additive bandwidth. Since the ones of 
the adjacency matrix are all on contradiagonals within two of the main contradiagonal, 
the additive bandwidth for Q3 is at most two, and it is easy to see that equality holds. 
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DEFINITION 


D44: Let f be a proper numbering of a graph G. The additive bandwidth of f is 
B}(G) = mar{|f(u) + f(v) — (n+ 1)| : uv € E}. The additive bandwidth of G, 
denoted Bt(G), is given by Bt(G) = min{ BF (G) : f is a proper numbering of G}. 


REMARKS 


R8: The expression |f(u) + f(v) — (n+ 1)| indicates, in the adjacency matrix, the 
number of contradiagonals (perhaps zero) from the main contradiagonal which contains 


the one corresponding to edge uv, and the summation involved motivated the name 
“additive bandwidth.” 


R9: Many of the investigations which have been made into bandwidth have been 
repeated for this new concept. However, while it is believed that the corresponding 
decision problem is NP-complete, this had not been proven nor have any algorithms yet 
been developed. 


R10: Since B*+(Q3) = 2 < B(Q3) =4, we see that additive bandwidth can be smaller 
than bandwidth. The graphs P,,, x P,, represent an infinite family for which this is true 
by the same factor of two. In fact, this factor of two is best possible. On the other 
hand, additive bandwidth can be arbitrarily larger than bandwidth (by Fact F34 and 
Fact F82 below). 


R11: In addition to the three results given below, the values of Bt(G) for other 
families of graphs have been determined, as have several relationships between it and 
other invariants. The complete k-ary tree has proven difficult, and only partial results 
are available for it. We have seen that adding an edge to a graph can double its original 
bandwidth. In fact, the addition of an edge can triple the original additive bandwidth 
[BrCaViWiYe03]. Additional references for additive bandwidth are [Ha93, HaCaCa94, 
VoBr94, BrCaRoVi96, Ca96, DuBr97]. 


FACTS 
F82: [BaRuSl92] Bt(P,, x P,) = [min{m,n}/2]. 
F83: [BaRuS192] If B+(G) > 1, then B(G) < 2B+(G). 


F84: [BaRuBrCaSlVi95] B(tK3) =2 and Bt(tK3) =t. 
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INTRODUCTION 


In pursuit-evasion problems, a team of mobile pursuers (or searchers) attempts to 
capture one or more mobile evaders (fugitives, intruders) within a graph. For example, 
the pursuers may represent soldiers, policemen, or robots. The evaders might be ter- 
rorists, criminals, lost children, or even a poisonous gas. The graph may represent a 
road map, building floor plan, cave system, pipe network, etc. Many distinct variations 
of pursuit-evasion problems can be formulated by specifying the rules of movement for 
the pursuers and for the evaders, the knowledge each opponent has about the other, 
the rules of capture, the kind of graph, and the objective function. Typical objectives 
include minimizing the number of pursuers, the distance travelled by pursuers, or the 
elapsed time until capture. Because finiteness of the latter two objectives is equiva- 
lent to optimization of the first objective, the complexity of the latter two problems is 
bounded below by that of the first. However, optimization of the first objective is usu- 
ally W’P-hard for general graphs; hence the bulk of the graph-theoretic pursuit-evasion 
literature focuses on that objective. 


The following subsections discuss several of the most-studied pursuit-evasion varia- 
tions. Other surveys on pursuit-evasion include [Bi91], [FoPe96], [A104], [Ha07], [FoTh08], 
[ChHolIs11], and [BoYal1]. Now there is also an entire book on this topic [BoNol11]. 
Throughout this section, except where explicitly specified otherwise, G will denote a 
connected undirected graph or multigraph, possibly with loops. 
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9.5.1 Sweeping and Edge Search 
FACT 


F1: Parsons [Pa78] describes the original pursuit-evasion problem. The following 
definitions are adapted from [Pa78}. 


DEFINITIONS 


D1: Consider an embedding of G in 3D space such that each vertex resides at a distinct 
location and no two edges intersect except at a common endpoint. (For every G such 
an embedding exists.) Let k denote the number of pursuers, and let P = (P,,..., Py) 
where each P; : [0,00) — G is a continuous function. Then P is a sweep strategy for G 
if for every continuous function E : [0,00) > G, there exists some pursuer j € {1,...,k} 
and time t such that P;(t) = E(t). Here P;(t) denotes the location within graph G of 
pursuer j at time t, E(t) denotes the location of an evader at time t, and capture occurs 
when P;(t) = E(t). 


D2: The sweep number of G, denoted sw(G), is the smallest k such that a sweep 
strategy P = (P,,..., Px) exists. The sweep problem on G is to determine sw(G), 
and G is k-sweepable if sw(G) < k. 


FACTS 


F2: Petrov [Pe82] independently develops another pursuit-evasion model, based on a 
system of differential equations. See [Pe82] for details. 


F3: Golovach [Go89] describes yet another formulation for pursuit-evasion. The 
following definitions are adapted from [Go89]. 


DEFINITIONS 


D3: An edge search operation is one of the following: p(a) = place a pursuer at 
vertex x; r(a) = remove a pursuer from vertex x; and s(e,2z,y) = slide a pursuer along 
edge e from endpoint x to other endpoint y. (We may write s(e,x,y) as s(x, y) if only 
one edge (2, y) exists, or as s(e) if the sliding direction is forced or inconsequential.) 


D4: Initially every edge of G is contaminated (might contain an evader). An edge e = 
(x, y) becomes clear if a pursuer slides along e from x to y while either (i) another pursuer 
resides at x or (ii) every other edge incident to x is clear. If ever any unoccupied vertex 
x is incident to a contaminated edge, then any clear edges incident to x immediately 
become recontaminated. (So if a pursuer slides from «x to y while neither (i) nor (ii) 
holds, then edge (x, y) does not become clear.) An edge search strategy for G is any 
sequence of edge search operations that ends with every edge of G being simultaneously 
clear. 


D5: The edge search number of G, denoted es(G), is the smallest number of 
pursuers needed to implement any edge search strategy. The edge search problem on 
G is to determine es(G), and G is k-edge-searchable if es(G) < k. 
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FACT 


F4:  Golovach [Go89] shows that the formulations of Parsons, Petrov, and Golovach 
are all equivalent problems. Therefore sw(G) = es(G) for every G. 


EXAMPLES 


El: The graph in Figure 9.5.1(a) has edge search number 2. Here is a strategy that 
clears this graph using 2 pursuers: p(1), p(1), s(a), s(b), s(c), s(d,1,2), s(e), s(f), s(3,2), 
s(g), s(h,2,4), s(i), sj), s(5,4), s(k), s(1,4,6), s(m), s(n). 


E2: The graph in Figure 9.5.1(b) has edge search number 3. Here is a strategy 
that clears this graph using 3 pursuers: p(1), p(2), p(2), s(a,2,1), s(b,1,2), s(c,2,1), 
s(d), s(e,1,3), s(3,2), s(g), s(h,2,4), (4,3), s(j), s(k,3,5), s(5,4), s(m), s(n,4,6), s(0,6,5), 
s(p,5,6), s(q). 


Figure 9.5.1: Typical graphs with edge search numbers 2 and 3. 


FACTS 


F5: The edge search number es(G) = 1 if and only if G is a simple path. So es(G) = 1 
if and only if G contains neither a cycle nor a vertex of degree 3 or more. Equivalently, 
es(G) = 1 if and only if G contains neither of these two minimal forbidden minors: a 
loop with one vertex and one edge, or a star with three edges. 


F6:  Megiddo et al. [MeHaGaJoPa88] show that es(G) < 2 if and only if G con- 
tains none of the minimal forbidden minors illustrated in Figure 9.5.2. This paper also 
provides a structural characterization for the 2-edge-searchable graphs, similar to the 
graph in Figure 9.5.1(a). 


F7: Megiddo et al. [MeHaGaJoPa88] show that if G is biconnected, then es(G) < 3 
if and only if G contains none of the minimal forbidden minors illustrated in Fig- 
ure 9.5.3. This paper also provides a structural characterization for the biconnected 
3-edge-searchable graphs, similar to the graph in Figure 9.5.1(b), and also a more gen- 
eral characterization for the non-biconnected 3-edge-searchable graphs. 
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Figure 9.5.2: Forbidden minors for graphs with edge search number < 2. 
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Figure 9.5.3: Forbidden minors for biconnected graphs with edge search number < 3. 


EXAMPLES 


E3: The graph in Figure 9.5.2(a) has edge search number 3: p(1), s(1,4), p(3), s(3,4), 
r(4), s(4,5), p(2), s(2,6), p(7), s(7,6), s(6,5), s(5,8), s(5,8), s(8,9), s(8,10). However, if 
edge (8,10) is removed, the resulting graph has edge search number 2: p(1), s(1,4), p(3), 
(3,4), s(4,5), (4,5), s(5,8), s(8,9), r(9), s(5,6), p(2), s(2,6), s(6,7). 


E4: The graph in Figure 9.5.2(b) has edge search number 3: p(1), s(1,2), p(2), s(2,3), 
(2,4), p(5), s(5,3), s(3,4), s(4,6). However, if edge (4,6) is removed, the resulting graph 
has edge search number 2: p(1), s(1,2), p(2), s(2,3), s(2,4), s(4,3), s(3,5). 


E5: The graph in Figure 9.5.2(c) has edge search number 3: p(1), p(1), p(1), s(a), 
s(b,1,2), s(c). However, if edge c is removed, the resulting graph has edge search number 
2: p(1), p(1), s(a), s(b). 


E6: The graph Ky in Figure 9.5.3(a) has edge search number 4: p(1), p(1), p(1), 
s(1,2), s(1,3), s(1,4), p(3), s(3,2), s(2,4), s(4,3). However, if edge (3,4) is removed, the 
resulting graph has edge search number 3: p(1), p(1), p(1), s(1,2), s(1,3), s(1,4), s(3,2), 
s(2,4). 


E7: The graph in Figure 9.5.3(b) has edge search number 4: p(1), p(1), p(1), s(1,3), 
s(1,5), s(1,7), p(2), s(2,3), r(3), s(3,8), p(4), s(4,5), s(5,8), (8,7), s(7,6). However, if 
edge (6,7) is removed, the resulting graph has edge search number 3: p(2), s(2,3), p(3), 
(3,1), s(3,8), p(1), s(1,7), s(7,8), s(1,5), (8,5), s(5,4). 
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E8: The graph in Figure 9.5.3(c) has edge search number 4: p(1), p(1), p(1), p(1), 
s(a), s(b,1,2), s(c,1,2), s(g), s(h,2,3), s(i,2,3), s(d,3,1), s(e), s(f). However, if edge i is 
removed, the resulting graph has edge search number 3: p(1), p(1), s(a), p(2), s(b,2,1), 
s(c,1,2), s(g), s(h,2,3), s(d,3,1), s(e,1,3), s(f). 


FACTS 


F8:  Megiddo et al. [MeHaGaJoPa88] prove that the decision version of the edge 
search problem is W’P-complete for arbitrary graphs G. 


F9: For every n > 4, es(K,,) = n, where K,, denotes a complete graph (or clique) 
with n vertices. 


F10: Megiddo et al. [MeHaGaJoPa88] present a linear-time algorithm for computing 
es(G) when G is a tree. 


F11: There exist polynomial-time algorithms for computing es(G) when G is a split 
graph, an interval graph, or a cograph. 


REMARK 


R1: It currently remains unresolved whether or not polynomial-time algorithms exist 
for computing es(G) when G is a permutation graph, an outerplanar graph, a series- 
parallel graph, or a planar graph. 


FACTS 


F12: Parsons [Pa78] shows that if G is a tree and k > 2, then es(G) > & if and only 
if G has a vertex v with degree d > 3 such that splitting v into d vertices each having 
degree 1 yields a forest in which at least three trees have edge search number at least 
k-1. 


F13: Let T, denote a smallest tree such that es(T;,) = k. Then T has a single edge, 
T> is a star with three edges, and T3 is the tree shown in Figure 9.5.2(a). In general for 
k > 2, Ty may be formed from three copies of 7,1 by choosing one leaf from each copy 
of T;,_; and fusing together these three vertices. 


F14: Let mz denote the number of edges in Ty. Then m, = 1, mz = 3, and m3 = 9. 
In general for k > 2, it follows that my, = 3mz_1, SO ME = 3k-1. Hence if T is any tree 
with m edges, then es(T’) < 1+ logs m. 


F15: LaPaugh [La93] shows that recontamination is not useful for edge search. That 
is, es(G) pursuers can always clear G using an edge search strategy in which no clear 
edge ever becomes recontaminated. 


REMARK 


R2: LaPaugh’s result that recontamination is not useful applies only to edge search but 
not to sweeping. This is because edge search permits arbitrary removal and placement of 
a pursuer, which essentially allows pursuers to jump between any vertices of the graph. 
However, sweeping requires each pursuer to move continuously through the graph, and 
therefore may require a pursuer to traverse (and unintentionally clear) a contaminated 
edge. 
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EXAMPLE 


E9: The graph in Figure 9.5.4 illustrates that recontamination is sometimes useful for 
sweeping. First note that this graph has edge search number 3 as follows: p(1), p(1), 
p(1), s(a), s(b,1,2), (c,1,2), r(2), r(2), p(3), p(3), s(F), s(g), (3,4), 8(4,2), s(j), ste), 
(2,5), s(h), s(5,7), r(7), p(6), s(i), r(6), r(6), p(7), p(7), s(1), s(m,7,8), s(n). However, 
this solution requires jumping to avoid recontamination. That is, rather than removing 
two pursuers from vertex 2 and placing them on vertex 3, instead let these two pursuers 
slide along edges d and g to reach vertex 3. Then edge d is temporarily cleared, but 
later it becomes recontaminated when both pursuers depart from vertex 4. Because 
sweeping only permits moving along edges (no jumping), the graph in Figure 9.5.4 has 
sweep number 3, but every sweep strategy with 3 pursuers requires recontamination to 
occur. 


Figure 9.5.4: Recontamination can be useful for sweeping. 


DEFINITIONS 
D6: A search strategy is monotonic if recontamination does not occur. 


D7: A search strategy is internal if no pursuer is ever removed from a vertex (so 
jumping does not occur). 


D8: A search strategy is connected if the set of clear edges always induces a connected 
subgraph. 


FACT 


F16: Barriere et al. [BaFrSaTh03] provide inequalities that show the relationships 
between the numbers of pursuers needed to clear a graph when one or more of these 
constraints (m = monotonic, i = internal, c = connected) are required during edge 
search. In particular, es(G) = m(G) = i(G) < mi(G) < c(G) = ic(G) < mc(G) = 
mic(G). 
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9.5.2 Node Search and Mixed Search 


FACT 


F17:  Kirousis and Papadimitriou [KiPa85], [KiPa86] introduce a variation of pursuit- 
evasion that lacks sliding and that has a novel rule for clearing edges (capturing evaders). 
The following definitions are adapted from [KiPa86]. 


DEFINITIONS 


D9: A node search operation is one of the following: p(x) = place a pursuer at 
vertex x, and r(x) = remove a pursuer from vertex «. 


D10: Initially every edge of G is contaminated. An edge e = (x,y) becomes clear if 
pursuers simultaneously occupy both endpoint vertices x and y. As previously stated 
with edge search, if ever an unoccupied vertex x is incident to a contaminated edge, 
then all clear edges incident to 2 become recontaminated. A node search strategy 
for G is any sequence of node search operations that ends with every edge of G being 
simultaneously clear. 


Dll: The node search number of G, denoted ns(G), is the smallest number of 
pursuers needed to implement any node search strategy. The node search problem 
on G is to determine ns(G), and G is k-node-searchable if ns(G) < k. 


FACTS 


F18:  Kirousis and Papadimitriou [KiPa86] show that recontamination is not useful 
for node search. That is, ns(G) pursuers can always clear G using a node search strategy 
in which no clear edge ever becomes recontaminated. 


F19: Bienstock and Seymour [BiSe91] unify edge search and node search into a more 
general framework called mixed search. The following definitions are adapted from 
[BiSe91]. 


DEFINITIONS 


D12: Mixed search operations are the same as edge search operations: p(x) = 
place a pursuer at vertex x; r(a) = remove a pursuer from vertex x; and s(e,z,y) = 
slide a pursuer along edge e from endpoint x to other endpoint y. 


D13: Initially every edge of G is contaminated. As with edge search, edge e = (x, y) 
becomes clear if a pursuer slides along e from x to y while either (i) another pursuer 
resides at x or (ii) every other edge incident to x is clear. Also, as with node search, 
edge e = (x,y) becomes clear if pursuers simultaneously occupy both endpoint vertices 
x and y. Recontamination may occur the same as with edge search and node search. 
A mized search strategy for G is any sequence of mixed search operations that ends 
with every edge of G being simultaneously clear. 


D14: The mixed search number of G, denoted ms(G), is the smallest number of 
pursuers needed to implement any mixed search strategy. The mixed search problem 
on G is to determine ms(G), and G is k-mixed-searchable if ms(G) < k. 
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FACTS 


F20: _Bienstock and Seymour [BiSe91] show that recontamination is not useful for 
mixed search. Thus ms(G) pursuers can always clear G using a mixed search strategy 
in which no clear edge ever becomes recontaminated. 


F21: For any G, construct G® and G” by replacing each edge of G with two edges in 
series or with two edges in parallel, respectively. Bienstock and Seymour [BiSe91] show 
that es(G) = ms(G*°) and ns(G) = ms(G”), so edge search and node search both reduce 
to mixed search. Therefore the recontamination result for mixed graphs in [BiSe91] 
implies the previous recontamination results for edge search in [La93] and for node 
search in [KiPa86]. 


F22: [KiPa86] and [BiSe91] provide inequalities that show the relationships between 
the edge search, node search, and mixed search numbers. Combining those inequali- 


ties yields that max{es(G), ns(G)} — 1 < ms(G) < min{es(G), ns(G)}, so these three 
parameter values are always within one of each other. 


EXAMPLES 
E10: If Gis a path with at least one edge then es(G)=1, ns(G)=2, and ms(G)=1. 


Ell: If G is a loop with one vertex and one edge then es(G)=2, ns(G)=1, and 
ms(G')=1. Here is a (trivial) node search strategy that requires only 1 pursuer: p(1). 


E12: If G is a cycle with two vertices and two edges then es(G)=ns(G)=ms(G)=2. 


E13: If Gis a cycle with at least three edges then es(G)=2, ns(G)=3, and ms(G)=2. 


E14: IfG isa star with at least three edges then es(G)=ns(G)=ms(G)=2. 


E15: If Gis the graph in Figure 9.5.2(c) then es(G)=3, ns(G)=2, and ms(G)=2. Here 
is a node search strategy that requires only 2 pursuers: p(1), p(2). 


1 2 1 2 

| | 

3 4 3r-7 4775 
| 

5 6 6-7 


(a) (b) (c) 


Figure 9.5.5: Examples for edge search, node search, and mixed search. 


E16: If Gis K,4 then es(G)=ns(G)=ms(G)=4. 
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E17: If G is the graph in Figure 9.5.5(a) then es(G)=5, ns(G)=4, and ms(G)=4. 
Here is a node search strategy that requires only 4 pursuers: p(1), p(3), p(5), p(2), r(2), 
p(4), r(4), p(6). 

E18: If G is the graph in Figure 9.5.5(b) then es(G)=2, ns(G)=3, and ms(G)=2. 
E19: If Gis the graph in Figure 9.5.5(c) then es(G)=3, ns(G)=3, and ms(G)=2. Here 
is a mixed search strategy that requires only 2 pursuers: p(4), p(3), (3,1), r(1), p(5), 


s(5,2), r(2), p(6), s(6,7). 

FACTS 

F23: There exist polynomial-time algorithms for computing ns(G) and ms(G) when 
G is a split graph, an interval graph, a cograph, or a permutation graph. 


F24: There exist polynomial-time algorithms for computing ns(G) when G is a tree, 
an outerplanar graph, or a series-parallel graph. 


REMARKS 


R3: It currently remains unresolved whether or not polynomial-time algorithms exist 
for computing ms(G) when G is a tree, an outerplanar graph, or a series-parallel graph. 


RA4: It currently remains unresolved whether or not polynomial-time algorithms exist 
for computing ns(G) and ms(G) when G is a planar graph. 


FACTS 


F25: Kirousis and Papadimitriou [KiPa86] prove that the decision version of the node 
search problem is W’P-complete for arbitrary graphs G. 


F26: Bienstock and Seymour [BiSe91] prove that the decision version of the mixed 
search problem is NP-complete for arbitrary graphs G. 


F27:  Kirousis and Papadimitriou [KiPa86] show that for every G, ns(G) is exactly 
one plus the vertex separation of G. Subsequently, Kinnersley [Ki92] showed that the 
vertex separation of G always equals the pathwidth of G; hence ns(G) is exactly one plus 
the pathwidth of G. 


TERMINOLOGY NOTE: See Section 2.4 of this Handbook for a definition of pathwidth. 


FACTS 


F28: Suppose the evader is visible, that is, the evader’s position is always known to 
the pursuers. In this situation Seymour and Thomas [SeTh93] show that the fewest 
pursuers needed to implement a node search strategy is exactly one plus the treewidth 
of G. 


F29: Suppose instead that the (invisible) evader is lazy, that is, the evader can only 
move immediately before a pursuer is placed on the vertex where it resides (so that if the 
evader did not move it would be captured). In this situation Dendris et al. [DeKiTh97]| 
show that the fewest pursuers needed to implement a node search strategy is again 
exactly one plus the treewidth of G. 


TERMINOLOGY NOTE: See Section 2.4 of this Handbook for a definition of treewidth. 
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9.5.3. Cops-and-Robbers 


The cops-and-robbers problem differs from the previously considered pursuit-evasion 
problems in several significant ways: both the cops (pursuers) and the robber (evader) 
must reside only at vertices, the cops and robber take alternating turns; everybody’s 
location is visible to everyone else. 


FACT 


F30: Nowakowski and Winkler [NoWi83] and Quilliot [Qu83] each independently 
originate the cops-and-robbers problem. However, each only considers the special case 
when there is only one cop. 


F31: Aigner and Fromme [AiFr84] extend the cops-and-robbers problem to permit 
multiple cops. The following definitions are adapted from [AiFr84]. 


DEFINITIONS 


D15: <A cops-and-robbers game on G proceeds as follows. There are two players, 
C (a team of k cops) and R (a robber). C begins by placing each of the k cops at any 
vertex of G. (C is permitted to place more than one cop at the same location.) Next, 
R places the robber at any vertex. The players continue alternating turns. On C’s 
turns, each cop either remains at its present location or moves to an adjacent vertex 
(so multiple cops may move simultaneously). Similarly, on R’s turns, the robber either 
remains at its present location or moves to an adjacent vertex. Both C and R always 
know the locations of all participants. A cop captures the robber if the cop resides at 
the same vertex as the robber, and in this case player C wins the game. Player C has 
a winning strategy if no matter what choices R makes, player C can eventually win the 
game. Otherwise, if the robber can indefinitely avoid capture no matter what choices 
C makes, then player R wins the game. 


D16: The cop number of G, denoted c(G), is the smallest number of cops k needed 
for player C to win the cops-and-robbers game on G. The cops-and-robbers problem 
on G is to determine c(G), and G is k-cop-winnable if c(G) < k. 


EXAMPLES 


E20: IfG isa tree, then c(G) = 1. Player C’s winning strategy is for the cop to move 
toward the robber along the shortest path that connects them. 


E21: If G is a complete graph (or clique), then c(G) = 1. 
E22: IfG is a cycle with at least four edges, then c(G) = 2. 


E23: If G is a complete bipartite graph K,,, with p > 2 and q > 2, then c(G) = 2. 
Player C initially places one cop on each side of the bipartition. 


E24: If G is the graph shown in Figure 9.5.6(a) (the 3-cube), then c(G) = 2. If 
C places cops at vertices {1, 8} then a robber placed at any vertex can be captured 
immediately. But if only one cop is available, then no matter where it is placed, the 
robber can always escape to a vertex that is not adjacent to the cop’s location. 
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E25: If G is the graph shown in Figure 9.5.6(b) (Petersen’s graph), then c(G) = 3. If 
C places cops at vertices {2, 5, 6} then a robber placed at any vertex can be captured 
immediately. But if only two cops are available, then no matter where they are placed, 
the robber can always escape to a vertex that is not adjacent to either cop’s location. 


E26: IfGisap-by-q grid graph with p > 2 and q > 2, then c(G) = 2. (Figure 9.5.6(c) 
illustrates a 4-by-4 grid graph.) Cop 1 moves toward the row of the robber, but if 
already on the same row, then cop 1 moves toward the column of the robber. Cop 2 
moves toward the column of the robber, but if already on the same column, then cop 2 
moves toward the row of the robber. 


1 2 1,1}-—41,2;1,341,4 
le lls oa 

3 4 2,1}-42,2}2,3-42,4 
le pile, eile, el 

5 6 3,1-43,2-43,3-43,4 
le tl 

7 8 4,1}—4,2; 44,3-—44,4 


(a) (b) (c) 


Figure 9.5.6: Examples for the cops-and-robbers problem. 


DEFINITIONS 


D17: Let N[v] denote the closed neighborhood of vertex v. Then v is a corner vertex 
if there exists some other vertex u with N[v] C N{u]. 


D18: G is dismantlable if there exists a sequence of removing corner vertices 
that ends when only one vertex remains. Such a sequence is called an elamination 
ordering. 


FACTS 
F32: Nowakowski and Winkler [NoWi83] and Quilliot [Qu83] each developed a char- 
acterization of the graphs G with c(G) = 1, as follows: c(G) = 1 if and only if G is 


dismantlable. 


F33: If G is a chordal graph, then c(G) = 1. (A chordal graph always has a simplicial 
vertex uv such that N[v] is a clique, and any such v is a corner vertex.) 


F34: Clarke [C102] shows that if G is an outerplanar graph then c(G) < 2. 
F35: Theis [Th11] shows that if G is a series-parallel graph then c(G) < 2. 


F36: Aigner and Fromme [AiFr84] show that if G is a planar graph then c(G) < 3. 
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F37: Schroeder [ScO1] shows that if G is a toroidal graph (can be embedded in a 
torus) then c(G) < 4. 


F38: Joret et al. [JoKaTh10] show that if G is a treewidth-k graph then c(G) 
=e) t. 


F39: Frankl [Fr87] shows that if G is a d-cube then c(G) = [4+]. 


F40: Aigner and Fromme [AiFr84] show that if G is a graph with girth (length of 
smallest cycle) > 5, then c(G) > the minimum degree of any vertex in G. 


F41: Fomin et al. [FoGoKr08] prove that the cops-and-robbers problem is \VP-hard 
for arbitrary graphs G. 


F42: Goldstein and Reingold [GoRe95] show that if the initial location of each cop 
is specified as part of the problem instance, then this variation of the cops-and-robbers 
problem is EX PTIM E-complete. 


F43: Chung et al. [ChHolIs11] give a pseudo-polynomial-time dynamic programming 
algorithm for solving the cops-and-robbers problem when the number of cops is fixed. 
The algorithm’s running time is O(n?**+?), where n is the number of vertices in G and 
k is the number of cops. 


F44: Llewellyn et al. [Ll1ToTr89] show that it is W“P-complete to determine whether k 
cops can capture an infinitely fast robber if the cops are placed sequentially but cannot 
move once placed. 


REMARKS 


R5: The complexity status of the decision version of the (standard) cops-and-robbers 
problem currently remains unresolved. Is it WP-complete? Is it EX PT IM E-complete? 


R6: Another currently unresolved question is known as Meyniel’s conjecture: Is 
c(G) in O(./n) for connected graphs G? Chiniforooshan [Ch08] shows that c(G) is 
in O(n/logn), which is currently the best known bound. (Here again n denotes the 
number of vertices of G.) 


9.5.4 Additional Variations 


The previous subsections have discussed some of the best-known pursuit-evasion 
problems such as sweeping, edge search, node search, mixed search, and cops-and- 
robbers. The current subsection discusses some of the many possible additional varia- 
tions that can be constructed by increasing or restricting the capabilities of the pursuers 
and/or the evader, and/or by modifying the kind of graph structure through which the 
pursuers and evaders move. 


FACT 


F45: Nowakowski [No93], Dyer [Dy04], Barat [Ba06], Alspach et al [AIDyHaYa07], and 
Yang and Cao [YaCa07-a] [YaCa07-b] examine the sweeping, edge search, node search, 
and mixed search problems when G is a directed graph or multidigraph. Different 
variants occur depending on which participants must obey the specified edge directions. 
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DEFINITIONS 


D19: In directed sweeping, both the pursuers and the evader must obey the specified 
edge directions. 


D20: = In undirected sweeping, both the pursuers and the evader may ignore the 
edge directions (so they can traverse each edge in either direction). 


D21: In weak sweeping, the pursuers must obey the edge directions, but the evader 
may ignore these directions. 


D22: In strong sweeping, the evader must obey the edge directions, but the pursuers 
may ignore these directions. 


FACT 


F46: Let d(G), u(G), w(G), and s(G) denote the minimum number of pursuers needed 
to capture an evader using directed, undirected, weak, or strong sweeping, respectively. 
Dyer [Dy04] shows these inequalities: s(G) < min{d(G), u(G)} and max{d(G), u(G)} 


< w(G). 
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Figure 9.5.7: Directed graph examples. 


EXAMPLES 

E27: If G is a directed path then d(G)=1, u(G)=1, w(G)=1, and s(G)=1. 

E28: If G is the graph in Figure 9.5.7(a) then d(G)=2, u(G)=1, w(G)=2, and s(G)=1. 
E29: If G is the graph in Figure 9.5.7(b) then d(G)=2, u(G)=2, w(G)=2, and s(G)=1. 


E30: If G is the graph in Figure 9.5.7(c) then d(G)=2, u(G)=2, w(G)=2, and s(G)=1. 
For strong sweeping, one pursuer can clear G' by starting at vertex 1 and then traversing 
each edge in a backward direction. 
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E31: If G is the graph in Figure 9.5.7(d) then d(G)=2, u(G)=2, w(G)=2, and s(G)=2. 


E32: If G is the graph in Figure 9.5.7(e) then d(G)=2, u(G)=3, w(G)=3, and s(G)=1. 
For strong sweeping, one pursuer can clear G' by starting at vertex 1 and then traversing 
edges a, b, c each in a backward direction. 


E33: If G is the graph in Figure 9.5.7(f) then d(G)=2, u(G)=3, w(G)=3, and s(G)=2. 


E34: If G is the graph in Figure 9.5.7(g) then d(G)=3, u(G)=3, w(G)=4, and s(G)=1. 


FACTS 


F47: Dyer [Dy04] shows that s(G) = 1 if and only if each strongly connected compo- 
nent of G is either a single vertex or a cycle or a subdivision (homeomorphism) of one 
of the graphs shown in Figure 9.5.7(c) or (e). 


F48: Gottlob et al. [GoLeSc03] examine pursuit-evasion when G in a hypergraph. 
This variation is known as robber-and-marshals. The robber (evader) resides in a vertex, 
and each marshal (pursuer) resides in a hyperedge. Capture occurs when any marshal 
occupies a hyperedge that is incident to the robber’s vertex. 


F49:  Barriere et al. [BaFlFrSa02], Kolling and Carpin [KoCa08] [KoCal0], Daniel et 
al. [DaBoKoTo10], and Borie et al. [BoToKol11] consider pursuit-evasion on graphs in 
which each vertex and each edge has a specified width. These vertex and edge widths 
represent the number of pursuers needed to guard or clear each vertex or edge. The 
latter two papers also consider graphs where each edge may have a specified length. 


F50: Kolling and Carpin [KoCa08] [KoCa10] define a variation of node search known 
as the Graph-Clear problem, and also present a polynomial-time algorithm for Graph- 
Clear on trees. 


F51:  Fomin and Golovach [FoGo00] [FoHeTe05], Daniel et al. [DaBoKoTo10], and 
Borie et al. [BoToKol1] consider variations of pursuit-evasion with different objectives 
such as minimizing the elapsed time until capture, or the total distance travelled by all 
the pursuers, or the sum of the times that each pursuer is present in the graph. 


F52: Daniel et al. [DaBoKoTo10] present a pseudo-polynomial-time heuristic algo- 
rithm called ESP for edge search on series-parallel graphs. 


F53: Borie et al. [BoToKol11] develop polynomial-time and pseudo-polynomial-time 
algorithms, and also ’“P-completeness and strong ’P-completeness results, for several 
variations of sweeping on an assortment of graph classes. 


EXAMPLES 


Each vertex and edge in the graphs of Figure 9.5.8 is labeled with both its name and 
its width. Each edge in these graphs has length 1, and each pursuer travels at speed 1. 


E35: The graph in Figure 9.5.8(a) can be cleared with 2 pursuers: Both pursuers 
start at vertex u and depart u simultaneously. In parallel, one pursuer clears edge a, 
and one pursuer clears edge b. Finally both pursuers arrive at v simultaneously. The 
total distance travelled is 2, and the elapsed time is 1. 
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Figure 9.5.8: Examples for sweeping with specified vertex and edge widths. 


E36: The graph in Figure 9.5.8(b) can be cleared with 3 pursuers: All pursuers start 
at vertex u. One pursuer guards u while the other two pursuers depart from u, clear 
edge a, and arrive at v. One pursuer guards v while the other pursuer departs from 
v, clears b, and arrives at u. Finally two pursuers depart from u, clear c, and arrive 
at v. The total distance travelled is 5, and the elapsed time is 3. [Alternatively, if 4 
pursuers are available, this graph can be cleared in elapsed time 2. Or, if 5 pursuers are 
available, the graph can be cleared in elapsed time 1.] 


E37: The graph in Figure 9.5.8(c) can be cleared with 3 pursuers: All pursuers start 
at vertex u and depart u simultaneously; two pursuers clear edge a, while the third 
pursuer clears edge b. When the first two pursuers arrive at v, one guards v while the 
other travels through the graph toward w. Two pursuers arrive at w simultaneously and 
clear w. Next these two pursuers depart w simultaneously; one clears d while the other 
travels through the graph toward v. When two pursuers reside at v, they both depart 
v and clear c. Finally all three pursuers arrive at x simultaneously. The total distance 
travelled is 10, and the elapsed time is 6. [Alternatively, if 4 pursuers are available, this 
graph can be cleared with total distance 6 and elapsed time 2, as follows: In parallel, 
two pursuers travel from u to x along edges a and c, while another pursuer travels from 
u to x along edges b and d. The fourth pursuer remains stationary at w.] 


E38: The graph in Figure 9.5.8(d) can be cleared with 10 pursuers: Initially, 2 
pursuers reside at vertex u, 3 pursuers reside at vertex v, and 5 pursuers reside at 
vertex x. In parallel, these 10 pursuers clear edges a, b, and d, respectively, and all 
pursuers arrive at vertex z simultaneously. Next, again in parallel, 4 pursuers clear edge 
c, while the other 6 pursuers clear edge e. The total distance travelled is 20, and the 
elapsed time is 2. [Alternatively, if 20 pursuers are available, this graph can be cleared 
in elapsed time 1.] 


E39: The graph in Figure 9.5.8(e) can be cleared with 11 pursuers: All pursuers 
start at vertex u, and one pursuer remains stationary to guard u. In parallel, 2 pursuers 
clear edge a, 3 pursuers clear edge b, and 5 pursuers clear edge d. These 10 pursuers 
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arrive at vertex v simultaneously. Next, again in parallel, 4 pursuers clear edge c, while 
6 pursuers clear edge e. The total distance travelled is 20, and the elapsed time is 2. 
(Alternatively, if 20 pursuers are available, this graph can be cleared in elapsed time 1.] 


E40: The graph in Figure 9.5.8(f) can be cleared with 25 pursuers: 3 pursuers start 
at vertex u, 18 pursuers start at vertex x, and 4 pursuers start in the center of edge d. 
These last 4 pursuers clear edge d; 2 pursuers travel toward u, and 2 pursuers travel 
toward x. When they arrive, the 5 pursuers at u clear u, and the 20 pursuers at x clear 
x. Next, 5 pursuers depart from u; 2 pursuers clear edge a, and 3 pursuers clear edge 
b. Simultaneously, 20 pursuers depart from x; 8 pursuers clear edge e, and 12 pursuers 
clear edge f. When they arrive, the 10 pursuers at v clear v, and the 15 pursuers at w 
clear w. Finally, in parallel, 6 pursuers depart v along edge c, and 6 pursuers depart 
w along c. Eventually these 12 pursuers will meet in the middle of edge c. The total 
distance travelled is 35, and the elapsed time is 2. 


FACTS 


F54: Sugihara and Suzuki [SuSu89], Dawes [Da92], Neufeld [Ne96], Tanaka [Ta96], 
Dumitrescu et al. [DuKoSuZy08], and Munteanu and Borie [MuBol10] study sweeping 
on p-by-q grid graphs such that the evader becomes visible to any pursuer that occupies 
the same row or column, the pursuers can communicate information such as the evader’s 
position, and the ratio of the speeds of the evader and pursuers is fixed or bounded. 


F55: Hahn and MacGillivray [HaMa06], [Ha07] describe an exponential-time algo- 
rithm for solving the cops-and-robbers problem on directed graphs when the number of 
cops is fixed. 


F56: Goldstein and Reingold [GoRe95] show that the cops-and-robbers problem is 
EX PTIM E-complete for directed graphs. 


F57: The literature considers many additional variants of pursuit-evasion, some of 
which we briefly mention here: 


e The pursuers and/or evader must begin at specified locations within the graph. 
e A pursuer can see and/or capture the evader if within distance at most e. 

e Pursuers are non-uniform (different speeds, visibility, and/or capture capabilities). 
e Evaders can capture/destroy pursuers. 


e Evaders move randomly rather than adversarially, for example, via a Markovian 
random walk. 


e The pursuers’ strategy is randomized rather than deterministic. 
p Sy 


e The pursuers’ strategy yields capture with high probability rather than guaranteed 
capture. 


e The pursuers’ strategy performs well in average-case rather than worst-case. 


e The algorithm that produces the pursuers’ strategy is approximate or heuris- 
tic/experimental rather than optimal. 
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e The size and/or structure of the graph is not known in advance. 


e The environment is a geometric space rather than a graph, for example, a circle, 
convex polygon, non-convex polygon, or any other 2D or 3D region (bounded or 
unbounded, possibly containing holes or obstacles). 
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Glossary for Chapter 9 


additive bandwidth B*(G) — of graph G: 
Bt(G) = min {B}(G) | f is proper numbering of c} 
additive bandwidth BF (G) of proper numbering f — of graph G: 
Bt (G) = maf |f(u) + f(v) — (n+ 1)| | w € B(G)} 


almost all graphs — property P holds for: if the ratio of the number of n-vertex 
graphs possessing P divided by the number of n-vertex graphs approaches one as n 
approaches infinity. 

Archimedean ¢ if limz-,.. (a,c) = co for every fixed c > 0 

__, dual if lim,_,o ¢(a,c) = 0 for every fixed c > 0 

Archimedean function: a function ¢ such that for every c > 0, limz-4. 6(x, c) = co 
and limy-400 o(c, 2) = co. 

Archimedean graph: if G is a ¢-tolerance graph for all Archimedean functions ¢. 

asteroidal triple: three vertices in a graph such that, for any two of them, there is a 
path containing those two but no neighor of the third. 

automorphism group: the set of all automorphisms of a graph under the operation 
of composition. 

automorphism: an isomorphism between a graph and itself. 

bandsize, — of proper numbering f ina graph G: the number of distinct edge differences 
produced by f; denoted bs; (G). 

bandsize, — of graph G: bs(G) = min{bsf(G) : f is proper numbering of G}. 

bandwidth, B;(G) — of proper numbering f in a graph G: 


B,(G) = max{|f(u) — f(v)| | w € E(G)} 


bandwidth, B(G) — of graph G: B(G) = min{B,(G) : f is aproper numbering of G}. 
bandwidth decision problem: the problem which has answer “YES” if B(G) < Kk 
and “NO” otherwise when presented with given graph G and positive integer K. 
bandwidth numbering — of graph G: proper numbering f such that B(G) = B,(G). 
bandwidth-k decision problem: for fixed integer k, the problem which has answer 

“YES” if B(G) < k and “NO” otherwise when presented with given graph G. 
bounded bitolerance graph: a graph that is the incomparability graph of a bounded 
bitolerance order. 
bounded bitolerance order: an ordered set P = (V,~) if each vertex v € V can 
be assigned a closed interval [, = [L,,R,] and two “tolerant points” py,dy € Iv 
satisfying L,, <p, and q, < R,, such that « < y <=> R, < py and gz < Ly. 
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cartesian product G UO H or Gx H — of graphs G and H: a graph where V(G x H) = 
V(G) x V(#) and (91, h1)(g2, he) € E(G O A) if and only if either (i) g1 = g2 and 
hyha € E(A) or (ii) hy = ho and gig € E(G). 
center — of a connected graph: the subgraph induced by the central vertices. 
__, k-: the subgraph induced by the k-central vertices. 
central vertex — in a connected graph: a vertex whose eccentricity equals the radius 
of the graph. 
__, k-: a vertex whose k-eccentricity is the k-radius. 
chordal graph: a graph containing no chordless cycle of size greater than or equal 
to 4. 
chromatic number y(G) — of graph G: the smallest number & such that there is a 
function f : V(G) > {1,2,...,k} with the property that, if wv is an edge, then 
f(u) # flv). 
closed interval, — between two vertices: a set consisting of these two vertices and all 
vertices lying in some geodesic between them. 
closed intervaly — of a set: the union of all closed intervals between every pair of 
vertices in the set. 
closed neighborhood, — of a vertex v: N(v)U {v}, where N(v) is the open neighbor- 
hood of v; denoted Nv]. 
closed neighborhood), — of a set S: U,egN[v], where N[v] is the closed neighborhood 
of v; denoted N[S]. 
competition graph: an intersection graph of the family of outsets of the vertices in 
some digraph. 
, p-: a p-intersection graph of the family of outsets of the vertices of some digraph. 
__, @-tolerance: a graph that is the ¢-tolerance intersection graph of the family of 
out-sets of the vertices of some digraph. 
complement — of graph G = (V,E): a graph G with V(G) = V and vertices u and v 
are adjacent in G if and only if they are not adjacent in G. 
complement Threshold Tolerance graph: if for each vertex v there exist two pos- 
itive numbers a, and b, such that for any pair of vertices x and y: rye E ==> 
Gy < by and ay < by. 
complementary numbering f, — of proper numbering f: f.(v) = +1-— f(v) for 
each vertex v of G. 
complete k-ary tree T),,, of depth d: a rooted tree in which all vertices at level d—1 
or less have exactly & children, and all vertices at level d are leaves. 
complete k-partite graph Ky, .n,,....n,: @ graph whose vertex-set is partitioned into 
sets A; of n; vertices, 1 <i < k, with two vertices adjacent if and only if they are in 
distinct sets. 
composition G(H) — of graphs G and H: a graph where V(G(H)) = V(G) x V(#) 
and (gi, h1)(g2,he) € E(G(A)) if and only if either (i) gig2 € E(G) or (ii) g1 = go 
and hyh2 € E(#). 
containment graph — of a collection {.5;} of distinct subsets of a set S: a graph with 
vertex set {1,...,n} and edge set {77 | either 5; C S; or S; C Sj}. 
contraction G|. — of a graph G along edge e: a merger where e = wv. 
convex hull — of a set: the smallest convex set containing a given set. 
convex set: a set of vertices in a graph whose closed interval is itself. 
cop number c(G): the smallest number of cops k needed for player C to win the 
cops-and-robbers game on G. 
cops-and-robbers game: a two-player game with players C and R, as follows. First 
player C places k cops at any vertices of G. Next player R places a robber at 
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any vertex. The players continue alternating turns. On C’s turns, each cop either 
remains at its present location or moves to an adjacent vertex. On R’s turns, the 
robber either remains at its present location or moves to an adjacent vertex. Both 
C and R always know the locations of all participants. Player C wins the game if 
any cop ever resides at the same vertex as the robber. 

corona — of two graphs G, and G2: the graph formed from one copy of G, and |V(G,)| 
copies of Gy where the ith vertex of G, is adjacent to every vertex in the ith copy 
of Go. 

cutwidth, — of proper numbering f in a graph G: 


ce(G) = max; |{vw € E(G): f(v) <i< f(w)}I. 


cutwidth, — of a graph G: cutw(G) = min{cs(G) : f is a proper numbering of G}. 
cyclic bandwidth, B.;(G) — of proper numbering f in a graph G: 


Bes (G) = max{||f(u) — f(v)||- : wv € E}, where ||z||. = min{|z|,n — |a|}. 
cyclic bandwidthz B.(G) — of graph G: 
B.(G) = min{B.,(G) : f is proper numbering of G}. 


daisy: a graph that can be constructed from k > 2 disjoint cycles by identifying a set 
of k vertices, one from each cycle, into one vertex. 

decomposable graph: a graph G whose vertex-set can be partitioned into 7(G) sub- 
sets C1,C2,...,Cy(q) such that each of the induced subgraphs G[C;] is a complete 
subgraph of G. 

degree sequence — of a graph G: a listing of the degrees of the n vertices of G, usually 
in monotonic order. 

diameter — of a graph G: the maximum distance between any two vertices of G; 
denoted diam(G). 

__, k-: the maximum k-eccentricity among the vertices in a graph. 

directed distance from wu to v: the length of a shortest directed u— v path. 

distance, — between two vertices: the length of a shortest path between these two 
vertices. 

distancez — between two subgraphs: the minimum distance between a vertex in one 
subgraph and a vertex in the other subgraph. 

dominating function — of a graph G: a function f : V > {0,1} such that for each 
veV, ue No] f(u) 2 1. 

dominating set — of a graph G: a set S C V such that every vertex in V is either in 
S or adjacent to a vertex in S. 

, clique — of a graph G: a dominating set S' of G such that G[S] is a clique. 

, connected — of a graph G: a dominating set S of G such that G[S] is connected. 

, cycle — of a graph G: a dominating set S of G such that G[S] has a hamiltonian 

cycle. 


, minimal: a dominating set that contains no dominating set as a proper subset. 


, paired — of a graph G: a dominating set S' of G such that the induced subgraph 
G[S] has a perfect matching. 


__, total — of a graph G: a set S C V such that every vertex of V is adjacent to a 
vertex of S. 
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domination critical graph: a graph G with the property that adding an arbitrary 
edge not in G results in a graph with domination number less than the domination 
number of G. 
__, k-y: a domination critical graph with domination number k. 
domination number — of a graph G: the minimum cardinality of a dominating set 
of G. 
__, clique — of a graph G: the minimum cardinality of a clique dominating set of G. 
___, connected — of a graph G: the minimum cardinality of a connected dominating 
set of G. 
__, cycle — of a graph G: the minimum cardinality of a cycle dominating set of G. 
___, independent — of a graph G: the minimum cardinality of a maximal independent 
set. (A maximal independent set must be a dominating set.) 
__, paired — of a graph G: the minimum cardinality of a paired dominating set of G. 
__, total — of a graph G: the minimum cardinality of a total dominating set of G. 
__, upper — of a graph G: the maximum cardinality of a minimal dominating set 
of G. 
domination perfect graph: a graph G having the property that y(H) = i(#) for 
every induced subgraph H of G. 
dual Archimedean ¢ if lim,_,9 ¢(x,c) = 0 for every fixed c > 0. 
eccentricity — of a vertex v: the distance from vertex v to a vertex farthest from it. 
__, k-: the maximum Steiner distance among all k-element sets of vertices containing 
the vertex. 
edge-bandwidth, ~ of edge-numbering f of a graph G: By(G) = max{|f(e1) — f(e2)| : 
edges e; and e adjacent in G}. 
edge-bandwidth, — of a graph G: B'(G) = min{B;(G) : f an edge numbering of G}. 
edge chromatic number \i(G) — of a graph G: the smallest number k such that 
there is a function f : E(G) > {1,2,...,k} with the property that, if edges e, and 
€ share a common vertex, then f(e,) 4 f(e2). 
edge clique cover — of a graph G: a family € = {C,...,C,} of complete subgraphs 
of G such that every edge of G is in at least one of E(C)),..., E(C,). (Elsewhere, 
a clique is required to be a maximal complete subgraph.) 
__,p-—ofagraph G: a family {V,,..., Vin} of not necessarily distinct subsets of V(G) 
such that, for every set {i1,...,7p} of p distinct subscripts, T = V;,1--- Vi, 
induces a complete subgraph of G, and such that the collection of sets of the form 
T is an edge clique cover of G. 
, o-T-edge - of a graph G: a family {Vi,...,Vn} such that vjv; € E(G) if and 
only if at least o(t;,t;) of the sets V, contain both v; and vj. 
edge cover — of a graph G: a set of edges covering all the vertices of G. 
edge-numbering f — of a graph G: a bijection f : E(G) > {1,2,...,|E(G)]}. 
edge search number es(G): the smallest number of pursuers needed to implement 
an edge search strategy on G. 
edge search operation: any of p(x) = place a pursuer at vertex x, r(x) = remove a 
pursuer from vertex x, or s(e,x,y) = slide a pursuer along edge e from endpoint x 
to other endpoint y. 
edge search strategy: any sequence of edge search operations that ends with every 
edge of G being simultaneously clear, as follows. Initially every edge of G is con- 
taminated. An edge e = (x,y) becomes clear if a pursuer slides along e from x to y 
while either (i) another pursuer resides at x or (ii) every other edge incident to x is 
clear. If ever an unoccupied vertex x is incident to a contaminated edge, then any 
clear edges incident to x immediately become recontaminated. 
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edgesum, s/(G) — generated by proper numbering f of a graph G: 
s(@= S\ [fu - FO). 
uvE E(G) 

edgesum, s(G) — of a graph G: s(G) = min{s;(G) : f is a proper numbering of G}. 

geodesic: a shortest path between two vertices. 

geodetic number: the minimum cardinality of a geodetic set in a graph. 

geodetic set: a set of vertices of a graph whose closed interval is the vertex-set of the 
graph. 

girth — of a graph G: the size of a smallest induced cycle of G. 

hull number — of a graph G: the minimum cardinality of a hull set in G. 

hull set: a set of vertices of a graph whose convex hull is the vertex-set of the graph. 

hypercube — n-dimensional: the graph having 2” vertices, each labeled with a distinct 
n-digit binary sequence, and two vertices adjacent if and only if their labels differ in 
exactly one position. 

independent set — of vertices: a set of vertices in which no two vertices are adjacent. 

intersection graph-— of a collection C = {S1,...,5,} of sets: a graph G obtained by 
assigning a distinct vertex v; to each set S; in C and joining two vertices by an edge 
precisely when their corresponding sets have a nonempty intersection. 

interval graph: an intersection graph of a family of intervals of the real line. 

irredundance number ~— of a graph G: the minimum cardinality of a maximal irre- 
dundant set of G. 

__, upper — of a graph G: the maximum cardinality of an irredundant set of G. 

irredundant set — of vertices: a set S of vertices such that for every vertex v € S, 
N{v] — N[S — {v}] 40. 

key: the graph obtained by joining with an edge a vertex in C;, to an end-vertex of 
P,; denoted Ly, n- 

line graph L(G) — of a graph G: a graph with vertex-set equal to the edges of G and 
two vertices are adjacent if and only if the corresponding edges are adjacent in G. 

link edge: an edge joining two link vertices. 

link vertex, — in a type (a) unit: two non-adjacent vertices in that unit. 

link vertex — in a type (b) unit: the vertex of degree 1 in that unit. 

3 -graph: a graph with m edges that satisfies the three conditions: (i) 6(G) > 2, (ii) 
G is connected, and (iii) y¥(G) > m/3. 

median vertex: a vertex whose total distance is minimum among the vertices in a 
graph. 

median subgraph: the subgraph induced by the median vertices. 

merger G,,,, — of vertices u and v of a graph G: a graph obtained from G by identifying 
u and v and then eliminating any loops and duplicate edges. 

2-minimal graph: an n-vertex graph that is edge-minimal with respect to satisfying 
the three conditions: (i) 6(G) > 2, (ii) G is connected, and (iii) y(G) > 2n/5. 

minimum hull subgraph: an induced subgraph whose vertex set is a hull set of 
minimum cardinality for some graph. 

mixed search number ms(G): the smallest number of pursuers needed to implement 
a mixed search strategy on G. 

mixed search operation: same as edge search operation. 

mixed search strategy: any sequence of mixed search operations that ends with 
every edge of G being simultaneously clear, as follows. Initially every edge of G is 
contaminated. As with edge search, edge e = (x,y) becomes clear if a pursuer slides 
along e from x to y while either (i) another pursuer resides at x or (ii) every other 
edge incident to x is clear. Also, as with node search, edge e = (a, y) becomes clear 
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if pursuers simultaneously occupy both endpoint vertices x and y. Recontamination 
may occur the same as with edge search and node search. 

node search number ns(G): the smallest number of pursuers needed to implement 
a node search strategy on G. 

node search operation: either p(2) = place a pursuer at vertex x, or r(”) = remove 
a pursuer from vertex 2. 

node search strategy: any sequence of node search operations that ends with ev- 
ery edge of G being simultaneously clear, as follows. Initially every edge of G is 
contaminated. An edge e = (a, y) becomes clear if pursuers simultaneously occupy 
both endpoint vertices x and y. If ever an unoccupied vertex x is incident to a 
contaminated edge, then all clear edges incident to x become recontaminated. 

NP-complete — problem: a problem having a “YES” or “NO” answer that can be 
solved nondeterministically in polynomial time, and all other such problems can be 
transformed into it in a polynomial time. 

open neighborhood, — of a vertex v: the set of vertices that are adjacent to v; denoted 
N(v). 

open neighborhood, — of a set S: Uses N(v), where N(v) is the open neighborhood 
of vertex v; denoted N(S). 

out-set — of a vertex v in a digraph: the set of vertices x such that (v, a) is an arc in 
the digraph. 

packing — in a graph G: a set S of vertices such that each pair of vertices in S are at 
a distance at least 3 apart in G. 

packing number - of a graph G: the maximum cardinality of a packing in G. 


perfect elimination ordering — of a graph G: an ordering (v1,...,Un) of all the 
vertices of G such that, for each i € {1,...,n}, vu; is a simplicial vertex of the 
subgraph induced on the vertex subset {v;, vi41,---,Un}- 


peripheral vertex: a vertex in a connected graph whose eccentricity equals the diam- 
eter of the graph. 

periphery: the subgraph induced by the peripheral vertices. 

polynomial algorithm: an algorithm whose execution time is bounded by a poly- 
nomial in some parameter of the problem, often the number of vertices for graph 
problems. 

power — k‘” of a graph G: the graph having the same vertex-set as G and an edge 
between two vertices if and only if the distance between them is at most k in G. 

profile; P;(G') — of proper numbering f in a graph G: Pr(G) = DO ,cy we (v)- 

profile. P(G) — of a graph G: P(G) = min{P;(G) : f is a proper numbering of G}. 

profile width wy(v) — for vertex v € V(G) and proper numbering f of a graph G: 
we(v) = maxzeny|(f(v) — f(x)) where N[v] is the closed neighborhood of v. 

proper numbering — of a graph G: a bijection f : V(G) > {1,2,...,|V(G)]}. 

proper tolerance graph: a graph having a tolerance representation in which no 
interval properly contains another interval. 

pursuit-evasion: a kind of problem in which a team of mobile pursuers attempts to 
capture one or more mobile evaders within a graph or other environment. 

radius rad(G) — of a graph G: the smallest number r such that there is a vertex u of 
G with distance at most r from every other vertex of G; equivalently, the minimum 
eccentricity among the vertices of a connected graph. 

__, k-: the minimum k-eccentricity among the vertices in a graph. 

refinement — of a graph G: a graph obtained from G' by a finite number of subdivisions. 

self-centered graph: a graph whose center is itself. 

simplicial vertex: a vertex whose neighbors induce a complete subgraph. 
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split graph: a graph whose vertex set can be partitioned into a clique and an indepen- 
dent set. 

Steiner distance — of a set of vertices in a graph: the minimum size of a connected 
subgraph containing the set. 

strong diameter — of a strong digraph: the maximum strong eccentricity among the 
vertices of the strong digraph. 

strong eccentricity — of a vertex in a strong digraph: the greatest strong distance 
from the vertex to a vertex in the strong digraph. 

strong radius — of a strong digraph: the minimum strong eccentricity among the 
vertices of the strong digraph. 

strong center — of a strong digraph: the subdigraph induced by the strong central 
vertices. 

strong central vertex — in a strong digraph: a vertex whose strong eccentricity is the 
strong radius. 

strong digraph: synonym for strongly connected digraph. 

strong (Steiner) distance — between two vertices in a strong digraph: the minimum 
size of a strong subdigraph containing these two vertices. 

strong product G(Sp)H) — of graphs G and H: a graph with V(G(Sp)H) = V(G) x 
V(A) and (91,h1)(g2,he) € E(G(Sp)H) if and only if either gjgo € E(G) and 
hyhe € E(#) Or 91 = G2 and hyho € E(H) or hy _ ho and 9192 € E(G). 

strongly connected digraph: a digraph containing both a directed u—v path and a 
directed v — u path for every pair u,v of vertices in the digraph. 

strongly self-centered digraph: a strong digraph whose strong center is itself. 

subdivision — of edge wv: a graph obtained by replacing uv with path (u,w,v) where 
w is a new vertex of degree two. 

subtree graph: an intersection graph of a familiy of subtrees of a tree. 

sum G+ H - of graphs G and H: a graph with V(G + H) = V(G) UV(A) and 
E(G+ H) = E(G)UE(A)U {uv: ue V(G) and v € V(A)}. 

sweep number sw(G): the smallest k such that a sweep strategy P = (P\,..., Px) 
exists for G. 

sweep strategy: a tuple P = (P,,...,P,) of k functions specified as follows. Embed 
G in 3D space such that each vertex resides at a distinct location and no two edges 
intersect except at a common endpoint. Let & denote the number of pursuers, and 
let P = (P,,..., Px) where each P; : [0,00) + G is a continuous function. Then P is 
a sweep strategy for G if for every continuous function E : [0,00) + G, there exists 
some pursuer j € {1,...,4} and time ¢ such that P;(t) = E(t). 

tensor product G(T'p)H) — of graphs G and H: a graph with V(G(Tp)H) = V(G) x 
V(A) and (g1,h1)(g2,h2e) € E(G(Tp)H) if and only if gigg € E(G) and hihe € 

thickness thick(G) — of a graph G: a smallest number of planar subgraphs of G whose 
union is G. 

Threshold Tolerance graph: G = (V,£) if its vertices can be assigned positive 
weights {w, | v € V} and positive tolerances {t, | vu © V} such that rye E <=> 
Wy + wy > min{tz, ty}. 

__, complement: if for each vertex vu there exists two positive numbers a, and b, 
such that for any pair of vertices x and y: ry € HB = > ag < by and ay < by. 
g-tolerance chain graph: a ¢-tolerance graph which has a representation T consisting 
of a nested family of intervals. 

tolerance graph: G = (V,£) if there exists a collection ZT = {Iy}yey of closed 

intervals on the real line and an assignment of positive numbers t = {t,}ycy such 
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that uw € E ==> |I,NIy| > min{t,, tw}. Here |L,| denotes the length of the interval 
I,,. The positive number t, is called the tolerance of v, and the pair (Z,t) is called 
an interval tolerance representation of G. 

__, bounded: a tolerance graph G = (V, £) that has a tolerance representation (Z, t) 
in which t, < |J,| for allu EV. 

__, @: a tolerance graph G = (V,£) such that for a symmetric binary function ¢, 
positive valued on positive arguments, there is an interval representation ZT = 
{Iu}vev with positive tolerances t = {ty}vey such that vw € BE ==> |, A Iy| > 
O{ty, tw}. 

___, proper: a tolerance graph having a tolerance representation in which no interval 
properly contains another interval. 

___, unit: a tolerance graph such that it has a tolerance representation in which each 
interval [,, has unit length for each vertex v. 

topological bandwidth B*(G) — of graph G: 


B*(G) = min{ B(G’) : G’ is a refinement of G}. 


total distance of a vertex: the sum of the distances from the vertex to all other 
vertices. 

transitive orientation — of a graph G: an assignment of directions to the edges such 
that the resulting binary relation is transitive. 

trapezoid graph: the intersection graph of a set of trapezoids whose upper and lower 
sides are on a given pair of horizontal lines. 

type-(a) unit: a graph that is isomorphic to a cycle Cs. 

type-(b) unit: a graph that is isomorphic to a key Ly). 

unit tolerance graph: if it has a tolerance representation in which each interval I, 
has unit length for each vertex v. 

vertex arboricity — of a graph G: the minimum number of subsets into which V(G) 
can be partitioned such that the vertices of each subset induce an acyclic subgraph. 

vertex cover number - of a graph G: the smallest cardinality of a set of vertices such 
that every edge is incident to at least one of the vertices in the set. 

vertex independence number - of a graph G: the largest cardinality of a set of 
vertices which induces a graph with no edges. 

vertex independence number - of a graph G: the maximum cardinality of a maximal 
independent set of G. 

weakly chordal graph: if neither the graph nor its complement contains a chordless 
cycle of size greater than 4. 
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INTRODUCTION 


A search of a graph is a methodical exploration of all the vertices and edges. It 
must run in “linear time,” i.e., in one pass (or a small number of passes) over the graph. 
Even with this restriction, a surprisingly large number of fundamental graph properties 
can be tested and identified. 

This section examines the two most important search methods. Breadth-first search 
gives an efficient way to compute distances. Depth-first search is useful for checking 
many basic connectivity properties, for checking planarity, and also for data flow analysis 
for compilers. A treatment of at least some aspects of both these methods can be found 
in almost any algorithms text (some recent ones are [BrBr96, CLRS01, GoTa02, HSR98, 
Se02, We99]). 

All the algorithms of this section (except for §10.1.7) run in linear time or very close 
to it. Since it takes linear time just to read the graph, the algorithms are essentially as 
efficient as possible (they are “asymptotically optimal” ). 


NOTATION: Throughout this chapter, the number of vertices and edges of a graph G = 
(V, £) are denoted n and m, respectively. Time bounds for algorithms are given using 
asymptotic notation, e.g., O(n) denotes a quantity that, for sufficiently large values of 
n, is at most cn, for some constant c that is independent of n. 


Convention: In all algorithms, we assume that the graph G is given as an adjacency 
list representation. If G is undirected, this means that each vertex has a list of all its 
neighbors. The list can be sequentially allocated or linked. If G is directed, then each 
vertex has a list of all its out-neighbors. 
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10.1.1 Breadth-First Search 


The breadth-first search method (abbr. bfs) finds shortest paths from a given 
vertex of a graph to other vertices. It generalizes to Dijkstra’s algorithm, which allows 
numerical (nonnegative) edge-lengths. Throughout this section, the given graph G can 
be directed or undirected. 


DEFINITIONS 


D1: A length function on a graph specifies the numerical length of each edge. Each 
edge is assumed to have length one, unless there is an explicitly declared length function. 


D2: The distance from vertex u to vertex v in a graph, denoted d(u, v), is the length 
of a shortest path from u to v. 


D3: The diameter of a graph is the maximum value of d(u, v) for u F v. 


D4: A shortest-path tree T from a vertex s is a tree, rooted at s, that contains 
all the vertices that are reachable from s. The path in T from s to any vertex x is a 
shortest path in G, ie., it has length d(s, x). 


EXAMPLE 
E1: Figure 10.1.1 gives a shortest path tree from vertex s. 
a 
s e s 
b g awe b Cc e 
c d d fg 


Figure 10.1.1: Undirected graph and shortest path tree. 


E2: The small-world phenomenon [Mi67, K100] occurs when relatively sparse graphs 
have low diameter. Studies have shown that the graphs of movie actors, neural connec- 
tions in the c.~elegans worm, and the electric power grid of the western United States 
all exhibit the small-world phenomenon. The world-wide web is believed to have this 
structure too. 


E3: For several decades, mathematicians have computed their Erdés number as their 
distance from the prolific mathematician Paul Erdés, in the graph where an edge joins 
two mathematicians who have coauthored a paper. 


E4: The premise of the 6 Degrees of Kevin Bacon game is that the graph whose 
vertices are movie actors and whose edges join two actors appearing in the same movie 
has diameter at most 6. 


E5: In computer and communications networks, a message is typically broadcast from 
one site s to all others by passing it down a shortest path tree from s. 


E6: To solve a puzzle like Sam Lloyd’s “15 puzzle” [St96], we can represent each 
position by a vertex. A directed edge (i, j) exists if we can legally move from i to 7. We 
seek a shortest path from the initial position to a winning position. 
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Ordered Trees 


DEFINITIONS 


D5: Ina rooted tree, a vertex x is an ancestor of a vertex y, and y is a descendant of 
x, if there is a path from x to y whose edges all go from parent to child. By convention 
x is an ancestor and descendant of itself (e.g., in the tree of Figure 10.1.1 vertex e has 
3 descendants). 


D6: Vertex x is a proper ancestor (descendant) of vertex y if it is an ancestor 
(descendant) and a F y. 


D7: An ordered tree is a rooted tree in which the children of each vertex are linearly 
ordered. In a plane drawing of such a tree, left-to-right order gives the order of the 
children. (The leftmost child is first.) 


D8: Vertex x is to the left of vertex y if some vertex has children c and d, with c to 
the left of d, c an ancestor of x and d an ancestor of y. 


D9: Ina graph G, a breadth-first tree T from a vertex s contains the vertices that 
are reachable from s. It is an ordered tree, rooted at s. If x is a vertex at depth 6 in 
the tree T’, then the children of « in T are the vertices of G that are adjacent in G to 
x, but not adjacent (in G) to any vertex in T at depth less than 6, or to any vertex at 
depth 6 in T that is at the left of x. 


FACTS 
F1: Any breadth-first tree is a shortest-path tree. 
F2: A high level bfs algorithm is given below as Algorithm 10.1.1. It constructs a 


breadth-first tree. It starts from s, finds the vertices at distance 1 from s, then the 
vertices at distance 2, etc. 


Algorithm 10.1.1: Breadth-first Search 


Input: — directed or undirected graph G = (V, E), vertex s. 
Output: breadth-first tree T from s. 
V; = {all vertices at distance i from s} 
Vo = {5} 
make s the root of T 
i=0 
while V; 4 @ do construct Vj.1 
Vie =O 
for each vertex v € V; do 
“scan” vu 
for each edge (v, w) do 
ifw ¢ U, Vj then 
make w the next child of v in T 
add w to Vj41 


a=1+1 
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F3: The high-level algorithm can be implemented to run in total time O(n +m). The 
main data structure is a queue of vertices that have been added to T,, but whose children 
in T have not been computed. 


F4: In general we verify that an algorithm takes time O(n + m) by checking that it 
spends constant time (i.e., O(1) time) on each vertex and edge of G. 


F5: Not every shortest path tree is a breadth-first tree (e.g., the tree of Figure 10.1.1). 
This does not cause any problems in applications. 


F6: The diameter can be found by doing a breadth-first search from each vertex. 


F7: Dijkstra’s algorithm computes a shortest path tree from s in a graph with a 
nonnegative length function. It generalizes breadth-first search. Like bfs it finds the set 
Vz of all vertices at distance d from s, for increasing values of d. An appropriate data 
structure implements the algorithm in time O(m + nlogn) [FrTa87, CLRSO1]. 


10.1.2 Depth-First Search 


Depth-first search (abbr. dfs) was investigated in the 19*" century as a strategy 
for exploring a maze [Lu82, Tarr95]. The fundamental properties of the depth-first 
search tree were discovered by Hopcroft and Tarjan [HoTa73a, Ta72]. Tarjan also de- 
veloped many other elegant and efficient dfs algorithms (see §10.1.6). The idea of 
depth-first search is to scan repeatedly an edge incident to the most recently discovered 
vertex that still has unscanned edges. 


DEFINITIONS 
D10: Two vertices in a tree are related vertices if one is an ancestor of the other. 


D11: In an undirected graph G = (V, E), a depth-first tree (abbr. dfs tree) from a 
vertex s is a tree subgraph T, rooted at s, that contains all the vertices of G that are 
reachable from s. 


e Edges of E(T) and E(G) — E(T) are called tree edges and nontree edges, 
respectively. 


e Each nontree edge is also called a back edge. 


The crucial property is that the two endpoints of each back edge are related. 


D12: In an undirected graph G, a depth-first spanning forest is a collection of 
depth-first trees, one for each connected component of G. Each vertex of G belongs to 
exactly one tree of the forest. 


D13: Let G = (V,E) be a directed graph where every vertex is reachable from a 
designated vertex s. A depth-first tree from s is an ordered tree in G, rooted at s 
that contains all vertices V. Each edge of T is called a tree edge. Each nontree edge 
(x,y) € E —T can be classified into one of three types: 


e A back edge has y an ancestor of x. 
e A forward edge has y a descendant of x. 
e A cross edge joins two unrelated vertices. 


The crucial property is that each cross edge (x,y) has x to the right of y. 
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D14: Let G be a directed graph, in which we no longer assume that some vertex can 
reach all others. A depth-first forest is an ordered collection of trees in G so that 
each vertex of G belongs to exactly one tree. The edges of G are classified into the 4 
types of edges in Definition 13 with one additional possibility: 


e A cross edge can join 2 vertices in different trees as long as it goes from right to 
left (i-e., from a higher numbered tree to a lower numbered tree). 


EXAMPLES 


E7: Figure 10.1.2 illustrates a depth-first search of an undirected graph. In drawings 
of depth-first spanning trees, tree edges are solid and nontree edges are dashed. There 
can be many depth-first trees with the same root. For instance, the tree edge (5,6) 
could be replaced by (5, 7). 


Figure 10.1.2: Undirected graph and depth-first spanning tree. 


E8: Figure 10.1.3 illustrates a depth-first search of a directed graph. There is 1 forward 
edge, 2 back edges and 2 cross edges. 


1 


Figure 10.1.3: Directed graph and depth-first spanning tree. 
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FACTS 


F8: Any vertex s of an undirected graph has a depth-first tree from s. Any vertex s of 
a directed graph has a depth-first tree of the subgraph induced by the vertices reachable 
from s. A high level algorithm to find such a tree is the following. 


Algorithm 10.1.2: Depth-First Search 


Input: directed or undirected graph G = (V, E), vertex s 
Output: depth-first tree T from s 
make s the root of T 
DFS(s) 
procedure DFS(s) 
vertex uv is discovered at this point 
for each edge (v, w) do 
edge (uv, w) is scanned (from v) at this point 
if w has not been discovered then 
make w the next child of uv 
DFS(w) 
vertex vu is finished at this point 


F9: ‘The procedure DFS is recursive, i.e., it calls itself. The overhead for a recursive 
call is O(1). Algorithm 10.1.2 uses linear time, O(n + m). 


F10: If scanning edge (v, w) from the vertex v results in the discovery of the vertex w, 
then (v,w) is a tree edge. 


F11: Suppose that the graph G is undirected. For the tree T produced by Algo- 
rithm 10.1.2 to be a valid depth-first tree, any edge (v,w) € E—T must have v and w 
related vertices. Why does T have this property? By symmetry suppose v gets discov- 
ered before w. Then w will either be made a child of v (like edge (3,5) in Figure 10.1.2) 
or a nonchild descendant of v (like edge (3,4) in Figure 10.1.2). 


F12: Suppose that the graph G is directed. For T to be a valid depth-first tree, any 
edge (v,w) must be one of the 4 possible types. Why does T have this property? First 
suppose v gets discovered before w. In that case w will be a descendant of v and (v, w) 
will be a tree or forward edge (as in Fact F11). Next suppose v is discovered after w. 
Then either v descends from w or v is to the right of w. In the former case (v,w) is a 
back edge and in the latter case (v,w) is a cross edge. 


F13: Algorithm 10.1.2 can be extended to a procedure that constructs a depth-first 
forest F: The procedure starts with F = ). It repeatedly chooses a vertex s ¢ F’, uses 
DF'S(s) to grow a depth-first tree T from s, and adds T to F. 


F14: Algorithm 10.1.2 uses linear time. (For directed graphs a point to note is that 
a vertex w gets added to only 1 tree of Ff. This is because once discovered, vertex w 
remains “discovered” throughout the whole procedure.) 


REMARKS 


R1: We can test whether an undirected graph is connected in linear time, by using a 
depth-first search. The trees of a depth-first search spanning forest give the connected 
components. 
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R2: We can test whether all vertices of a directed graph are reachable from a vertex 
s in linear time, by a depth-first search. 


Discovery Order 


DEFINITIONS 


D15: Discovery order is a numbering of the vertices from 1 to n in the order they 
are discovered. This is also called the preorder of the dfs tree. 


D16: In finish time order the vertices are numbered from 1 to n by increasing finish 
time. This is the postorder of the dfs tree. 


FACTS 


F15: Most algorithms based on the depth-first search tree use discovery order. These 
algorithms identify each vertex v with its discovery number, also called v. This is how 
the vertices are named in Figure 10.1.3. 


F16: In discovery order, the descendants of a vertex v are numbered consecutively, 
with v first, followed by all its proper descendants. This gives a quick way to test if a 
given vertex w descends from another given vertex uv: Let v have d descendants. w is a 
descendant of v exactly when v < w < v+d. This method can be implemented to run 
in O(1) (i-e., constant) time. 


REMARKS 


R3: The power of depth-first search comes from its simplification of the edge structure 
— the absence of cross edges in undirected graphs, and the absence of left-to-right edges 
in directed graphs. Depth-first search algorithms work by propagating information up 
or down the dfs tree(s). 


RA: Many simple properties of graphs can be analyzed without using the full power of 
depth-first search. The algorithm always works with a path in the dfs tree, rather than 
with the entire dfs tree. The algorithm propagates information along the path. 


R5: Asa simple example of Remark 4 we give a procedure that shows an undirected 
graph with minimum degree 6 has a path of length > 6: execute DF'S(s) (for any s), 
stopping at the first vertex t that becomes finished. The portion of tree T constructed 
by this procedure is a path from s to t of length > 6. The reason is that all of t’s 
neighbors must be in the path for t to be finished. 


R6: Sections 10.1.3-10.1.5 deal with simpler graph properties that can be handled 
by the path view of depth-first search. Section 10.1.6 covers deeper properties whose 
algorithms require the full power of the depth-first search tree. Section 10.1.7 deals with 
both views of depth-first search. 


10.1.3. Topological Order 


Topological order is the fundamental property of directed acyclic graphs. In con- 
junction with dynamic programming, topological order leads to efficient algorithms for 
many fundamental properties of directed acyclic graphs — even properties that are 
NP-complete in general graphs. 
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DEFINITIONS 
D17: A dag is a directed acyclic graph, i-e., it has no cycles. 


D18: A source of a dag is a vertex with indegree 0, and a sink is a vertex with 
outdegree 0. 


D19: A topological numbering (topological order, topological sort) of a directed 
graph assigns an integer to each vertex so that each edge is directed from lower number 
to higher number. 


EXAMPLES 


E9: The dag of Figure 10.1.4 has source a and sink f. Alphabetic order is a valid 
topological ordering. In general a dag has many topological numberings. In this figure 
12 are possible. 


Figure 10.1.4: Dag and topological order. 


E10: A dag can always be drawn so that all edges are directed downwards, as in 
Figure 10.1.4. Topological numbers guide the vertical placement of the vertices. This 
principle is useful in algorithms for drawing graphs (see Section 10.3). 


E11: Prerequisite graphs in a university department are dags: if course X is a prereq- 
uisite to course Y, then an arrow is drawn from X to Y. There cannot be a cycle, else no 
one could graduate! The course numbering is a topological numbering: a prerequisite 
to a course always has a lower number. 


E12: A combinational circuit is a collection of logic gates and interconnecting wires, 
with no feedback. The no-feedback property makes it a dag. 


E13: A graph of program dependencies is a dag (assuming no recursion is allowed). 
For instance, the dependency graph specified by a makefile is a dag. The make utility 
always ensures that a file’s timestamp is no later than the timestamp of any dependent 
file. Thus the timestamps form a topological numbering. 


E14: The formulas in a spreadsheet depend on one another, and this dependence 
relation is a dag. When the value of a cell is changed, the values of dependent cells are 
recalculated in topological order. 


E15: In ecology, a food web is a graph whose vertices are the species of an ecosystem. 
An arrow is drawn from one species to all the other species it preys upon. This model 
is commonly assumed to be a dag, to disallow cycles in the food chain. 
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FACTS 


F17: Every dag has one or more sources and one or more sinks. This can be seen by 
examining a path of maximal length. The first (last) vertex must be a source (sink), 
since otherwise the path could be extended at the beginning (end). 


F18: A graph with a topological numbering is a dag. To see this observe that topo- 
logical numbers increase along a path, so a path cannot return to its starting vertex. 
Thus no cycle exists. 


F19: Any dag has a topological numbering. To construct such a numbering with lowest 
number 1, assign the lowest number to a source s. Then proceed recursively on the dag 
G — s, using lowest number 2. 


F20: One can similarly construct a topological numbering by repeatedly numbering a 
sink s with the highest number, and proceeding recursively on dag G — s. 


F21: The strategy of Fact F20 can be implemented efficiently by depth-first search. 
The reason is that as we grow a depth-first path in a dag, the first vertex to become 
finished is a sink. More succinctly, we can grow a depth-first path until a sink is reached. 
This gives the following high-level algorithm. 


Algorithm 10.1.3: Topological Numbering (High Level) 


Input: dag G=(V,E) 
Output: topological numbering of G: vertex v has number J[v] 
repeat until G has no vertices: 

grow a dfs path P until a sink s is reached 

set I[s] =n, decrease n by 1 and delete s from P & G 


To make this algorithm efficient, each iteration grows the dfs-path P by starting with 
the previous P and extending it, if possible. 


F22: A lower level implementation of Algorithm 10.1.3 runs in linear time. The idea 
is to use array I[1..n] for 2 purposes: 


es 0 if v has never been in P 
"l= \ ¢ if v has been deleted and assigned topological number t 


Algorithm 10.1.4: Topological Numbering (Lower Level) 


Input: dag G=(V,E) 
Output: topological numbering of G: vertex v has number J[v] 
num =n; 
for each vertex v do I/v] = 0 
for each vertex vu do if I[v] = 0 then DF'S(v) 
procedure DF'S(v) 
for each edge (v, w) do 
if I[v] = 0 then DF'S(v) 
v is now a sink in the high level algorithm 
I[v] = num; decrease num by 1 
v is now deleted in the high level algorithm 
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F23: Algorithm 10.1.4 runs in linear time. It spends O(1) time on each vertex and 
edge. 
EXAMPLE 


E16: Figure 10.1.5 illustrates how the algorithm numbers the dag of Figure 10.1.4. 


@a @a @a a a @a 
I[a]=1 


bb $b $b b e 


I[b]=3 I[e]=2 


bc bc bu 


bi 


I[f]=6 


Figure 10.1.5: Execution of topological numbering algorithm. 


FACTS 


F24: Listing the vertices in order of decreasing finish time (Definition D16) is a valid 
topological order. 


F25: Tarjan’s algorithm for topological order [Ta74b, CLRSO1] is based on Fact F24. 
Algorithm 10.1.4 is a reinterpretation of Tarjan’s algorithm. 


To illustrate this, Figure 10.1.6 shows a dfs tree for Figure 10.1.4. Each vertex is 
labelled by its name and finish number. Subtracting each finish number from 7 gives 
the topological number of Figure 10.1.5. 


Figure 10.1.6: Topologically numbering by finish times. 
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F26: Another linear-time topological numbering algorithm [Kn73] works by repeatedly 
deleting a source. The algorithm maintains a queue of sources, as well as the in-degree 
of each vertex. If the in-degrees are not initially available this algorithm can do more 
work than Algorithm 10.1.4, since it makes two passes over the graph. 


F27: Dag algorithms often propagate information from higher topological numbers to 
lower, either after scanning each edge (v,w) or at the end of DF'S(v). Propagating 
information in the opposite direction is also possible. 


F28: As an example suppose each edge e of a dag G has a real-valued length £[e]. We 
can find the longest path in G in linear time. The idea is to set d[v] to the length of 
a longest path starting at v. These values d[v] can be computed in reverse topological 
order, using the recurrence 


d[v] = max0, ¢[v, w] + dlw](v,w) € E 
It is easy to modify DFS to calculate these values. 


The algorithm can recover the longest path from the d[ | values in a second pass. The 
second pass can be faster if the first pass stores a pointer for each vertex indicating 
its successor on its longest path. Longest paths are useful in critical path scheduling. 
Finding the longest path in a general graph is NP-complete. 


F29: Similar algorithms can be used to calculate the longest path from s to t, shortest 
paths from a vertex s, etc. 


F30: More generally Fact F28 illustrates how the technique of dynamic programming 
can be used to solve problems on dags. Dynamic programming is based on similar 
recurrences [CLRSO1]. 


EXAMPLE 


E17: Figure 10.1.7 illustrates how the algorithm finds a longest path in a dag. Edges 
are labelled with their length, and vertices are labelled with their d[] values. The longest 
path corresponds to the largest d[] value, which is 5; it is the upper path from source 
to sink. 


Figure 10.1.7: Illustrating a longest path algorithm. 
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10.1.4 Connectivity Properties 


Depth-first search is the method of choice to calculate low order connectivity informa- 
tion. This section surveys notions of 1- and 2-connectivity. It starts with 1-connectivity 
of directed graphs, and it then examines 2-connectivity of undirected graphs. These 
connectivity algorithms are originally due to Tarjan [Ta72]. This section follows the 
path-based development of [Ga00], which simplifies the algorithms to eliminate the 
depth-first spanning tree. 


Strong Components of a Directed Graph 
In this section, G = (V, £) is a directed graph. 
DEFINITIONS 


D20: For two vertices u and v, a uv-path is a path starting at u and ending at v. 


D21: A directed graph G = (V,£) is strongly connected if for every two distinct 
vertices u and v, there is a uv-path and a vu-path. 


D22: In general, a directed graph will not be strongly connected. But the vertices can 
be partitioned into blocks that are strongly connected, according to this definition: two 
vertices u & v are in the same strong component (SC) if and only if they can reach 
each other, i.e., there is a uv-path and a vu-path. This defines a partition of V since it 
is an equivalence relation. 


D23: For any directed graph G, contracting each SC to a vertex gives the strong 
component graph (SC graph) (also called the condensation of G). 


D24: A tournament is a directed graph G such that each pair of vertices is joined by 
exactly one edge. This models a round robin tournament, where edge (x, y) represents 
the fact that player x beat player y. 


FACTS 


F31: Let C be a cycle in a graph G. All vertices of C are in the same SC. Contracting 
the vertices of cycle C to a single vertex yields a graph with the same SC graph as G. 


F32: The SC graph is always a dag. This follows from Fact F31. 


F33: A topological numbering of the SC graph of a tournament gives a ranking of the 
players. To see why, note that if player x is in an SC with lower topological number 
than y, then the tournament contains the edge (x,y) (not (y,x)). Thus SC number 
1 contains the players that are unequivocally in the top tier — they all beat all other 
players. SC number 2 contains the 2nd tier players — they all beat all other players 
except those in tier 1, etc. 


F34: All the vertices on a cycle belong to the same SC. In fact the SC graph is formed 
by repeatedly contracting cycles, until no cycle remains. 


F35: A sink s is a vertex of the SC graph. In fact the SC’s are {s} and the SC’s of 
G-—s. 
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F36: Facts F34 and F35 justify the following high-level algorithm for finding the SC 
graph. It repeatedly contracts a cycle or deletes a sink. 


F37: Next we present a linear-time depth-first search algorithm for finding the strong 
components and the SC graph of a given directed graph. 


Algorithm 10.1.5: Strong Components 


Input: — directed graph G = (V, EF) 
Output: strong components of G 
repeat until G has no vertices: 
grow a dfs path P until a sink or a cycle is found 
sink s: mark {s} as an SC & delete s from P & G 
cycle C: contract the vertices of C 


Like Algorithm 10.1.3, for efficiency each iteration grows P by starting with the previous 
P and extending it, if possible. 


F38: The algorithm has a low-level implementation that finds the SC graph in linear 
time [Ga00]. Sinks are deleted similar to Algorithm 10.1.3. Cycles are contracted using 
a stack to represent P and another stack to give the boundaries of contracted vertices 
in P. 


F39: The algorithm discovers each SC as a sink of the SC graph. So the SC’s can be 
numbered in topological order by the method of Algorithm 10.1.3. 


F40: The first linear-time algorithm for strong components is due to Tarjan [Ta72]. 
It computes a value called lowpoint(v) for each vertex v. lowpoint(v) is the lowest- 
numbered vertex (in preorder) in v’s SC that is reachable from v by a path of (0 or 
more) tree edges followed by a back or cross edge (lowpoint(v) equals v if no smaller 
numbered vertex can be reached). The vertices with lowpoint(v) = v are the “roots” of 
the strong components. 


F41: A third linear-time strong component algorithm is due to Sharir [Sh81] and 
Kosaraju (unpublished; see also [CLRSO1]). It does a depth-first search, followed by 
a second depth-first search on the reverse graph. This makes good sense — the first 
search discovers which vertices can reach which others, and the second search discovers 
which vertices can be reached by which others. 


EXAMPLES 


E18: Figure 10.1.8 shows a directed graph, its three strong components, and its SC 
graph. Each strong component is strongly connected. 


An elementary misperception is that a strongly connected graph has a Hamiltonian 
cycle. The component {2, 4,5, 6} illustrates that this is not always true. 


E19: Figure 10.1.9 gives a dfs tree of Figure 10.1.8. (To better illustrate the algorithm 
a different dfs from Figure 10.1.8 is used.) Each vertex is labelled by its preorder number 
followed by its lowpoint value. 
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Figure 10.1.8: Strong components of a directed graph. 
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Figure 10.1.9: Execution of strong component algorithm. 


E20: Suppose we number the vertices of an arbitrary directed graph by topologically 
numbering the SC graph, and then listing first the vertices in SC number 1, then 
the vertices in SC number 2, etc. The adjacency matrix of the graph with new vertex 
numbers is upper block triangular. This is because no edge goes from a higher numbered 
SC to a lower numbered SC. For instance, Figure 10.1.10 gives the adjacency matrix. 
It is upper triangular except for the block corresponding to SC {b, d, e}. 
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Figure 10.1.10: Upper block triangular adjacency matrix. 


E21: Example E20 shows how the SC graph is used to speed up operations on sparse 
matrices like Gaussian elimination, matrix inversion, finding eigenvalues, etc. The given 
matrix M is interpreted as a directed graph, with m,; corresponding to edge (i, 7). 
The adjacency matrix of Example E20 is constructed, and the 1 for each edge (2,7) is 
replaced by the value m;;. The resulting block upper triangular matrix has less fill-in 
for Gaussian elimination and nice properties for other matrix operations [Ha69]. 
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E22: Figurel0.1.11 below illustrates the execution of the algorithm on the graph of 
Figure 10.1.8. 
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Figure 10.1.11: Execution of strong component algorithm. 


E23: Figure 10.1.12 below shows a tournament and its SC graph. Player a is first, 
players b,d,e are in the 2nd tier, and player c is last. 


a 


b C Se 


{a} {b,d,e} {c} 


d e 


Figure 10.1.12: Tournament and its SC graph. 


E24: A Markov chain is irreducible if the graph of its (nonzero) transition probabilities 
is strongly connected. 


REMARK 


R7: The algorithm of Fact F41 is very simple to code and is covered in many textbooks. 
It can be appreciably slower than the other two algorithms, because it makes two passes 
over the graph and has larger memory requirement. 


Bridges and Cutpoints of an Undirected Graph 
In this section G = (V, E) is a connected undirected graph. 


DEFINITIONS 


D25: A vertex v is an cutpoint (articulation point) if G—v is not connected. A 
graph is a biconnected graph if it has no cutpoint. 
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D26: A biconnected component is a maximal subgraph that has no cutpoint. 


D27: An edge e is a bridge if G—e is not connected. An edge is a bridge if and only 
if it’s not in any cycle. A graph is bridgeless if it has no bridges. 


D28: Let B be the set of all bridges of G. The bridge components (BCs) of G are 
the connected components of G — B. Equivalently a BC is the induced subgraph on a 
maximal set of vertices, any of which can reach any other without crossing a bridge. 


D29: Contracting each BC to a vertex gives a tree, the bridge tree. 
D30: An orientation of an undirected graph assigns a unique direction to each edge. 


D31: A perfect matching of an undirected graph G is a spanning subgraph in which 
every vertex has degree exactly 1. 


EXAMPLES 


E25: Figure 10.1.13 shows a graph with 3 bridges, 6 cutpoints, and 7 biconnected 
components. It illustrates that an end of a bridge is a cutpoint unless it has degree one. 
However, a cutpoint need not be the end of a bridge. 


Figure 10.1.13: Undirected graph with bridges and cutpoints. 


E26: If a communications network (e.g., Internet) has a bridge, that link’s failure 
disables communication, i.e., there are sites that cannot send messages to each other. If 
the network has an articulation point, that site’s failure also disables communication. 


FACTS 


F42: All vertices on a cycle are in the same BC. In fact the bridge tree is formed by 
repeatedly contracting cycles. 


F43: A vertex x of degree < 1 is a vertex of the bridge tree. In fact the BC’s are {x} 
and the BC’s of G — «. 
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F44: Facts F42 and F43 justify the following high-level algorithm for finding the 
bridges and bridge tree. It has a linear-time implementation almost identical to Algo- 
rithm 10.1.5, the strong component algorithm. We call the last vertex « of a dfs path 
a dead end if x has degree < 1. 


Algorithm 10.1.6: Bridges 


Input: connected undirected graph G = (V, E) 
Output: bridge components and bridges of G 
repeat until G has no vertices: 
grow a dfs path P until a cycle is found or a dead end is reached 
cycle C: contract the vertices of C 
dead end x: mark {x} as a BC 
if « has degree 1, then mark its edge as a bridge (of G) 


F45: A similar linear-time algorithm finds the cutpoints and biconnected components 
of an undirected graph [Ga00]. 


F46: The original linear-time dfs algorithm of Hopcroft and Tarjan for cutpoints and 
biconnected components [Ta72] is based on the idea of lowpoints (recall Fact F40). Start 
with a dfs tree T. Assume that the vertices are numbered in discovery order and that 
each vertex is identified with its discovery number. Define 


lowpoint(v) = min{v} U {w : some back edge goes from a descendant of v to w}. 
Hopcroft and Tarjan proved that G is biconnected if and only if 
(i) vertex 1 has exactly one child (which must be vertex 2); 
(ii) lowpoint(2) = 1; 
(iii) each vertex w > 2 has lowpoint(w) < v, where v is the parent of w. 


The cutpoints have a similar characterization. 


Lowpoint is easy to compute in a bottom-up pass over T’, since 
lowpoint(v) = min{v} U {lowpoint(w) : wa child of v} U {w :(v,w)a back edge}. 
EXAMPLES 


E27: Figure 10.1.14 below illustrates the execution of the bridges algorithm on the 
graph of Figure 10.1.13. 
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Figure 10.1.14: Execution of bridge algorithm. 


E28: Figure 10.1.15 below illustrates Robbins’s Theorem that a connected undi- 
rected graph has a strongly connected orientation if and only if it is bridgeless [Ro39]. 
If one of the horizontal edges is deleted, making the other a bridge, then the graph has 
no strongly connected orientation. 


P<] p< 


Figure 10.1.15: Undirected graph and strongly connected orientation. 


E29: Kotzig’s Theorem [Ko59] states that a unique perfect matching must contain 
a bridge of G. Figure 10.1.16 shows a graph with a unique perfect matching — matched 
edges are drawn heavy. Note that deleting the bridge of the matching gives another 
graph with a unique perfect matching. This idea can be used to efficiently find a unique 
perfect matching or show it does not exist [GaKaTa01]. 


Figure 10.1.16: Graph with a unique perfect matching. 


E30: Whitney’s Flipping Theorem asserts that a graph is planar if and only if 
each biconnected component is planar [Wh32al]. 
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10.1.5 DFS as a Proof Technique 


In addition to being a powerful algorithmic tool, depth-first search can be used 
to easily prove many theorems of graph theory. (It’s a handy way to remember the 
theorems too!) This subsection gives several examples. 


DEFINITIONS 
D32: A mized graph G can have both directed and undirected edges. 


D33: <A mixed graph G is traversable if every ordered pair of vertices u,v has a 
uv-path with all its directed edges pointing in the forward direction. (Traversability 
is equivalent to connectedness if G is undirected and to strong connectedness if G is 
directed.) 


D34: <A bridge in a mixed graph is an undirected edge that is a bridge of G when 
edge directions are ignored. 


D35: An orientation of a mixed graph assigns a unique direction to each undirected 
edge. 


EXAMPLES 


E31: Robbins’s Theorem can be proved using the high-level bridge algorithm (Algo- 
rithm 10.1.6) and the strong components algorithm (Algorithm 10.1.5). When the BC 
algorithm is executed on a bridgeless graph G, it ends with G contracted to a single 
vertex. But if the SC algorithm ends with the entire graph contracted to a single ver- 
tex, then the initial graph is strongly connected. So orient the given undirected graph 
G to make the execution of the SC algorithm on the orientation mimic the execution 
of the BC algorithm on G. To do this orient edges that extend the dfs path or cause 
contractions (in the BC algorithm) so they do the same in the SC algorithm. 


This is illustrated in Figure 10.1.17, which shows how a depth-first search executed 
on the undirected graph of Figure 10.1.15 gives the orientation shown in that figure. 
Enlarged hollow vertices are contractions of original vertices. 


Figure 10.1.17: dfs proof of Robbins’s Theorem. 


E32: The same approach proves a generalization of Robbins’s Theorem by Boesch and 
Tindell [BoTi80] that a traversable graph has a strongly connected orientation if and 
only if it has no bridge. It can be proved using Algorithm 10.1.5 with the sink rule 
replaced by a rule for a “1-sink,” i.e., a vertex with no leaving directed edge and only 
one incident undirected edge. 
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E33: Kotzig’s Theorem can be proved by dfs [Ga79]. We illustrate by proving a simple 
special case: a bipartite graph with a unique perfect matching has a vertex of degree one. 
The idea is to grow a dfs path P two edges at a time, repeatedly adding an unmatched 
edge (x,y) and the matched edge containing y. When the path cannot be extended, 
the last vertex y has degree 1. If not, a back edge from y creates an even length cycle, 
whose edges yield another perfect matching, as shown in Figure 10.1.18 below. 


Figure 10.1.18: dfs proof of Kotzig’s Theorem. 


A linear-time dfs algorithm for testing if a perfect matching is unique is given in 
[GaKaTa01]. 


E34: Rédei’s Theorem [Re34] states that any tournament has a Hamiltonian path, 
i.e., a simple path through all its vertices. This is easy to see by dfs: listing the vertices 
in order of decreasing finish time gives a Hamiltonian path. 


10.1.6 More Graph Properties 


The basic properties of depth-first search were developed by Hopcroft and Tarjan as 
stepping-stones to their goal of an efficient planarity algorithm. This subsection starts 
by surveying the high-level principles of the planarity algorithm. It then surveys other 
important properties that can be decided by efficient dfs algorithms. The depth-first 
tree plays a central role in all these algorithms. 


Planarity Testing 


The first complete linear-time algorithm to decide whether or not a graph is planar is 
due to Hopcroft and Tarjan. This property has obvious applications to graph drawing, 
circuit layout, etc. This section gives the high-level depth-first approach. 


DEFINITIONS 
D36: Let G be a biconnected graph with a cycle C. The edge set EF — E(C) can be 
partitioned into a family of subgraphs called segments as follows: 

(i) An edge not in C that joins 2 vertices of C is a segment. 


(ii) The remaining segments each consist of a connected component of G—V(C), plus 
all edges joining that component to C. 
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D37: Two segments S,T of a cycle C in a graph interlace either if |V(S) NV(T) NM 
V(C)| > 3, or if there are 4 distinct vertices u,v,w,2 that occur along cycle C’ (not 
necessarily consecutively) in that order such that u,w € S and v,x € T. 


EXAMPLE 


E35: Figure 10.1.19 below shows a cycle C’ (dotted) with 5 segments. Segments S1 
and Sp» interlace, and $4 interlaces with both S3 and Ss. 


Figure 10.1.19: Planar graph with interlacing segments. 


FACTS 


F47: By Whitney’s Flipping Theorem (E30), one can test planarity by treating each 
biconnected component separately. 


F48: The graph theoretic approach used by Hopcroft and Tarjan is the following 
theorem of Auslander and Parter [AuPa61]: a biconnected graph G is planar if and 
only if 


(a) CUS is planar for every segment S; 
(b) the segments can be partitioned into two families such that no two segments in 
the same family interlace. 
The necessity of both (a) and (b) is clear. An outline of a complete proof of this theorem 
is given in [Ev79]. 


F49: Here is the overall structure of the algorithm of Hopcroft and Tarjan [HoTa74] 
which decides in linear time whether or not a graph is planar. Each biconnected com- 
ponent is processed separately. 


A depth-first spanning tree of the component is found. 
A cycle C is chosen, consisting of a path in the dfs tree plus one back edge. 
Then segments are found: 

(i) each back edge that joins two vertices of C' is a segment; 
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(ii) each remaining segment S is determined by a vertex w ¢ C' whose parent is 
in C. The edges of S are those edges with at least one endpoint descending 
from w. (Specifically, this amounts to the tree edge joining w to its parent, 
plus all edges of the subtree rooted at w, plus all back edges that join two 
descendants of w or join a descendant of w with a vertex of C.) 


The algorithm processes each segment S' recursively, checking that CUS is planar 
and S$ can be added to an imbedding of all subgraphs processed so far. (The latter 
uses the interlacing criterion.) 


F50: A number of additional ideas are used to achieve linear time. The lowpoint 
values (Fact F46) are used to guide the construction of cycles C. In fact the “second 
lowpoint” is also used. A second depth-first search is done for cycle generation. The 
planarity algorithm is intricate, but is very fast in practice. 


Triconnectivity 


Hopcroft and Tarjan show how to find the triconnected components in linear time 
[HoTa73b]. Like their planarity algorithm the approach is based on segments. 


DEFINITIONS 


D38: An undirected graph is a triconnected graph if it is connected and remains so 
whenever any two or fewer vertices are deleted. 


D39: Two vertices in a biconnected graph form a separation pair if deleting them 
leaves a disconnected graph. 


There is a natural definition of the triconnected components of a graph. 


EXAMPLE 


E36: In Figure 10.1.19 above there are 5 separation pairs: a,b; a,c; d,e; e,f; and 
g,h. 


FACTS 


F51: The following characterization of the separation pairs is easy to prove. Let G be 
a biconnected graph with a cycle C. Let a,b be a separation pair. Then a and 6 either 
both belong to C' or both belong to a common segment. Moreover, suppose a and b 
both belong to C. Then either 


(a) some segment S has V(S)N V(C) = {a,b} C V(S); or 
(b) C — {a,b} consists of two nonempty paths, and no segment contains a vertex of 
both paths. 
(The symbol “c” denotes proper set containment.) 


F52: The triconnectivity algorithm applies the characterization of Fact F51 recur- 
sively. Hopcroft and Tarjan’s triconnectivity algorithm shares algorithmic ideas with 
their planarity algorithm. 


F53: Another useful fact is that the two vertices of a separation pair are related 
(Definition D10). 
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Ear Decomposition and st-numbering 


DEFINITIONS 


D40: An open ear decomposition of an undirected graph is a partition of the edges 
into a simple cycle Po and simple paths P,,..., P, such that for each i > 0, P; is joined 
to previous paths only at its (2 distinct) ends, ie., V(P;) 1 V(U;<iP;) consists of the 2 
ends of P;. (The concept, but not the name, is due to Whitney.) 

D41: Let (s,t) be any edge of a biconnected graph. An st-numbering numbers the 


vertices from 1 to n so that s is numbered 1, ¢t is numbered n, and every other vertex 
has both a higher-numbered neighbor and a lower-numbered neighbor. 


EXAMPLE 


E37: Figure 10.1.20 shows an ear decomposition consisting of cycle Py and simple 
paths P,,..., Ps. The 15 vertices are numbered in an st-numbering (corresponding to 
the ear decomposition). 


Figure 10.1.20: Ear decomposition and st-numbering of a biconnected graph. 


FACTS 


F54: Whitney [Wh32b] proved that an undirected graph is biconnected if and only if 
it has an open ear decomposition. 


F55: An algorithm based on lowpoint values can be used to find an open ear decompo- 
sition of a biconnected graph in linear time (pathfinder in [EvTa76], although the term 
“ear decomposition” is not used). 


F56: An open ear decomposition with (s,t) € Po can be used to give an st-numbering 
in linear time [EvTa76]. 


REMARKS 


R8: st-numbering is the basis of the linear-time planarity algorithm of Lempel, Even 
and Cederbaum [LeEvCe67]. It constructs a planar imbedding by repeatedly adding a 
vertex. More precisely it starts with an imbedding of one vertex and its incident edges. 
Then it repeatedly adds all edges incident to the next vertex, updating the imbedding. 
The vertices are added in st-order. 
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R9: Ear decomposition is closely related to depth-first search. An open ear decompo- 
sition can be found efficiently on parallel computers with large numbers of processors; 
the same cannot be said for doing a depth-first search. Efficient parallel algorithms for 
bi- and triconnectivity and planarity are based on ear decomposition [Ra93]. 


Reducibility 


DEFINITIONS 


D42: A (program) flow graph is a directed graph with a distinguished vertex r, the 
start vertex, that can reach every vertex. 


D43: A flow graph is reducible if it can be transformed into the single vertex r by a 
sequence of operations of the following type: 


if (v,w) is the only edge entering w and if w # r, then contract edge (v,w) to 
vertex v. 
(The contraction operation discards parallel edges and self-loops.) 


D44: We define a problem in data structures that arises in many dfs algorithms (and 
other contexts). A universe of n elements is given. The problem is to maintain a 
partition P of this universe into sets. 


Initially each element forms a singleton set of P. 
Partition P is updated by the operation union(A, B), which replaces two sets A 
and B of P by their union AU B. 
A second operation find(a) computes the name of the set currently containing 
element 2. 
The set-merging problem is to process a sequence of m intermixed union and find 
operations. 


EXAMPLE 
E38: Figure 10.1.21 shows an irreducible flow graph. In fact, this flow graph gives a 


if 


Figure 10.1.21: Irreducible flow graph. 


forbidden subgraph characterization of reduciblity: a flow graph is reducible if and only 
if it does not contain a subgraph consisting of 4 vertices r,a,b,c (where r and a may 
coincide but otherwise the vertices are distinct) joined by vertex disjoint paths from r 
to a,a to b, atoc, btocand cto b. 
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REMARK 


R10: Flow graphs model the structure of computer programs. Any program without 
goto’s has a reducible flow graph. Many methods for code optimization (e.g., eliminating 
common subexpressions, identifying active variables, finding useless definitions, etc.) 
depend on the graph being reducible [AhSeU186]. 


FACTS 


The starting point of a linear-time reducibility algorithm of Tarjan [Ta74a] for flow 
graphs is a reformulation of reducibility. It produces the sequence of contractions that 
reduce it to the start vertex. For each vertex w in a dfs tree of a flow graph, we define 


I(w) =v: there is a simple vw-path ending with a back edge (to w) 


F57: ([Ta74a] A flow graph is reducible if and only if every set I(w) consists only of 
descendants of w. 


F58: [Ta74a] Assume that the vertices of the flow graph are indexed by preorder 
number. Let w be the largest vertex (in preorder) with an entering back edge. Suppose 
that I(w) consists only of descendants of w. If we contract the vertices of I(w) into 
vertex w, then the new graph is reducible if and only if the original was. 


F59: Fact F58 specifies a sequence of contractions that substantiate the reducibility 
of a graph. The sets J(w) can be computed simply by scanning edges in the backwards 
direction, starting at w. If a nondescendant of w is ever reached, then the graph is not 
reducible. The efficient descendance test of Fact F16 is used. 


F60: The contractions performed by the algorithm change the vertex set of the graph. 
At all times the vertices of the current graph form a partition of the original vertex set. 
This partition is manipulated by the union and find operations (Definition D44). 


F61: The best known algorithm for set-merging is based on the so-called weighted 
union and path compression rules. Tarjan showed that this algorithm solves the set- 
merging problem in time O(ma(m,n)). Here a is an inverse of Ackermann’s function 
and is very slowly growing [Ta75, CLRSO1]. 


F62: Gabow and Tarjan [GaTa85] showed that a special case of the set-merging prob- 
lem can be solved in linear time. Using this special case algorithm makes the reducibility 
algorithm run in linear time. 


F63: Suppose the vertices are numbered in discovery order. If v < w then any vw-path 
contains a common ancestor of v and w [Ta72]. This property holds in both directed 
and undirected graphs. 


EXAMPLE 


E39: Figure 10.1.22 shows a depth-first spanning tree of a flow graph with the vertices 
labelled by discovery number. Any path from vertex 5 to vertex 8 passes through one 
or both of the common ancestors of 5 and 8, i.e., vertices 1 and 2. 
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Figure 10.1.22: Depth-first search with preorder numbers. 


Two Directed Spanning Trees 


DEFINITIONS 
D45: In a flow graph, an r-tree is a directed spanning tree rooted at start vertex r. 


D46: Consider a dfs tree with root r. Edge (v,w) is a (directed) bridge in a flow 
graph if every rw-path includes (v, w). 


EXAMPLE 


E40: In Figure 10.1.21 edge (r,a) is a bridge. Duplicating it gives a graph with 2 
edge-disjoint r-trees (the directed paths r,a,b,c and r,a,c,b). 


FACTS 
Consider a dfs tree with root r. For any vertex w define 


I(w) = {v: there is a simple vw-path containing only descendants of w} 


Clearly the path of this definition ends in a back edge to w (unless v = w). Note the 
similarity with Fact F57. 


F64: A flow graph has two edge-disjoint r-trees if and only if each vertex v # r has 
two edge-disjoint rv-paths (recall Definition D20). This is a special case of Edmonds’s 
Branching Theorem, which is the same statement generalized from 2 to any k > 2 
[Ed72]. 


F65: A flow graph has 2 edge-disjoint r-trees if and only if there are no bridges. 


F66: Tarjan presents a linear-time algorithm to find 2 edge-disjoint r-trees if they 
exist [Ta74a]. More generally in an arbitrary flow graph the algorithm finds 2 r-trees 
that contain the fewest possible number of common edges. The more general problem 
is solved by identifying the bridges and duplicating each of them. This gives a graph 
with 2 edge-disjoint r-trees (Fact F65). 


F67: In terms of dfs, an edge (v, w) is a bridge if and only if (v, w) is a tree edge and is 
the only edge entering I(w). Tarjan’s algorithm identifies the bridges using techniques 
similar to the reducibility algorithm. Computing the trees is more involved. 


F68: The algorithm performs set-merging to keep track of the contracted vertices, as 
in the reducibility algorithm. As in that algorithm the data structure of [GaTa85] is 
used to achieve linear time. 
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Dominators 


DEFINITIONS 


D47: Ina flow graph with start vertex r, vertex v dominates vertex w 4 v if every 
rw-path contains v. 


D48: The immediate dominator of w, denoted idom(w), is a vertex v that dominates 
w such that every other dominator of w dominates v. 


D49: The dominator tree is a tree T whose nodes are the vertices of G. The root of 
T is the start vertex r. The parent of a vertex v £ r is idom(v). 


D50: The internal vertices of a path are all its vertices except the endpoints. 


D51: Consider a dfs tree with root r. For every vertex w # r, the semidominator 
of w, sdom(w), is defined by 


sdom(w) = min{v : some vw-path has all its internal vertices > w}. 


EXAMPLE 


E41: Figure 10.1.23 shows the dominator tree for the graph of Figure 10.1.22. Note 
that vertex 2 does not dominate 3 because of path 1,6,7,8,5,3. The start vertex 1 is 
the semidominator of every vertex except two: sdom(7) = 2, sdom(8) = 7. Although 
vertex 1 is the immediate dominator of 7 it is not the semidominator of 7. 


Figure 10.1.23: Dominator tree for Figure 10.1.22. 


FACTS 


F69: The basic properties of dominance are due to Lowry and Medlock [LoMe69]: 
Every vertex except r has a unique immediate dominator. This justifies the notion of 
dominator tree. A vertex v dominates w if and only if v is a proper ancestor of w in 
the dominator tree. 


F70: Lengauer and Tarjan [LeTa79] give an efficient algorithm to find the dominator 
tree T. It is a refinement of an earlier dfs algorithm of Tarjan [Ta74b]. 


F71: For any vertex w, sdom(w) is a proper ancestor of w. This follows from Fact 
F63. 


FACTS ABOUT SEMIDOMINATORS 
Semidominators are useful because of the next two facts proved by Lengauer and Tarjan: 
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F72: Take any vertex w #r. Let u be a vertex with minimum value sdom(u) among 
all vertices in the tree path from sdom(w) to w, excluding sdom(w). Then 


: _ Jf sdom(w) if sdom(w) = sdom(u) 
De { idom(u) otherwise 


F73: Semidominators can be computed by a recursive definition: 


sdom(w) = min{v: (v,w)an edge} U 


{sdom(u) : u> w and some edge goes from a descendant of u to w}. 
(Note the similarity with lowpoint in Fact F46.) 


F74: The algorithm of Lengauer and Tarjan [LeTa79] computes semidominators using 
Fact F73 in a backwards pass (i.e., w is decreasing). Then it computes immediate 
dominators using Fact F72 in a forwards pass. 


F75: The time for the algorithm is O(ma(m,n)). An implementation of this algorithm 
in linear time is presented in [AlHaLaTh99]. 


10.1.7 Approximation Algorithms 


Finding small spanning subgraphs with prespecified connectivity properties is usually 
a difficult (NP-hard) problem. For example, finding a bridgeless spanning subgraph 
with the fewest possible number of edges is NP-hard. (The reason is that this subgraph 
contains n edges if and only if there is a Hamiltonian cycle.) 

Depth-first search has been used to design good approximation algorithms for such 
difficult problems. Here the goal is to find a subgraph that has all the desired properties 
except that instead of having the fewest possible number of edges, it is within a small 
constant factor of this goal. This section surveys the use of depth-first search in approx- 
imation algorithms for connectivity properties. Other dfs approximation algorithms are 
surveyed in [Kh97]. 


DEFINITIONS 


D52: Consider an optimization problem that seeks to find a smallest feasible solution 
OPT. An a-approximation algorithm is a polynomial-time algorithm that is guar- 
anteed to find a solution of size at most a|OPT| [CLRSO1]. For the graph problems of 
this section, the size of the solution is the number of edges. 


D53: The smallest bridgeless spanning subgraph of a connected bridgeless undi- 
rected graph is a bridgeless spanning subgraph with the minimum possible number of 
edges. 


D54: An undirected graph is k-edge connected if it is connected and remains so when 
any set of fewer than k edges is deleted. This concept makes good sense for a multigraph. 
A k-ECSS is a k-edge connected spanning subgraph; the graph is assumed to be k-edge 
connected. So a bridgeless spanning subgraph is a 2-ECSS. A smallest k-ECSS has the 
fewest possible number of edges. From now on instead of “smallest bridgeless spanning 
subgraph,” we use the shorter equivalent phrasing, “smallest 2-ECSS.” 
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ALGORITHM 


Approximation algorithms for the smallest 2-ECSS are our first concern. A 2-approxi- 
mation can be designed from Algorithm 10.1.6 in a straightforward way. Khuller and 
Vishkin [KhVi94] were the first to go beyond this. They presented an elegant dfs 
algorithm based on a “tree carving” using the dfs tree. The following modification of 
Algorithm 10.1.6 is a path-based reinterpretation of their algorithm. 


Algorithm 10.1.7: Smallest 2-ECSS Approximation 


Input:  bridgeless undirected graph G = (V, E) 
Output: edge set F C FE, a 3/2-approximation to the smallest 2-ECSS 
F=0 
repeat until G has 1 vertex: 
grow a dfs path P until its endpoint x has all neighbors belonging to P 
let y be the neighbor of x closest to the start of P 
let C be the cycle formed by edge (x, y) edges of P 
add all edges of C' to F 
contract the vertices of C’ 


EXAMPLE 


E42: Figure 10.1.24 below gives a sample execution of the algorithm. The given graph 
on top has a Hamiltonian cycle, so the smallest 2-ECSS has n edges. 


The algorithm grows the depth-first path of solid edges shown in the middle, 
starting from r. It then adds the dashed edges. 


A typical edge addition is illustrated in the bottom graph, where the enlarged 
hollow vertex is the contraction of the last vertices on the path. 


As n approaches co, the algorithm’s solution approaches 3n/2 edges: n solid edges and 
n/2 dashed edges. So the approximation ratio approaches 3/2. 


VAVAVAVAVA 


Figure 10.1.24: Smallest 2-ECSS approximation algorithm. 
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FACTS 


Good approximation algorithms require good lower bounds on the size of the opti- 
mum solution. We analyze this algorithm using 2 lower bounds. 


F76: The Degree Lower Bound says that any 2-ECSS has at least n edges. This 
results from the fact that every vertex must have degree at least 2. 


F77: The Carving Lower Bound says that if Algorithm 10.1.7 contracts c cycles, 
then any 2-ECSS has at least 2c edges [KhVi94]. To see this let x be an endpoint of P 
giving a contraction in the algorithm. Any 2-ECSS contains > 2 edges leaving x. These 
edges disappear in the contraction operation. So we can repeat this argument for every 
contraction, getting a lower bound of 2c edges. 


F78: Algorithm 10.1.7 is a 3/2 approximation. This follows because the edge set F’ 
consists of n — 1 edges from paths P and c edges that cause contractions. If OPT is 
the edge set of a 2-ECSS, then |OPT| > n (Degree Lower Bound) and |OPT|/2 > c 
(Carving Lower Bound). Thus |F'| < 3}OPT|/2. 


F79: Vempala and Vetta [VeVe00] present a 4/3-approximation algorithm for the 
smallest 2-ECSS. Their algorithm is based on the idea of doing a depth-first search 
of objects of the graph, specifically cycles and paths. It uses the Matching Lower 
Bound: Any 2-ECSS has at least as many edges as a smallest spanning subgraph 
where every vertex has degree > 2. Vempala and Vetta give a similar 4/3-approximation 
algorithm for the smallest biconnected subgraph of a biconnected graph. 


F80: Jothi, Raghavachari & Varadrajan [JoRaVa03] use a stronger version of the 
Matching Lower Bound in a dfs algorithm that achieves performance ratio 5/4 for the 
smallest 2-ECSS. Vetta [Ve01] uses a version of the Matching Lower Bound in a dfs 
algorithm that approximates the smallest strongly connected subgraph of a strongly 
connected graph to within a factor 3/2. 


F81: The Carving Lower Bound extends to k-ECSS: If Algorithm 10.1.7 contracts c 
cycles then any k-ECSS has at least kc edges [KhVi94]. This can be proved by simply 
changing the “2”’s to k’s in Fact F77. 


F82: Gabow [Ga02] gives a dfs algorithm that is a 3/2-approximation for the smallest 
3-ECSS of a multigraph. It uses the above dfs approach of [KhVi94] for 2-ECSS, the 
Carving Lower Bound, and ear decomposition. 


F83: Khuller and Raghavachari [KhRa96] present the first approximation algorithm 
that achieves ratio < 2 for the smallest k-ECSS of a multigraph. It boosts the edge- 
connectivity of the solution graph in steps of 2. Each of these steps is a slight variant of 
the above algorithm of [KhVi94]. The analysis is based on a refinement of the Carving 
Lower Bound. Gabow [Ga03] improves the analysis to show it is a 1.61-approximation. 
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INTRODUCTION 


In many applications of graph algorithms, including social networks, communica- 
tion networks, transportation networks, VLSI design, graphics, and assembly planning, 
graphs are subject to discrete changes, such as additions or deletions of edges or vertices, 
or edge cost changes. In the last two decades there has been a growing interest in such 
dynamically changing graphs, and a whole body of algorithms and data structures for 
dynamic graphs has been discovered. This section of the Handbook is intended as an 
overview of this field. 


10.2.1 Basic Terminology 


In a typical dynamic graph problem, one would like to answer queries on graphs that 
are changing dynamically. For instance, while a graph is undergoing dynamic changes, 
one might be interested to know whether the graph is connected, or which is the shortest 
path between any two vertices. 


DEFINITIONS 


D1: An update on a graph is an operation that inserts or deletes edges or vertices of 
the graph or changes attributes associated with edges or vertices, such as cost or color. 


D2: A dynamic graph is a graph that is undergoing a sequence of updates. 
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REMARK 


R1: The goal of a dynamic graph algorithm is to update efficiently the solution of a 
problem after dynamic changes, rather than having to recompute it from scratch each 
time. Given their powerful versatility, it is not surprising that dynamic algorithms and 
dynamic data structures are often more difficult to design and to analyze than their 
static counterparts. 


DEFINITIONS 
We can classify dynamic graph problems according to the types of updates allowed. 


D3: A dynamic graph problem is said to be fully dynamic if the update operations 
include unrestricted insertions and deletions of edges or vertices. 


D4: A dynamic graph problem is said to be partially dynamic if only one type of 
update, either insertions or deletions, is allowed. 


D5: A dynamic graph problem is said to be incremental if only insertions are allowed. 


D6: A dynamic graph problem is said to be decremental if only deletions are allowed. 


REMARKS 


R2: In the first part of this work we will present the main algorithmic techniques used 
to solve dynamic problems on undirected graphs. To illustrate those techniques, we will 
focus particularly on dynamic minimum spanning trees and on connectivity problems. 


R3: In the second part of this work we will deal with dynamic problems on directed 
graphs, and we will investigate as paradigmatic problems the dynamic maintenance of 
transitive closure and shortest paths. 


RA: Interestingly enough, dynamic problems on directed graphs seem much harder 
to solve than their counterparts on undirected graphs, and they require completely 
different techniques and tools. 


10.2.2 Dynamic Problems on Undirected Graphs 


This part considers fully dynamic algorithms for undirected graphs. These algo- 
rithms maintain efficiently some property of a graph that is undergoing structural 
changes defined by insertion and deletion of edges, and/or updates of edge costs. To 
check the graph property throughout a sequence of these updates, the algorithms must 
be prepared to answer queries on the graph property efficiently. 


EXAMPLES 


E1: The fully dynamic minimum spanning tree problem consists of maintaining 
a minimum spanning forest of a graph during insertions of edges, deletions of edges, 
and edge cost changes. 


E2: A fully dynamic connectivity algorithm must be able to insert edges, delete 
edges, and answer a query on whether the graph is connected, or whether two vertices 
are in the same connected component. 
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REMARKS 


R5: The goal of a dynamic algorithm is to minimize the amount of recomputation 
required after each update. 


R6: All the dynamic algorithms that we describe are able to maintain dynamically the 
graph property at a cost (per update operation) which is significantly smaller than the 
cost of recomputing the graph property from scratch. 


In this part, first we present general techniques and tools used in designing dynamic 
algorithms on undirected graphs, and then we survey the fastest algorithms for solving 
two of the most fundamental graph problems: connectivity and minimum spanning 
trees. 


General Techniques for Undirected Graphs 


Many of the algorithms proposed in the literature use the same general techniques, and 
hence we begin by describing these techniques. As a common theme, most of these 
techniques use some sort of graph decomposition, and they partition either the vertices 
or the edges of the graph to be maintained. Moreover, data structures that maintain 
properties of dynamically changing trees are often used as building blocks by many 
dynamic graph algorithms. The basic update operations are edge insertions and edge 
deletions. Many properties of dynamically changing trees have been considered in the 
literature. 


EXAMPLES 


E3: The basic query operation is tree membership: while the forest of trees is dynam- 
ically changing, we would like to know at any time which tree contains a given vertex, 
or whether two vertices are in the same tree. Dynamic tree membership is a special 
case of dynamic connectivity in undirected graphs, and indeed in the following we will 
see that some of the data structures presented here for trees are useful for solving the 
more general problem on graphs. 


E4: Other properties have also been considered: the parent of a vertex, the least 
common ancestor of two vertices, and the center or the diameter of a tree [AlHoDeTh97, 
AlHoTh00, S1Ta83]. When costs are associated either to vertices or to edges, one could 
also ask what is the minimum or maximum cost in a given path. 


In what follows, we first present three different data structures that maintain properties 
of dynamically changing trees: topology trees, ET trees, and top trees. Next, we discuss 
techniques that can be applied on general undirected graphs: clustering, sparsification, 
and randomization. In the course of the presentation, we also highlight how these 
techniques have been applied to solve the fully dynamic connectivity and/or minimum 
spanning tree problems, and which update and query bounds can be achieved when 
they are deployed. 
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Topology Trees 


Topology trees have been introduced by Frederickson [Fr85] to maintain dynamic trees 
upon insertions and deletions of edges. 


DEFINITIONS 
D7: Given a tree T of a forest, a cluster is a connected subgraph of T. 
D8: The cardinality of a cluster is the number of its vertices. 


D9: The external degree of a cluster is the number of tree edges incident to it. 


ASSUMPTION 

In order to illustrate the solution proposed by Frederickson [Fr85, Fr97], we assume 
that the tree T has maximum vertex degree 3: this is without loss of generality, since a 
standard transformation can be applied if this is not the case [Ha69]. 


DEFINITION 
D10: A restricted partition of a tree T is a partition of its vertex set V into clusters 
of external degree < 3 and cardinality < 2 such that: 

(1) Each cluster of external degree 3 has cardinality 1. 


(2) Each cluster of external degree < 3 has cardinality at most 2. 


(3) No two adjacent clusters can be combined and still satisfy the above. 


EXAMPLE 


E5: A restricted partition of order 2 of a tree T is shown in Figure 10.2.1. 


Figure 10.2.1: A restricted partition of order 2 of a tree T. 


REMARKS 


R7: There can be several restricted partitions for a given tree T, based upon different 
choices of the vertices to be unioned. For instance, vertex 8 in Figure 10.2.1 could be 
unioned with vertex 7, instead of vertex 11, and the partition would still be valid. 
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R8: Because of clause (3), the restricted partition implements a cluster-forming scheme 
according to a locally greedy heuristic, which does not always obtain the minimum 
number of clusters, but which has the advantage of requiring only local adjustments 
during updates. 


DEFINITIONS 


D11: A restricted multi-level partition consists of a collection of restricted parti- 
tions satisfying the following: 
(1) The clusters at level 0 (known as basic clusters) contain one vertex each. 
(2) The clusters at level € > 1 form a restricted partition with respect to the tree 
obtained after shrinking all the clusters at level ¢— 1. 


(3) There is exactly one vertex cluster at the topmost level. 


D12: A topology tree is a hierarchical representation of a tree T based on multi-level 
partitions. Each level of the topology tree partitions the vertices of TJ into clusters. 
Clusters at level 0 contain one vertex each. Clusters at level ¢ > 1 form a restricted 
partition of order 2 of the vertices of the tree T’ obtained by shrinking each cluster at 
level £— 1 into a single vertex. 


EXAMPLE 


E6: An example of topology tree, together with the restricted partitions used to 
obtain its levels, is given in Figure 10.2.2. 


Multilevel restricted partition 
Hl 
i 
i 
i 
H 
tk 
n/| m i 


Topology tree 


Figure 10.2.2: Restricted partitions and topology tree of a tree T. 
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APPROACH 


Edge deletion. We sketch how to update the clusters of a restricted partition when 
an edge e is deleted from a tree 7’. First, removing e splits T into two trees, say T; and 
T>, which inherit all of the clusters of T, possibly with the following exceptions. 

e Ife is entirely contained in a cluster, this cluster is no longer connected and therefore 
must be split. After the split, we must check whether each of the two resulting 
clusters is adjacent to a cluster of tree degree at most 2, and if these two adjacent 
clusters together have cardinality < 2. If so, we combine these two clusters in order 
to maintain condition (3) in Definition D10. 

e Ife is between two clusters, then no split is needed. However, since the tree degree of 
the clusters containing the endpoints of e has been decreased, we must check if each 
cluster should be combined with an adjacent cluster, again because of condition (3) 
in Definition D10. 


Edge insertion. Similar local manipulations can be applied to restore invariants (1) — 
(3) in Definition D10 in case of edge insertions. 

Construction of the topology tree. The levels of the topology tree are built in a 
bottom up fashion by repeatedly applying the locally greedy heuristic. 

Update of the topology tree. Each level can be updated upon insertions and dele- 
tions of edges in tree T by applying a few locally greedy adjustments similar to the ones 
described before. In particular, a constant number of basic clusters (corresponding to 
leaves in the topology tree) are examined: the changes in these basic clusters percolate 
up in the topology tree, possibly causing vertex clusters to be regrouped in different 
ways. 


FACTS 


F1: The number of nodes at each level of the topology tree is a constant fraction of 
that at the previous level, and thus the number of levels is O(log n) (see [Fr85, Fr97]). 


F2: The property that only a constant amount of work has to be done on O(log n) 
topology tree nodes implies a logarithmic bound on the update time. 


F3: (Frederickson’s Theorem) [Fr85] The update of a topology tree because of an edge 
insertion or deletion can be supported in O(log n) time. 


ET Trees 


ET trees have been introduced by Henzinger and King [HeKi99] to work on dynamic 
forests whose vertices are associated with weighted or unweighted keys. Updates allow 
it to cut arbitrary edges, to insert edges linking different trees of the forest, and to add 
or remove the weighted key associated to a vertex. Supported queries are the following: 
e Connected(u,v): tells whether vertices u and v are in the same tree. 

e Size(v): returns the number of vertices in the tree that contains v. 

e Minkey(v): returns a key of minimum weight in the tree that contains v; if keys are 

unweighted, an arbitrary key is returned. 


DEFINITIONS 


D13: An Euler tour of a tree T is a maximal closed walk over the graph obtained 
by replacing each edge of T’ by two directed edges with opposite direction. The walk 
traverses each edge exactly once; hence, if T has n vertices, the Euler tour has length 
2n — 2 (see Figure 10.2.3). 
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D14: An ET tree is a balanced binary tree (the number of nodes in the left and right 
subtrees of each node differs by at most one) over some Euler tour around T. Namely, 
leaves of the balanced binary tree are the nodes of the Euler tour, in the same order in 
which they appear (see Figure 10.2.3). 


Tree T 


Euler tour of T 


Sa naa 


ET-tree of T 


Figure 10.2.3: Euler tour and ET tree of a tree T. 


REMARK 


R9: Although each vertex of T may occur several times in the Euler tour (an arbitrary 
occurrence is marked as representative of the vertex), an ET tree has O(n) nodes. 


APPROACH 


Edge insertion and deletion. If trees in the forest are linked or cut, a constant 
number of splits and concatenation allows reconstruction of the new Euler tour(s); the 
ET tree(s) can then be rebalanced by affecting only O(log n) nodes. 


Connectivity queries. The query Connected(u, v) can be easily supported in O(log n) 
time by finding the roots of the ET trees containing u and v and checking if they coincide. 


Size and Minkey queries. To support Size and Minkey queries, each node q of the 
ET tree maintains two additional values: the number s(q) of representatives below it 
and the minimum weight key k(q) attached to a representative below it. Such values 
can be maintained in O(logn) time per update, which allows answering queries of the 
form Size(v) and Minkey(v) in O(logn) time for any vertex v of the forest: the root r 
of the ET tree containing v is found and values s(r) and k(r) are returned, respectively. 
See [HeKi99] for additional details of the method. 
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FACT 


F4: Both updates and queries can be supported in O(logn) time using ET trees 
(see [HeKi99]). 


Top Trees 


Top trees have been introduced by Alstrup et al. [AlHoDeTh97] to maintain efficiently 
information about paths in trees, such as, e.g., the maximum weight on the path between 
any pair of vertices in a tree. The basic idea is taken from Frederickson’s topology 
trees, but instead of partitioning vertices, top trees work by partitioning edges: the 
same vertex can then appear in more than one cluster. 


DEFINITIONS 


D15: Similarly to [Fr85, Fr97], a cluster is a connected subtree of tree T, with the 
additional constraint that at most two vertices, called boundary vertices, have edges 
out of the subtree. 


D16: Two clusters are said to be nezghbors if their intersection contains exactly one 
vertex. 


D17: A top tree of T is a binary tree such that: 

e The leaves and the internal nodes represent edges and clusters of T’, respectively. 

e The subtree represented by an internal node is the union of the subtrees represented 
by its two children, which must be neighbors. 

e The root represents the entire tree T. 

e The height is O(log n). 


EXAMPLE 


E7: We refer to Figure 10.2.4 for an example of a top tree. 


APPROACH 

Top trees can be maintained under edge insert and delete operations in tree T by making 
use of two basic Merge and Split operations. 

Merge. It takes two top trees whose roots are neighbor clusters and joins them to form 
a unique top tree. 

Split. This is the reverse operation, deleting the root of a given top tree. 

Edge insertion and deletion. The implementation of an edge insertion/deletion 
starts with a sequence of Split of all ancestor clusters of edges whose boundary changes 
and finishes with a sequence of Merge. Since an end-point v of an edge has to be a 
boundary vertex of the edge if v is not a leaf, each edge insert/delete can change the 
boundary of at most two edges, excluding the edge being inserted/deleted. 


FACT 
F5: [AlHoDeTh97] For a dynamic forest we can maintain top trees of height O(log n) 


supporting edge insertions and deletions with a sequence of O(logn) Split and Merge. 
The sequence itself is identified in O(log n) time. 
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Figure 10.2.4: Clusters and top tree of a tree T. 
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REMARKS 


R10: ‘Top trees are typically used by attaching extra information to their nodes. 
A careful choice of the extra information makes it possible to maintain easily path 
properties of trees, such as the maximum weight of an edge in the unique path between 
any two vertices. See [AlHoDeTh97, AIHoTh00, HoDeTh01] for sample applications. 
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R11: Top trees are a natural generalization of standard balanced binary trees over 
dynamic collections of lists that may be concatenated and split, where each node of the 
balanced binary tree represents a segment of a list. In the terminology of top trees, this 
is just a special case of a cluster. 


Clustering 


The clustering technique of [Fr85] is based upon partitioning the graph into a suit- 
able collection of clusters, such that each update involves only a small number of such 
clusters. 


REMARKS 


R12: Typically, the decomposition defined by the clusters is applied recursively, and 
the information about the subgraphs is combined with the topology trees described 
above. 


R13: A refinement of the clustering technique appears in the idea of ambivalent data 
structures [Fr97], in which edges can belong to multiple groups, only one of which is 
actually selected depending on the topology of the given spanning tree. 


EXAMPLE 


E8: We briefly describe the application of clustering to the problem of maintaining a 
minimum spanning forest [Fr85]. Let G = (V, £) be a graph with a designated spanning 
tree S. Clustering is used for partitioning the vertex set V into subtrees connected in 
S, so that each subtree is only adjacent to a few other subtrees. A topology tree is 
then used for representing a recursive partition of the tree S. Finally, a generalization 
of topology trees, called 2-dimensional topology trees, is formed from pairs of nodes in 
the topology tree and allows it to maintain information about the edges in E \ S [Fr85]. 


FACTS 


F6: Fully dynamic algorithms based only on a single level of clustering obtain typically 
time bounds of the order of O(m?/*) (see for instance [Galt92, Ra95]). 


F7: When the partition can be applied recursively, better O(m!/?) time bounds can 
be achieved by using 2-dimensional topology trees (see, for instance, [Fr85, Fr97]). 


F8: (Frederickson’s theorem) [Fr85] The minimum spanning forest of an undirected 
graph can be maintained in time O(m'/2) per update, where m is the current number 
of edges in the graph. 


REMARKS 


R14: See [Fr85, Fr97] for details about Frederickson’s algorithm. With the same 
technique, an O(m'/?) time bound can be obtained also for fully dynamic connectivity 
and 2-edge connectivity [Fr85, Fr97]. 


R15: The type of clustering used can be very problem-dependent, however, which 
makes this technique difficult to be used as a black box. 
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Sparsification 


Sparsification is a general technique due to Eppstein et al. [EpGaltNi97] that can be 
used as a black box (without having to know the internal details), in order to design 
and dynamize graph algorithms. It is a divide-and-conquer technique that allows it to 
reduce the dependence on the number of edges in a graph, so that the time bounds 
for maintaining some property of the graph match the times for computing in sparse 
graphs. More precisely, when the technique is applicable, it speeds up a T(n,m) time 
bound for a graph with n vertices and m edges to T(n, O(n)), i-e., to the time needed 
if the graph were sparse. For instance, if T(n,m) = O(m1/?), we get a better bound of 
O(n'/?). The technique itself is quite simple. A key concept is the notion of certificate. 


DEFINITIONS 


D18: For any graph property P and graph G, a certificate for G is a graph G’ such 
that G has property P if and only if G’ has the property. 


D19: A subgraph on n vertices is sparse if it has O(n) edges. 


D20: A time bound T(n) is well-behaved if, for some c < 1, T(n/2) < cT(n). 
Well-behavedness eliminates strange situations in which a time bound fluctuates wildly 
with n. For instance, all polynomials are well-behaved. 


APPROACH 


Let G be a graph with m edges and n vertices. We partition the edges of G into a 
collection of O(m/n) sparse subgraphs. The information relevant for each subgraph can 
be summarized in a sparse certificate. Certificates are then merged in pairs, producing 
larger subgraphs which are made sparse by again computing their certificate. The result 
is a balanced binary tree in which each node is represented by a sparse certificate. Each 
update involves O(log(m/n)) graphs with O(n) edges each, instead of one graph with 
m edges. 


NOTATION 


In the present context, log x stands for max(1, log, x), so that log(m/n) is never smaller 
than 1, even if m < 2n. 


REMARKS 


There exist two variants of sparsification. 


R16: The first variant is used in situations where no previous fully dynamic algorithm 
is known. A static algorithm is used for recomputing a sparse certificate in each tree 
node affected by an edge update. If the certificates can be found in time O(m +n), this 
variant gives time bounds of O(n) per update. 


R17: In the second variant, certificates are maintained using a dynamic data structure. 
For this to work, a stability property of certificates is needed, to ensure that a small 
change in the input graph does not lead to a large change in the certificates. (We refer 
the interested reader to [EpGalItNi97] for a precise definition of stability.) This variant 
transforms time bounds of the form O(m?) into O(n?). 
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FACTS 


F9: [EpGaltNi97] Let P be a property for which we can find sparse certificates in time 
f(n,m) for some well-behaved f, and such that we can construct a data structure for 
testing property P in time g(n,m) which can answer queries in time g(n,m). Then there 
is a fully dynamic data structure for testing whether a graph has property P, for which 
edge insertions and deletions can be performed in time O(f(n, O(n))) + g(n, O(m)), and 
for which the query time is q(n, O(n)). 


F10: [EpGalItNi97] Let P be a property for which stable sparse certificates can be 
maintained in time f(n,m) per update, where f is well-behaved, and for which there 
is a data structure for property P with update time g(n,m) and query time g(n,m). 
Then P can be maintained in time O(f(n, O(n))) + g(n, O(n)) per update, with query 
time g(n, O(n)). 


REMARKS 


R18: Basically, the first version of sparsification (Fact F9) can be used to dynamize 
static algorithms, in which case we only need to compute efficiently sparse certificates, 
while the second version (Fact F10) can be used to speed up existing fully dynamic 
algorithms, in which case we need to maintain efficiently stable sparse certificates. 


R19: Sparsification applies to a wide variety of dynamic graph problems, including 
minimum spanning forests, edge-connectivity, and vertex-connectivity. As an example, 
for the fully dynamic minimum spanning tree problem, it reduces the update time from 
O(m1/?) [Fr85, Fr97] to O(n!/) [EpGaltNi97]. 


R20: Since sparsification works on top of a given algorithm, we need not know the 
internal details of this algorithm. Consequently, it can be applied orthogonally to other 
data structuring techniques: in a large number of situations both clustering and spar- 
sification have been combined to produce an efficient dynamic graph algorithm. 


Randomization 


Clustering and sparsification allow one to design efficient deterministic algorithms for 
fully dynamic problems. The last technique we present in this section is due to Henzinger 
and King [HeKi99]; it achieves faster update times for some problems by exploiting the 
power of randomization. 


APPROACH 

We sketch how the randomization technique works, taking the fully dynamic connec- 
tivity problem as an example. In a graph G = (V,£) to be maintained dynamically, 
the edges of a spanning forest F' are called tree edges, and the edges in F \ F are called 
non-tree edges. 


Maintaining spanning forests. Trees in the spanning forests are maintained using 
the Euler tours data structure (ET trees) described above, which allows one to obtain 
logarithmic updates and queries within the forest. 

Random sampling. A key idea behind the technique of Henzinger and King is the 
following: when e is deleted from a tree T’, use random sampling among the non-tree 
edges incident to T, in order to find quickly a replacement edge for e, if any. 
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Graph decomposition. The second key idea is to combine randomization with a 
suitable graph decomposition. We maintain an edge decomposition of the current graph 
G into O(log n) edge disjoint subgraphs G; = (V, £;). These subgraphs are hierarchically 
ordered. The lower levels contain tightly connected portions of G (i.e., dense edge cuts), 
while the higher levels contain loosely connected portions of G (i.e., sparse cuts). For 
each level 7, a spanning forest for the graph defined by all the edges in levels i or below 
is also maintained. 


REMARKS 


R21: Note that the hard operation is the deletion of a tree edge: indeed, a spanning 
forest is easily maintained throughout edge insertions, and deleting a non-tree edge does 
not change the forest. 


R22: The goal is an update time of O(log? n): after an edge deletion, in the quest for 
a replacement edge, we can afford a number of sampled edges of O(log? n). However, if 
the candidate set of edge e is a small fraction of all non-tree edges which are adjacent to 
T, it is unlikely to find a replacement edge for e among this small sample. If we found 
no candidate among the sampled edges, we must check explicitly all the non-tree edges 
adjacent to T. After random sampling has failed to produce a replacement edge, we 
need to perform this check explicitly; otherwise we would not be guaranteed to provide 
correct answers to the queries. 


R23: Since there might be numerous edges adjacent to 7’, this explicit check could be 
an expensive operation, so it should be made a low probability event for the randomized 
algorithm. This can produce pathological updates, however, since deleting all edges in 
a relatively small candidate set, reinserting them, deleting them again, and so on will 
almost surely produce many of those unfortunate events. 


R24: The graph decomposition is used to prevent the undesirable behavior described 
above. If a spanning forest edge e is deleted from a tree at some level 7, random sampling 
is used to quickly find a replacement for e at that level. If random sampling succeeds, 
the tree is reconnected at level 7. If random sampling fails, the edges that can replace e 
in level ¢ form with high probability a sparse cut. These edges are moved to level i + 1 
and the same procedure is applied recursively on level 7 + 1. 


FACT 


F11: (Henzinger and King’s Theorem) [HeKi99] Let G be a graph with n vertices and 
Mo edges, subject to edge deletions only. A spanning forest of G can be maintained in 
O(log® n) expected amortized time per deletion, if there are at least (mg) deletions. 
The time per query is O(log n). 


Connectivity 


We now give a high level description of the fastest deterministic algorithm for the fully 
dynamic connectivity problem in undirected graphs [HoDeTh0O1]: the algorithm answers 
connectivity queries in O(log n/ log log n) worst-case running time while supporting edge 
insertions and deletions in O(log?n) amortized time. Like the randomized algorithm 
in [HeKi99], the deterministic algorithm in [HoDeTh01] maintains a spanning forest F 
of the dynamically changing graph G. 
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FACTS 


F12: Let e be a tree edge of forest F', and let T be the tree of F' containing it. When 
e is deleted, the two trees T, and JT obtained from T' after the deletion of e can be 
reconnected if and only if there is a non-tree edge in G with one endpoint in 7; and the 
other endpoint in 73. We call such an edge a replacement edge for e. In other words, 
if there is a replacement edge for e, then T is reconnected via this replacement edge; 
otherwise, the deletion of e creates a new connected component in G. 


F13: To accommodate systematic search for replacement edges, the algorithm asso- 
ciates to each edge e a level ¢(e) and, based on edge levels, maintains a set of sub-forests 
of the spanning forest F': for each level 2, forest F; is the sub-forest induced by tree 
edges of level > 2. 


Fl4: F = Fo 2D Fy D> Fh >... D Fr, where L denotes the maximum edge level. 


F15: Initially, all edges have level 0; levels are then progressively increased, but never 
decreased. The changes of edge levels are accomplished so as to maintain the following 
invariants, which obviously hold at the beginning. 


INVARIANTS 


Invariant (1): F is a maximum spanning forest of G if edge levels are regarded as 
weights. 
Invariant (2): The number of nodes in each tree of F; is at most n/2*. 


REMARKS 


R25: Invariant (1) should be interpreted as follows. Let (u,v) be a non-tree edge of 
level £(u, v), and let u---v be the unique path between u and v in F (such a path exists 
since F’ is a spanning forest of G). Let e be any edge in u---v and let ¢(e) be its level. 
Due to invariant (1), ¢(e) > €(u,v). Since this holds for each edge in the path, and 
by construction Fy,,,,) contains all the tree edges of level > (u,v), the entire path is 
contained in Fy), .e., w and v are connected in Fy, 4). 


R26: Invariant (2) implies that the maximum number of levels is L < |log, n]. 


FACTS 


F16: When a new edge is inserted, it is given level 0. Its level can be then increased 
at most |log, n| times as a consequence of edge deletions. 


F17: When a tree edge e = (v, w) of level (e) is deleted, the algorithm looks for a re- 
placement edge at the highest possible level, if any. Due to invariant (1), such a replace- 
ment edge has level  < ¢(e). Hence, a replacement subroutine Replace ((u, w) ,¢(e)) 
is called with parameters e and ¢(e). We now sketch the operations performed by this 
subroutine. 


F18: Replace((u,w),¢) finds a replacement edge of the highest level < @, if any. If 
such a replacement does not exist in level 0, we have two cases: if € > 0, we recurse on 
level €—1; otherwise, = 0, and we can conclude that the deletion of (v, w) disconnects 
v and w in G. 
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F19: During the search at level @, suitably chosen tree and non-tree edges may be 
promoted at higher levels as follows. Let Ti, and T,, be the trees of forest Fy, obtained 
after deleting (v, w) and let, w.l.o.g., T, be smaller than T,,. Then T,, contains at most 
n/2°*! vertices, since T, UT, U {(v, w)} was a tree at level £ and due to invariant (2). 
Thus, edges in T, of level 2 can be promoted at level +1 by maintaining the invariants. 
Non-tree edges incident to T, are finally visited one by one: if an edge does connect Ty 
and T,,, a replacement edge has been found and the search stops; otherwise its level is 
increased by 1. 


F20: We maintain an ET-tree, as described before, for each tree of each forest. Conse- 
quently, all the basic operations needed to implement edge insertions and deletions can 
be supported in O(log n) time. 


F21: [{HoDeTh01] A dynamic graph G with n vertices can be maintained upon inser- 
tions and deletions of edges using O(log? n) amortized time per update and answering 
connectivity queries in O(log n/ log log n) worst-case running time. 


REMARKS 


R27: In addition to inserting and deleting edges from a forest, ET-trees must also 
support operations such as finding the tree of a forest that contains a given vertex, 
computing the size of a tree, and, more importantly, finding tree edges of level @ in 
T, and non-tree edges of level @ incident to T,. This can be done by augmenting the 
ET-trees with a constant amount of information per node: see [HoDeTh01] for details. 


R28: Using an amortization argument based on level changes, the claimed O(log? n) 
bound on the update time can be finally proved. Namely, inserting an edge costs 
O(log n), as well as increasing its level. Since this can happen O(log n) times, the to- 
tal amortized insertion cost, inclusive of level increases, is O(log? n). With respect to 
edge deletions, cutting and linking O(log n) forest has a total cost O(log? n); moreover, 
there are O(log) recursive calls to Replace, each of cost O(logn) plus the cost amor- 
tized over level increases. The ET-trees over Fo = F allows answering connectivity 
queries in O(logn) worst-case time. As shown in [HoDeTh01], this can be reduced to 
O(log n/ log logn) by using a O(log n)-ary version of ET-trees. 


Minimum Spanning Trees 


A few simple changes to the previous connectivity algorithm are sufficient to main- 
tain a minimum spanning forest of a weighted undirected graph upon deletions of 
edges [HoDeTh01]. A general reduction from [HeKki01] can then be applied to make 
the deletions-only algorithm fully dynamic. 


Decremental Minimum Spanning Tree 


APPROACH 


In addition to starting from a minimum spanning forest, the only change concerns the 
function Replace, which should be implemented so as to consider candidate replacement 
edges of level @ in order of increasing weight, and not in arbitrary order. To do so, the 
ET-trees can be augmented so that each node maintains the minimum weight of a non- 
tree edge incident to the Euler tour segment below it. All the operations can still be 
supported in O(log n) time, yielding the same time bounds as for connectivity. 
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We now discuss the correctness of the algorithm. In particular, function Replace re- 
turns a replacement edge of minimum weight on the highest possible level: it is not 
immediate that such a replacement edge has the minimum weight among all levels. 
This can be proved by first showing that the following invariant, proved in [HoDeTh01], 
is maintained by the algorithm. 


INVARIANT 


Invariant (3): Every cycle C has a non-tree edge of maximum weight and minimum 
level among all the edges in C. 


FACTS 


F22: Invariant (3) can be used to prove that, among all the replacement edges, the 
lightest edge is on the maximum level. Let e1 and eg be two replacement edges with 
w(e1) < w(e2), and let C; be the cycle induced by e; in F, i = 1,2. Since F is a minimum 
spanning forest, e; has maximum weight among all the edges in C;. In particular, since 
by hypothesis w(e1) < w(e2), e2 is also the heaviest edge in cycle C = (Cy UC2)\ (CiNC2). 
Thanks to Invariant (3), e2 has minimum level in C, proving that @(e2) < é(e1). Thus, 
considering non-tree edges from higher to lower levels is correct. 


F23: [HoDeTh01] There exists a deletions-only minimum spanning forest algorithm 
that can be initialized on a graph with n vertices and m edges and supports any sequence 
of edge deletions in O(m log? n) total time. 


Fully Dynamic Minimum Spanning Tree 


The reduction used to obtain a fully dynamic algorithm is a slight generalization of the 
construction proposed by Henzinger and King [HeKi01] and works as follows. 


FACT 


F24: [{HeKi01, HoDeTh01] Suppose we have a deletions-only minimum spanning tree 
algorithm that, for any k and J, can be initialized on a graph with k vertices and | 
edges and supports any sequence of (2(1) deletions in total time O(/- t(k,1)), where t is 
a non-decreasing function. Then there exists a fully dynamic minimum spanning tree 
algorithm for a graph with n nodes starting with no edges, that, for m edges, supports 
updates in time 


3+loggm 4 
O | loge n+ S- S = t(min{n, 27}, 2”) 
i=1 j=l 


REMARKS 


R29: See [HeKi01] and [HoDeTh0O1] for a description of the construction that proves 
Fact F24. 


R30: From Fact F23 we get t(k,1) = O(log” k). Hence, by combining Fact F23 and 
Fact F24, we get the claimed result. 
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FACT 


F25: [HoDeTh01] There exists a fully dynamic minimum spanning forest algorithm 
that, for a graph with n vertices, starting with no edges, maintains a minimum spanning 
forest in O(log* n) amortized time per edge insertion or deletion. 


10.2.3. Dynamic Problems on Directed Graphs 


In this part we survey the known results for dynamic problems on directed graphs. 
In particular, we focus on two of the most fundamental problems: transitive closure 
and shortest paths. These problems play a crucial role in many applications, including 
network optimization and routing, traffic information systems, databases, compilers, 
garbage collection, interactive verification systems, industrial robotics, dataflow analy- 
sis, and document formatting. 


We first present general techniques and tools used in designing dynamic path prob- 
lems on directed graphs, and then we address the results for dynamic transitive closure 
and dynamic shortest paths. In the first problem, the goal is to maintain reachability 
information in a directed graph subject to insertions and deletions of edges. The fastest 
known algorithm supports graph updates in quadratic time and reachability queries 
in constant time [DeIt08]. In the second problem, we wish to maintain information 
about shortest paths in a directed graph subject to insertion and deletion of edges, or 
updates of edge weights. Similarly to dynamic transitive closure, this can be done in 
near-quadratic time per update and optimal time per query [Delt04, Th04]. 


General Techniques for Directed Graphs 


In this subsection we discuss the main techniques used to solve dynamic path problems 
on directed graphs. We first address combinatorial and algebraic properties, and then 
we consider some efficient data structures, which are used as building blocks in designing 
dynamic algorithms for transitive closure and shortest paths. 


Path Problems and Kleene Closures 


Path problems such as transitive closure and shortest paths are tightly related to ma- 
trix sum and matrix multiplication over a closed semiring (see [CoLeRiSt01] for more 
details). 


NOTATION: The usual sum and multiplication operations over Boolean matrices are 
denoted by + and -, respectively. 


NOTATION: Given two real-valued matrices A and B, C = A© B is the matrix product 
such that C[z, y] = mini<z<n{Al[z, z] + Blz,y]}, and D = A@B is the matrix sum such 
that D[x,y] = min{A[z, y], Bix, y]}- 


NOTATION: We also denote by AB the product A © B and by AB[x, y] entry (x,y) of 
matrix AB. 
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FACTS 


F26: Let G = (V,£E) be a directed graph and let TC(G) be the (reflexive) transitive 
closure of G. If X is the Boolean adjacency matrix of G, then the Boolean adjacency 
matrix of TC(G) is the Kleene closure of X on the {+,-,0,1} Boolean semiring: 


n-1 
i=0 


F27: Let G = (V, E) bea weighted directed graph with no negative-length cycles. If X 
is a weight matrix such that X[,y] is the weight of edge (x,y) in G, then the distance 
matrix of G is the Kleene closure of X on the {@,©,R} semiring: 


n—-1 
i=0 


The next two facts recall two well-known methods for computing the Kleene closure X* 
of ann X n matrix X. 


Logarithmic Decomposition. A simple method to compute X*, based on repeated 
squaring, requires O(n“ - log n) worst-case time, where O(n“) is the time required for 
computing the product of two matrices over a closed semiring. 


F28: This method performs log,n sums and products of the form Xj41 = X; + X?, 
where X = Xo and X* = Xjog, n- 


Recursive Decomposition. Another method, due to Munro [Mu7]1], is based on a 
Divide and Conquer strategy and computes X* in O(n“) worst-case time. 


F29: Munro observed that, if we partition a matrix X into four submatrices A, B, 
D, C of size n/2 x n/2 (considered in clockwise order), and the closure X* similarly 
into four submatrices E, F', H, G of size n/2 x n/2, then X™* is definable recursively 
according to the following equations: 


E=(A+ BD*C)* 

F= EBD 

G= D*CE 

H = D* + D*CEBD* 
Surprisingly, using this decomposition the cost of computing X* starting from X is 
asymptotically the same as the cost of multiplying two matrices over a closed semiring. 


Locally Shortest Paths 


Some combinatorial properties of shortest paths in directed graphs have been recently 
discovered by Demetrescu and Italiano [DelIt04]. In particular, we consider shortest 
paths as a special case of a broader class of paths called locally shortest paths. To 
characterize how locally shortest paths change in a fully dynamic graph, we consider 
the notions of historical path and locally historical path. 
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DEFINITIONS 


D21: A path z in a graph is locally shortest if every proper subpath of z is a shortest 
path. 


D22: A historical path is a path that has been a shortest path at some point during 
the sequence of updates, and none of its edges has been updated since then. 


Using this notion we can define a superset of locally shortest paths that are called locally 
historical paths. 


D23: A path z in a graph is locally historical if every proper subpath of 7 is a historical 
path. 


REMARKS 


R31: As an alternative equivalent definition, a path mz, is locally shortest in a graph 
if every edge (u,v) in zy satisfies the relation dry + Wuv + doy = w(Try), where dry 
denotes the distance between vertex x and vertex y in the graph, wy, is the weight of 
edge (u,v), and w(mz,) is the weight of zy. 


R32: It is not difficult to prove that the amortized number of locally shortest paths 
that may change due to an edge weight update is O(n”) if updates are partially dynamic, 
i.e., increase-only or decrease-only. 


FACTS 


F30: [Delt04] If we denote by SP, LSP, HP, and LHP, respectively, the sets of 
shortest paths, locally shortest paths, historical paths, and locally historical paths in 
a graph, then at any time the following inclusions hold: SP C LSP C LHP and 
SPC HPCLHP. 


F31: [Delt04] Let G be a graph subject to a sequence of update operations. If at 
any time throughout the sequence of updates there are at most O(z) historical paths 
between each pair of vertices, then the amortized number of paths that become locally 
historical at each update is O(zn?). 


REMARKS 


R33: Locally historical paths exhibit strong combinatorial properties in graphs subject 
to (fully) dynamic updates. In particular, it is possible to prove that the number of 
paths that become locally historical in a graph at each edge weight update depends on 
the number of historical paths in the graph. 


R34: To keep changes in locally historical paths small, it is then desirable to have 
as few historical paths as possible. Indeed, it is possible to transform every update 
sequence into a slightly longer equivalent sequence that generates only a few historical 
paths. In particular, there exists a simple smoothing strategy that, given any update 
sequence % of length &, produces an operationally equivalent sequence F'(=) of length 
O(klogk) that yields only O(logk) historical paths between each pair of vertices in 
the graph. We refer the interested reader to [DelIt04] for a detailed description of this 
smoothing strategy. According to Fact F31, this technique implies that only O(n? log k) 
locally historical paths change at each edge weight update in the smoothed sequence 
F(S). 
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R35: As elaborated in [DeIt04], locally historical paths can be maintained very effi- 
ciently. Since by Fact F30 locally historical paths include shortest paths, this yields the 
fastest known algorithm for fully dynamic all pairs shortest paths. 


Long Paths Property 


If we pick a subset S' of vertices at random from a graph G, then a sufficiently long path 
will intersect S with high probability. This can be very useful in finding a long path 
by using short searches. This property has been rediscovered many times and it has 
been exploited to design efficient algorithms for transitive closure and shortest paths 
(see e.g., [DeIt06, Ki99, UlYa91, Zw98}). 


FACT 


F32: (Ullman and Yannakakis [UlYa91]) Let S C V be a set of vertices chosen uni- 
formly at random. Then the probability that a given simple path has a sequence of more 
than (cnlogn)/|S| vertices, none of which is from S, for any c > 0, is, for sufficiently 
large n, bounded by 2~°° for some positive a. 


REMARK 


R36: As shown in [Zw98], it is possible to choose set S deterministically by a reduction 
to a hitting set problem [Ch79, Lo75]. A similar technique has also been used in [Ki99]. 


Reachability Trees 


A special tree data structure has been widely used to solve dynamic path problems 
on directed graphs. The first appearance of this tool dates back to 1981, when Even 
and Shiloach showed how to maintain a breadth-first tree of an undirected graph under 
any sequence of edge deletions [EvSh81]; they used this as a kernel for decremental 
connectivity on undirected graphs. Later on, Henzinger and King [HeKi99] showed 
how to adapt this data structure to fully dynamic transitive closure in directed graphs. 
King [Ki99] designed an extension of this tree data structure to weighted directed graphs 
for solving fully dynamic all pairs shortest paths. 


PROBLEM 

In the unweighted directed version, the goal is to maintain information about breadth- 
first search (BFS) on a directed graph G undergoing deletions of edges. In particular, 
in the context of dynamic path problems, we are interested in maintaining BFS trees of 
depth up to d, with d < n. Given a directed graph G = (V, E) and a vertex r € V, we 
would like to support any intermixed sequence of the following operations: 
Delete(x,y): delete edge (x,y) from G. 

Level(u): return the level of vertex u in the BFS tree of depth d rooted at r (return 
+oo if u is not reachable from r within distance d). 


FACT 


F33: [Ki99] Maintaining BFS levels up to depth d from a given root requires O(md) 
time in the worst case throughout any sequence of edge deletions in a directed graph 
with m initial edges. 
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REMARKS 


R37: Fact F33 means that maintaining BFS levels requires d times the time needed 
for constructing them. Since d < n, we obtain a total bound of O(mn) if there are no 
limits on the depth of the BFS levels. 


R38: As was shown in [HeKi99, Ki99], it is possible to extend the BFS data structure 
presented in this section to deal with weighted directed graphs. In this case, a shortest 
path tree is maintained in place of BFS levels: after each edge deletion or edge weight 
increase, the tree is reconnected by essentially mimicking Dijkstra’s algorithm rather 
than BFS. Details can be found in [Ki99]. 


Matrix Data Structures 


We now consider matrix data structures for keeping information about paths in dynamic 
directed graphs. As we have seen above (Path Problems and Kleene Closures), Kleene 
closures can be constructed by evaluating polynomials over matrices. It is therefore 
natural to consider data structures for maintaining polynomials of matrices subject to 
updates of entries, like the one introduced in [DeIt08]. 


PROBLEM 

In the case of Boolean matrices, the problem can be stated as follows. Let P be a 
polynomial over n x n Boolean matrices with constant degree, constant number of 
terms, and variables X;...X;,. We wish to maintain a data structure for P subject to 
any intermixed sequence of update and query operations of the following kind: 
SetRow(i, AX, X,): sets to one the entries in the i-th row of variable X, of polynomial 
P corresponding to one-valued entries in the i-th row of matrix AX. 

SetCol(i, AX, X;,): sets to one the entries in the i-th column of variable X, of polyno- 
mial P corresponding to one-valued entries in the i-th column of matrix AX. 
Reset(AX, Xz): resets to zero the entries of variable X;, of polynomial P corresponding 
to one-valued entries in matrix AX. 

Lookup(): returns the maintained value of P. 

We add to the previous four operations a further update operation especially designed 
for maintaining path problems: 

LazySet(AX, X;): sets to 1 the entries of variable X;, of P corresponding to one-valued 
entries in matrix AX. However, the maintained value of P might not be immediately 
affected by this operation. 


REMARK 


R39: Let Cp be the correct value of P that we would have by recomputing it from 
scratch after each update, and let Mp be the actual value that we maintain. If no 
LazySet operation is ever performed, then always Mp = Cp. Otherwise, Mp is not 
necessarily equal to Cp, and we guarantee the following weaker property on Mp: if 
Cp[u, v] flips from 0 to 1 due to a SetRow/SetCol operation on a variable X;,, then 
Mplu, v] flips from 0 to 1 as well. This means that SetRow and SetCol always correctly 
reveal new 1’s in the maintained value of P, possibly taking into account the 1’s in- 
serted through previous LazySet operations. This property is crucial for dynamic path 
problems. 
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FACTS 


F34: [DelIt08] Let P be a polynomial with constant degree of matrices over the Boolean 
semiring. Any SetRow, SetCol, LazySet, and Reset operation on a polynomial P can 
be supported in O(n”) amortized time. Lookup queries are answered in optimal time. 


F35: [Delt06] Let P be a polynomial with constant degree of matrices over the {min, +} 
semiring. Any SetRow, SetCol, LazySet, and Reset operation on variables of P can be 
supported in O(D-n?) amortized time, where D is the maximum number of different 
values assumed by entries of variables during the sequence of operations. Lookup queries 
are answered in optimal time. 


Dynamic Transitive Closure 


In this subsection we survey the best known algorithms for fully dynamic transitive 
closure. Given a directed graph G with n vertices and m edges, the problem consists of 
supporting any intermixed sequence of operations of the following kind: 

Insert(u,v): insert edge (u,v) in G; 

Delete(u,v): delete edge (u,v) from G; 

Query(x,y): answer a reachability query by returning “yes” if there is a path from 
vertex x to vertex y in G, and “no” otherwise. 


FACTS 


F36: A simple-minded solution to this problem consists of maintaining the graph under 
insertions and deletions, searching if y is reachable from x at any query operation. This 
yields O(1) time per update (Insert and Delete), and O(m) time per query, where m 
is the current number of edges in the maintained graph. 


F37: Another simple-minded solution would be to maintain the Kleene closure of the 
adjacency matrix of the graph, rebuilding it from scratch after each update operation. 
Using the recursive decomposition of Munro [Mu71] discussed in the section on Path 
Problems and Kleene Closures and fast matrix multiplication, this takes constant time 
per reachability query and O(n”) time per update, where w < 2.3727 is the current best 
exponent for matrix multiplication [CoWi90, Va12]. 


REMARKS 


R40: Despite many years of research in this topic, no better solution to this problem 
was known until 1995, when Henzinger and King [HeKi99] proposed a randomized Monte 
Carlo algorithm with one-sided error supporting a query time of O(n/logn) and an 
amortized update time of O(nin->8 log? n), where ri is the average number of edges in 
the graph throughout the whole update sequence. Since 7 can be as high as O(n?), 
their update time is O(n?!6 log? n). 


R41: Khanna, Motwani and Wilson [KhMoWi96] proved that, when a lookahead of 
Q(n°-!8) in the updates is permitted, a deterministic update bound of O(n?:!8) can be 
achieved. 
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R42: King and Sagert [KiSa02] showed how to support queries in O(1) time and 
updates in O(n?:?°) time for general directed graphs and O(n?) time for directed acyclic 
graphs; their algorithm is randomized with one-sided error. These bounds were further 
improved by King [Ki99], who exhibited a deterministic algorithm on general digraphs 
with O(1) query time and O(n? logn) amortized time per update operations, where 
updates are insertions of a set of edges incident to the same vertex and deletions of an 
arbitrary subset of edges. 


R43: Using a completely different approach, Demetrescu and Italiano [DeIt08] obtained 
a deterministic fully dynamic algorithm that achieves O(n?) amortized time per update 
for general directed graphs. Sankowski [Sa04] showed how to make the amortized bound 
worst-case. 


R44: We note that each update might change a portion of the transitive closure as 
large as Q(n”). Thus, if the transitive closure has to be maintained explicitly after 
each update so that queries can be answered with one lookup, O(n?) is the best update 
bound one could hope for. 


R45: By combining in a novel way techniques of Italiano [It86, It88], King [Ki99], 
King and Thorup [KiTh01] and Frigioni et al. [FrMiZa01], Roditty [Ro08] showed how 
to reduce from O(n*) to O(mn) the preprocessing time required by the algorithm of 
Demetrescu and Italiano [DeIt08). 


R46: If one is willing to pay more for queries, Demetrescu and Italiano [DelIt05] 
showed how to break the O(n?) barrier on the single-operation complexity of fully 
dynamic transitive closure: building on a previous path counting technique introduced 
by King and Sagert [KiSa02], they devised a randomized algorithm with one-sided 
error for directed acyclic graphs that achieves O(n!-°”°) worst-case time per update and 
O(n°-°’>) worst-case time per query. 


R47: The bounds of Demetrescu and Italiano [DeIt05] were extended to general 
directed graphs by Sankowski [Sa04]. Sankowski [Sa04] also showed how to achieve 
O(n14%°) worst-case time both per update and per query. 


R48: Further trade-offs between queries and updates were given by Roditty and 
Zwick [RoZw08]. They presented one deterministic algorithm, with amortized update 
time of O(m,/n) and worst-case query time of O(,/n ), and one randomized algorithm 
with amortized update time of O(m°-°8n) and worst-case query time of O(m°-*%). Both 
algorithms are competitive for sparse graphs. 


King’s O(n? logn) Update Algorithm 


King [Ki99] devised the first deterministic near-quadratic update algorithm for fully 
dynamic transitive closure. The algorithm is based on the tree data structure considered 
in §10.2.4 (Reachability Trees) and on the logarithmic decomposition discussed in §10.2.4 
(Path Problems and Kleene Closures). It maintains explicitly the transitive closure of a 
graph G in O(n? logn) amortized time per update, and supports inserting and deleting 
several edges of the graph with just one operation. Insertion of a bunch of edges incident 
to a vertex and deletion of any subset of edges in the graph require asymptotically the 
same time of inserting/deleting just one edge. 
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APPROACH 

The algorithm maintains logn + 1 levels: level 1, 0 < i < logn, maintains a graph G; 
whose edges represent paths of length up to 2° in the original graph G. Thus, Go = G 
and Gog n is the transitive closure of G. 


FACTS 


F38: Each level i is built on top of the previous level i—1 by keeping two trees of depth 
< 2 rooted at each vertex v of G: an out-tree OUT;(v) maintaining vertices reachable 
from v by traversing at most two edges in G;_;, and an in-tree JN;(v) maintaining 
vertices that reach v by traversing at most two edges in G;_1. An edge (a, y) will be in 
G; if and only if « € IN;(v) and y € OUT;,(v) for some v. 


F39: The 2logn trees IN;(v) and OUT;(v) are maintained with instances of the BFS 
tree data structure considered in the section on Reachability Trees. 


F40: To update the levels after an insertion of edges around a vertex v in G, the 
algorithm simply rebuilds IN;(v) and OUT;(v) for each i, 1 < i < logn, while other 
trees are not touched. This means that some trees might not be up to date after an 
insertion operation. Nevertheless, any path in G is represented in at least the in/out 
trees rooted at the latest updated vertex in the path, so the reachability information is 
correctly maintained. This idea is the key ingredient of King’s algorithm. 


F41: When an edge is deleted from G;, it is also deleted from any data structures 
IN;(v) and OUT;(v) that contain it. For details, see [Ki99]. 


Demetrescu and Italiano’s O(n”) Update Algorithm 


The algorithm by Demetrescu and Italiano [DeIt08] is based on the matrix data structure 
considered in the section on Matrix Data Structures and on the recursive decomposition 
discussed in the section on Path Problems and Kleene Closures. It maintains explicitly 
the transitive closure of a graph in O(n?) amortized time per update, supporting the 
same generalized update operations of King’s algorithm, i.e., insertion of a bunch of 
edges incident to a vertex and deletion of any subset of edges in the graph with just one 
operation. This is the best known update bound for fully dynamic transitive closure 
with constant query time. 


APPROACH 
The algorithm maintains the Kleene closure X* of the n x n adjacency matrix X of the 
graph as the sum of two matrices X, and Xo. 


NOTATION 
Let V, be the subset of vertices of the graph corresponding to the first half of indices of 
X, and let V2 contain the remaining vertices. 


FACTS 


F42: Both matrices X; and X»2 are defined according to Munro’s equations given 
in the section on Path Problems and Kleene Closures, but in such a way that paths 
appearing due to an insertion of edges around a vertex in V; are correctly recorded in 
X 1, while paths that appear due to an insertion of edges around a vertex in V2 are 
correctly recorded in X2. Thus, neither X; nor X2 encodes complete information about 
X*, but their sum does. 


Section 10.2. Dynamic Graph Algorithms 1231 


F43: In more detail, assuming that X is decomposed in sub-matrices A, B, C, D 
as explained in the section on Path Problems and Kleene Closures, and that X,, and 
X» are similarly decomposed in sub-matrices FE), F,, G,, H, and Ee, Fo, Go, Ho, 
the algorithm maintains X,; and X2 with the following 8 polynomials using the data 
structure discussed in the section on Matrix Data Structures: 


Q=A+BP°C Ey = E,BH3CE, 

Fy = B72 BP Fo = E,BH3 

G, = PCE? Gy = H3CE, 

Hy = PCE? BP R=D+CE?7B 
where P = D*, E, = Q*, and Hj = R* are Kleene closures maintained recursively as 
smaller instances of the problem of size n/2 x n/2. 


F44: To support an insertion of edges around a vertex in Vj, strict updates are 
performed on polynomials Q, F), Gi, and H, using SetRow and SetCol, while Fa, 
F5, Go, and R are updated with LazySet. 


F45: Insertions around V2 are performed symmetrically, while deletions are supported 
via Reset operations on each polynomial in the recursive decomposition. 


F46: Finally, P, E,, and Hz are updated recursively. The low-level details of the 
method appear in [DeIt08]. 


Dynamic Shortest Paths 


In this subsection we survey the best known algorithms for fully dynamic all pairs 
shortest paths (in short APSP). Given a weighted directed graph G with n vertices and 
m. edges, the problem consists of supporting any intermixed sequence of operations of 
the following kind: 

Update(u,v,w): updates the weight of edge (u,v) in G to the new value w (if w = +00 
this corresponds to edge deletion); 

Query(z,y): returns the distance from vertex x to vertex y in G, or +co if no path 
between them exists. 


NOTATION 2, 
In the the following, we use O(f(n)) to denote O(f(n) polylog(n)). 


REMARKS 


R49: The dynamic maintenance of shortest paths has a remarkably long history, as 
the first papers date back to 35 years ago [Lo67, Mu67, Ro68]. Since then, many 
dynamic shortest paths algorithms have been proposed (see, e.g., [EvGa85, FrMaNa98, 
FrMaNa00, RaRe96a, RaRe96b, Ro85]), but their running times in the worst case were 
comparable to recomputing APSP from scratch. 


R50: The first dynamic shortest path algorithms which are provably faster than re- 
computing APSP from scratch only worked on graphs with small integer weights. 


R51: In particular, Ausiello et al. [AuItMaNa91] proposed a decrease-only shortest 
path algorithm for directed graphs having positive integer weights less than C: the 
amortized running time of their algorithm is O(Cnlogn) per edge insertion. 
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R52: Henzinger et al. [HeKiRaSu97] designed a fully dynamic algorithm for APSP 
on planar graphs with integer weights, with a running time of O(n*/*log(nC)) per 
operation. 


R53: This bound has been improved by Fakcharoemphol and Rao in [FaRa06], who 
designed a fully dynamic algorithm for single-source shortest paths in planar directed 
graphs that supports both queries and edge weight updates in O(n4/° log!8/ > nm) amor- 
tized time per edge operation. 


R54: The first big step on general graphs and integer weights was made by King [Ki99], 
who presented a fully dynamic algorithm for maintaining all pairs shortest paths in 
directed graphs with positive integer weights less than C: the running time of her 
algorithm is O(n?-°,/Clogn ) per update. 


R55: Demetrescu and Italiano [Delt06] gave the first algorithm for fully dynamic 
APSP on general directed graphs with real weights assuming that each edge weight can 
attain a limited number S' of different real values throughout the sequence of updates. 
In particular, the algorithm supports each update in O(n?>\/S log? n) amortized time 
and each query in O(1) worst-case time. 


R56: The same authors discovered the first algorithm that solves the fully dynamic all 
pairs shortest paths problem in its generality [DeIt04]. The algorithm maintains explic- 
itly information about shortest paths, supporting any edge weight update in O(n? log? n) 
amortized time per operation in directed graphs with non-negative real edge weights. 
Distance queries are answered with one lookup and actual shortest paths can be recon- 
structed in optimal time. 


R57: Using the same approach as Demetrescu and Italiano [DeIt04], but with a differ- 
ent smoothing strategy, Thorup [Th04] showed how to improve slightly to O(n?(log n + 
log?((m-+n)/n))) the amortized bound per update for the fully dynamic APSP problem, 
while still maintaining constant query times. Thorup’s algorithm works with negative 
weights as well. 


R58: We note that each update might change a portion of the distance matrix as large 
as Q(n”). Thus, if the distance matrix has to be maintained explicitly after each update 
so that queries can be answered with one lookup, O(n?) is the best update bound one 
could hope for. 


R59: The currently best worst-case bound per update for the fully dynamic APSP 
problem is much higher than known amortized bounds. Indeed Thorup [Th05] has 
shown a worst-case bound per update of O(n?*7°). 


R60: In the special case of unweighted graphs, Roditty and Zwick [RoZw11] have 
shown how to support updates in O(m,/n) amortized time and queries in O(n3/*) 
worst-case time. Those bounds are competitive for sparse graphs. 


R61: Other deletions-only algorithms for APSP, in the simpler case of unweighted 
graphs, are presented in [BaHaSe07]. 


R62: As shown by Sankowski [Sa05], dynamic shortest distances in unweighted graphs 
can be maintained in O(n''9%*) randomized time per update and O(n!?88) randomized 
time per query. 
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King’s O(n?°\/Clogn) Update Algorithm 


The dynamic shortest paths algorithm by King [Ki99] is based on the long paths property 
discussed in the section on Long Paths Property and on the tree data structure of the 
section on Reachability Trees. Similarly to the transitive closure algorithms previously 
described, generalized update operations are supported within the same bounds, i.e., 
insertion (or weight decrease) of a bunch of edges incident to a vertex, and deletion (or 
weight increase) of any subset of edges in the graph with just one operation. 


APPROACH 

The main idea of the algorithm is to maintain dynamically all pairs shortest paths up 
to a distance d, and to recompute longer shortest paths from scratch at each update 
by stitching together shortest paths of length < d. For the sake of simplicity, we only 
consider the case of unweighted graphs: an extension to deal with positive integer 
weights less than C is described in [Ki99]. 


FACTS 


F47: To maintain shortest paths up to distance d, similarly to the transitive closure 
algorithm by King described in §10.2.5, the algorithm keeps a pair of in/out shortest 
paths trees [N(v) and OUT(v) of depth < d rooted at each vertex v. Trees IN(v) 
and OUT(v) are maintained with the decremental data structure mentioned in §10.2.4 
(Reachability Trees). It is easy to prove that, if the distance d,, between any pair of 
vertices x and y is at most d, then dzy is equal to the minimum of dz, + dyy over all 
vertices v such that « € IN(v) and y € OUT(v). To support updates, insertions of 
edges around a vertex v are handled by rebuilding only [N(v) and OUT(v), while edge 
deletions are performed via operations on any trees that contain them. The amortized 
cost of such updates is O(n?d) per operation. 


F48: To maintain shortest paths longer than d, the algorithm exploits the long paths 
property of Fact F30: in particular, it hinges on the observation that, if H is a random 
subset of O((n log n)/d) vertices in the graph, then the probability of finding more than 
d consecutive vertices in a path, none of which is from H, is very small. Thus, if we 
look at vertices in H as “hubs,” then any shortest path from x to y of length > d can be 
obtained by stitching together shortest subpaths of length < d that first go from x to 
a vertex in H, then jump between vertices in H, and eventually reach y from a vertex 
in H. This can be done by first computing shortest paths only between vertices in H 
using any cubic-time static all-pairs shortest paths algorithm, and then by extending 
them at both endpoints with shortest paths of length < d to reach all other vertices. 
This stitching operation requires O(n?|H|) = O((n? log n)/d) time. 


F49: Choosing d = V/nlogn yields an O(n?°,/logn) amortized update time. As 
mentioned in the section on the Long Paths Property, since H can be computed deter- 
ministically, the algorithm can be derandomized. For further details, see [Ki99]. 


Demetrescu and Italiano’s O(n? log’ n) Update Algorithm 


Demetrescu and Italiano [DeIt04] devised the first deterministic near-quadratic update 
algorithm for fully dynamic all-pairs shortest paths. This algorithm is also the first 
solution to the problem in its generality. It is based on the notions of locally shortest 
path, locally historical path, and historical paths in a graph subject to a sequence of 
updates, as discussed in the section on Locally Shortest Paths. 
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APPROACH 

The main idea is to maintain dynamically the locally historical paths of the graph in a 
data structure. Since by Fact F30 shortest paths are locally historical, this guarantees 
that information about shortest paths is maintained as well. 


FACTS 


F50: To support an edge weight update operation, the algorithm implements the 
smoothing strategy mentioned in the section on Locally Shortest Paths and works in 
two phases. It first removes from the data structure all maintained paths that contain 
the updated edge: this is correct since historical paths, in view of their definition, are 
immediately invalidated as soon as they are touched by an update. This means that 
also locally historical paths that contain them are invalidated and have to be removed 
from the data structure. As a second phase, the algorithm runs an all-pairs modification 
of Dijkstra’s algorithm [Di59], where at each step a shortest path with minimum weight 
is extracted from a priority queue and it is combined with existing historical paths to 
form new locally historical paths. At the end of this phase, paths that become locally 
historical after the update are correctly inserted in the data structure. 


F51: The update algorithm spends O(log) time for each of the O(zn?) new locally 
historical path (see Fact F31). Since the smoothing strategy lets z = O(logn) and 
increases the length of the sequence of updates by an additional O(log n) factor, this 
yields O(n? log? n) amortized time per update. For further details, see [DeIt04]. 


10.2.4 Research Issues 


In this work we have surveyed the algorithmic techniques underlying the fastest 
known dynamic graph algorithms for several problems, both on undirected and on di- 
rected graphs. Most of the algorithms that we have presented achieve bounds that 
are close to optimum. In particular, we have presented fully dynamic algorithms 
with polylogarithmic amortized time bounds for connectivity and minimum spanning 
trees [HoDeTh01] on undirected graphs. It remains an interesting open problem to show 
whether polylogarithmic update bounds can be achieved also in the worst case: we recall 
that for both problems the current best worst-case bound is O(,/n) per update, and it 
is obtained with the sparsification technique [EpGaltNi97] described in Section 10.2.2. 


For directed graphs, we have shown how to achieve constant-time query bounds 
and nearly-quadratic update bounds for transitive closure and all pairs shortest paths. 
These bounds are close to optimal in the sense that one update can make as many 
as Q(n”) changes to the transitive closure and to the all-pairs shortest paths matrices. 
While the quadratic bounds for dynamic transitive closure are worst-case, the nearly 
quadratic bounds for dynamic shortest paths are amortized, and the best worst-case 
update bound in this case is only O(n?-7°). Can this worst-case bound be improved? 
Furthermore, if the problem is just to maintain reachability or shortest paths between 
two fixed vertices s and t, no solution better than the static is known. If one is willing 
to pay more for queries, Demetrescu and Italiano [Delt05] have shown how to break 
the O(n) barrier on the single-operation complexity of fully dynamic transitive closure 
for directed acyclic graphs. It remains an interesting open problem to show whether 
effective query/update tradeoffs can be achieved for shortest paths problems in general 
graphs. 
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Finally, dynamic algorithms for other fundamental problems such as matching and 
flow problems deserve further investigation. 


Further Information 


Research on dynamic graph algorithms is published in many computer science journals, 
including Algorithmica, Journal of ACM, ACM Transactions on Algorithms, Journal 
of Algorithms, Journal of Computer and System Science, SIAM Journal on Computing 
and Theoretical Computer Science. Work on this area is published also in the proceed- 
ings of general theoretical computer science conferences, such as the ACM Symposium 
on Theory of Computing (STOC), the IEEE Symposium on Foundations of Computer 
Science (FOCS) and the International Colloquium on Automata, Languages and Pro- 
gramming (ICALP). More specialized conferences devoted exclusively to algorithms are 
the ACM-SIAM Symposium on Discrete Algorithms (SODA), and the European Sym- 
posium on Algorithms (ESA). 
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INTRODUCTION 


Research on graph drawing has been conducted within several diverse areas, includ- 
ing discrete mathematics (topological graph theory, geometric graph theory, order the- 
ory), algorithmics (graph algorithms, data structures, computational geometry, VLSI), 
and human-computer interaction (visual languages, graphical user interfaces, software 
visualization). In this section, we overview two different aspects of the current research 
in graph drawing: the study of the graph theoretic properties of families of geometric 
representations of graphs and the algorithmic issues involved in computing a drawing 
of a graph that satisfies a given set of geometric constraints. 


10.3.1 Types of Graphs and Drawings 


Graph drawing concerns geometric representations of graphs, and it has important 
applications to key computer technologies such as software engineering, database sys- 
tems, visual interfaces, and computer-aided design. 


Types of Graphs 


First, we define some terminology on graphs pertinent to graph drawing. Throughout 
this section, let n and m be the number of graph vertices and edges, respectively, and 
let d be the maximum vertex degree (i.e., number of incident edges). 
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DEFINITIONS 
D1: A degree-k graph is a graph with maximum degree d < k. 


D2: A transitive edge of a digraph is an edge (u,v) such that there is a directed 
path from u to v not containing edge (wu, v). 


D3: A reduced digraph is a digraph without transitive edges. 
D4: A source vertex of a digraph is a vertex without incoming edges. 


D5: A sink vertex of a digraph (also called a target) is a vertex without outgoing 
edges. 


D6: An st-digraph (also called a bipolar digraph) is an acyclic digraph with exactly 
one source and one sink, which are joined by an edge. 


D7: A biconnected graph is a 2-connected graph; that is, any two vertices are joined 
by two vertex-disjoint paths. 


D8: A triconnected graph is a 3-connected graph; that is, any two vertices are joined 
by three (pairwise) vertex-disjoint paths. 


D9: A rooted tree is a directed tree with a distinguished vertex, called the root, such 
that each vertex lies on a directed path to the root. (We observe that this reverses the 
usual convention.) 


D10: A binary tree is a rooted tree such that each vertex has at most two incoming 
edges. 


D11: A ternary tree is a rooted tree such that each vertex has at most three incoming 
edges. 


D12: A layered (di)graph is a (di)graph whose vertices are partitioned into sets, 
called layers. A rooted tree can be viewed as a layered digraph where the layers are sets 
of vertices at the same distance from the root. 


D13: A k-layered (di)graph layered (di)graph has & layers. 


Types of Drawings 


In a drawing of a graph, vertices are represented by points (or by geometric figures such 
as circles or rectangles) and edges are represented by curves such that any two edges 
intersect at most in a finite number of points. The following definitions are relative to 
drawings in the plane, which are the main subject of this section. 


DEFINITIONS 
D14: Ina polyline drawing, each edge is a polygonal chain (see Figure 10.3.1(a)). 


D15: In a straight-line drawing, each edge is a straight-line segment (see Fig- 
ure 10.3.1(b)). 
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D16: In an orthogonal drawing, each edge is a chain of horizontal and vertical 
segments (see Figure 10.3.1(c)). 


D17: A bend in a polyline drawing is a point where two segments belonging to 
the same edge meet (see Figure 10.3.1(a)). 


D18: An orthogonal representation of an orthogonal drawing is in terms of the 
bends along each edge and the angles around each vertex. 


D19: A crossing is a point of a graph drawing where two edges intersect (see 
Figure 10.3.1(b)). 


D20: A grid drawing is a polyline drawing such that the vertices, crossings, and 
bends all have integer coordinates. 


D21: Ina planar drawing, no two edges cross (see Figure 10.3.1(d)). 
D22: A planar (di)graph is a (di)graph that admits a planar drawing. 


D23: An imbedded (di)graph is a planar (di)graph with a prespecified topological 
imbedding (i.e., set of faces), which must be preserved in the drawing. 


D24: Inan upward drawing of a digraph, each edge is monotonically nondecreasing 
in the vertical direction (see Figure 10.3.1(d)). 


D25: An upward planar digraph admits an upward planar drawing. 


D26: Ina layered drawing of a layered graph (also called a hierarchical drawing), 
the vertices in the same layer all lie on the same horizontal line. 


D27: A face is a region of a planar drawing, and the unbounded region is called the 
external face. 


D28: An outerplanar (di)graph is a planar (di)graph that admits a planar drawing 
with all vertices on the boundary of the external face. 


D29: A series-parallel digraph is a planar digraph with a single source s and a single 
sink t recursively defined as follows: (i) a single edge (s,t) is a series-parallel digraph. 
Given two series-parallel digraphs G’ and G” with sources s’ and s”, respectively, and 
sinks t’ and t’”, respectively, (ii) the digraph obtained by identifying ¢t’ with s” is a 
series-parallel digraph; (iii) the digraph obtained by identifying s’ with s” and t’ with 
t’’ is a series-parallel digraph. 


D30: A series-parallel graph is the underlying undirected graph of a series-parallel 
digraph. 


D31: <A convex drawing is a planar straight-line drawing of a graph such that the 
boundary of each face is a convex polygon. 


D32: A visibility drawing of a graph is based on a geometrically visible relation; 
e.g., the vertices might be drawn as horizontal segments, and the edges associated with 
vertically visible segments. 
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D33: <A dominance drawing is an upward drawing of an acyclic digraph such that 
there exists a directed path from vertex u to vertex v if and only if x(u) < a(v) and 
y(u) < y(v), where x(-) and y(-) denote the coordinates of a vertex. 


D34: An hv-drawing is an upward orthogonal straight-line drawing of a binary tree 
such that the drawings of the subtrees of each node are separated by a horizontal or 
vertical line. 


EXAMPLE 


El: In Figure 10.3.1 the first three drawings are of the complete bipartite graph 13 3. 


(a) (b) (c) (d) 


Figure 10.3.1: Drawings: (a) polyline; (b) straight-line; (c) orthogonal; (d) planar up- 
ward. 


REMARKS 


R1: Polyline drawings provide great flexibility since they can approximate drawings 
with curved edges. However, edges with more than two or three bends may be difficult 
to “follow” for the eye. Also, a system that supports editing of polyline drawings is 
more complicated than one limited to straight-line drawings. Hence, depending on the 
application, polyline or straight-line drawings may be preferred. 


R2: If vertices are represented by points, then orthogonal drawings exist only for 
graphs of maximum vertex degree 4. 


R3: The series-parallel (di)graphs defined above are often called two-terminal series- 
parallel (di)graphs. Throughout this section series-parallel (di)graphs have no multi- 
ple edges. 


10.3.2 Combinatorics of Some Geometric Graphs 


Geometric graphs [PS90, OBS92] are straight-line drawings. Geometric graphs are 
often studied under the assumption that they satisfy some additional geometric con- 
straints. For example, many papers study the structure of geometric graphs that avoid 
forbidden edge configurations, such as the k-quasi planar graphs where no k mutually 
crossing edges are allowed (see, e.g. [ATO7, AAP*97, FPS11, PRTT06, DDLM12b]), 
k-planar graphs, where no edge can be crossed more than k times (see, e.g. [BKRSO1, 
FM07, Kor08, HELP12]), the ACL, graphs where any two crossing edges cannot form 
an angle smaller than a given value a (see, e.g. [AFT11, ACDT11, ABS11a, ABS11b, 
AFK+12, DDE+12, DDLM11, DEL10, DEL11, DGMW11, DDEL11, vK11] and [DL12] 
for a survey). 
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In this section we shall, however, concentrate on another broad family of geometric 
graphs, namely, the proximity graphs. Results about some other types of geometric 
graphs are described in Section 10.3.7. A proximity graph is a geometric graph such 
that pairs of adjacent vertices are represented as pairs of points that are deemed to 
be “sufficiently close,” while pairs of non-adjacent vertices are “relatively far” from 
each other. At a first, broad approximation, the definition of closeness in a proximity 
drawing can be either based on the concept of proximity region or based on a global 
proximity measure. In a proximity region based proximity graph two or more vertices 
are adjacent if and only if some suitably defined region that describes the neighborhood 
of these vertices contains at most k other vertices, for a given integer value k > 0. 
Global proximity, in turn, gives rise to geometric graphs where the overall sum of the 
lengths of the edges is minimized. See, e.g., the survey by Jaromczyk and Toussaint 
[JT92] for extensive lists of different definitions of proximity. 

It is worth observing that the problem of analyzing the combinatorial properties of 
a given type of proximity graph naturally raises the question of the characterization of 
those graphs which admit the given type of straight-line drawing. This, in turn, leads 
to the investigation of the design of efficient algorithms for computing such a drawing 
when one exists. Although these questions are far from being resolved in general, many 
partial answers have appeared in the literature. See, e.g., [Lio] for an up-to-date survey 
of these questions. We report below some basic facts and results. 


Delaunay Triangulations 


DEFINITIONS 


D35: A Delaunay triangulation is a planar straight-line drawing with all internal 
faces triangles and such that two adjacent vertices u and v are adjacent if and only if 
all disks having @,% as a chord do not contain any other vertices in their interior (the 
disks are assumed to be open sets). 


D36: A planar triangulated graph is Delaunay drawable if it admits a drawing that 
is a Delaunay triangulation. 


D37: A Voronoi diagram is the dual graph of a Delaunay triangulation. 


FACTS 
F1: All Delaunay drawable triangulations are 1-tough and have perfect matchings [Dil90b]. 
F2: All maximal outerplanar graphs are Delaunay drawable [Dil90a]. 


F3: Any triangulation without chords or non-facial triangles is Delaunay drawable [DS96]. 


REMARKS 


R4: Di Battista and Vismara [DV96] give a characterization based on a non-linear 
system of equations involving the angles in the triangulation. 


R5: Liotta and Meijer have studied the combinatorial properties of a Voronoi diagram. 
In particular, a characterization of Voronoi drawable trees can be found in [LM03]. 
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G-drawings and Rectangle of Influence Drawings 


DEFINITIONS 


D38: In 1985, Kirkpatrick and Radke [KR85, Rad88] introduced a family of proximity 
regions called 3-neighborhoods, denoted by Riu, v, 8] and defined as follows: 


1. For 6 = 0, Ru, v, 6] is the line segment WW. 

2. For 0 < 8 < 1, Rlu,v,G] is the intersection of the two closed disks of radius 
d(u,v)/(28) passing through both u and v. 

3. For 1 < 8 < ~w, R[u,v, 6] is the intersection of the two closed disks of radius 
Gd(u,v)/2 and centered on the line through wu and v. 

4, For 8 = ov, R[u, v, f] is the closed infinite strip perpendicular to the line segment 
WW. 


D39: For a given real value 6 > 0, a 6-drawing is a geometric graph where two 
vertices u and v are adjacent if and only if R[u, v, 6] does not contain any vertices other 
than u and v in its interior. A Gabriel graph is a 3-drawing for 6 = 1. 


D40: For a given real value 8 > 0, a weak 6-drawing is a geometric graph where 
if two vertices u and v are adjacent then R[u, v, 3] does not contain any vertices other 
than u and v in its interior. 


D41: The rectangle of influence graph is a geometric graph such that there exists 
an edge (u,v) if and only if the axis-aligned rectangle having u and v at opposite corners 
does not contain any other vertices (the rectangle is assumed to be an open set in some 
papers and a closed set in some other papers). 


D42: A weak rectangle of influence drawing is a geometric graph where if two 
vertices u and v are adjacent then the axis-aligned rectangle having u and v at opposite 
corners does not contain any other vertices. 


FACTS 


F4: Families of graphs that admit a 6-drawing for different values of (@ are stud- 
ied in [BDLL95, BLL96, LS93, LL97, IRO7, SIRO8]. Weak 6-drawings are studied 
in [DLW06, LL97, PV04] 


F5: Different families of graphs that admit a rectangle of influence drawing are de- 
scribed in [LLMW98]. Weak rectangle of influence drawings are studied in [BBM99, 
MMNO09, AB12]. 


Minimum Spanning Trees 


DEFINITIONS 


D43: A minimum spanning tree of a set P of points is a connected, straight-line 
drawing that has P as vertex set and minimizes the total edge length. 


D44: A tree T is drawable as a minimum spanning tree if there exists a set P of 
points such that the minimum spanning tree of P is isomorphic to T. (The problem is 
that whatever plane locations are assigned to vertices of the tree JT’, perhaps the image 
of T itself is not the minimum spanning tree for those vertex locations.) 


Section 10.3. Drawings of Graphs 1245 


FACTS 


The problem of testing whether a tree can be drawn as a Euclidean minimum spanning 
tree in the plane is essentially solved. The 3-dimensional counterpart of the problem is 
not yet solved. 


F6: Monma and Suri [MS92] show that each tree with maximum vertex degree at 
most five can be drawn as a minimum spanning tree of some set of vertices. There is a 
linear time (real RAM) algorithm. No tree with maximum degree greater than six can 
be drawn as a minimum spanning tree. 


F7: Eades and Whitesides [EW96] show that it is NP-hard to decide whether trees of 
maximum degree equal to six can be drawn as minimum spanning trees. 


F8: No trees with maximum degree greater than twelve can be drawn as a Euclidean 
minimum spanning tree in 3D-space, while all trees with vertex degree at most nine are 
drawable [LD95]. 


F9: King [Kin06] improves this last result by showing that all trees whose vertices 
have vertex degree at most ten can be realized as a Euclidean minimum spanning tree 
in 3-dimensional space. 


One of the most challenging questions in the seminal paper by Monma and Suri [MS92] 
was about the area required by a minimum weight drawing of a tree. Namely, the 
construction by Monma and Suri used a grid of size O(2"’) x O(2”’) and the authors 
conjectured an exponential lower bound for minimum weight drawings of trees with 
maximum vertex degree five (i.e., the existence of a tree T with n vertices such that any 
minimum weight drawing of T requires area at least c” x c” for some constant c > 1). 


F10: The above long standing conjecture by Monma and Suri was recently proved to 
be correct by Angelini et al. [ABCt11], who describe a tree T with n vertices having 
maximum degree five such that in any minimum weight drawing of T the ratio between 
the longest and the shortest edge is 2°(), which implies that the drawing requires 
exponential area. 


F11: Frati and Kaufmann [FK11] proved that the exponential area lower bound of 
minimum weight drawings of trees does not hold for maximum vertex degree smaller 


than five. 


F12: The area bound for a complete binary tree has been further reduced in [DDLM10, 
DDLM 12a]. 


Minimum Weight Triangulations 


DEFINITIONS 


D45: A triangulation T of a set P of points on the plane is a straight-line drawing 
whose vertices are the elements in P and all internal faces are triangles. 


D46: T isa minimum weight triangulation if it is a triangulation of P that 
minimizes the total edge length. 
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D47: A minimum weight drawing of a planar triangulated graph G is a straight-line 
drawing [ of G with the additional property that [ is a minimum weight triangulation 
of the points representing the vertices. 


D48: Ifa graph admits a minimum weight drawing it is called minimum weight 
drawable; otherwise, it is called minimum weight forbidden. 


FACTS 


Little is known about the problem of constructing a minimum weight drawing of a 
planar triangulation. 


F13: The problem of computing a Euclidean minimum weight triangulation of a set 
of points in the plane is NP-hard [MR08}. 


F14: All maximal outerplanar triangulations are minimum weight drawable and a 
linear time (real RAM) drawing algorithm for computing a minimum weight drawing 
of these graphs is also known [LL96]. 


This naturally leads us to investigate the internal structure of minimum weight drawable 
triangulations. 


F15: In [LL02] Lenhart and Liotta examine the endoskeleton of a triangulation: that 
is, the subgraph induced by the internal vertices of the triangulation. They construct 
skeletons that cannot appear in any minimum weight drawable triangulation, skele- 
tons that do appear in minimum weight drawable triangulations, and skeletons that 
guarantee minimum weight drawability. 


F16: Wang, Chin, and Yang [WCY00] also focus on the minimum weight drawability 
of triangulations and show examples of triangulations with acyclic skeletons that do not 
admit a minimum weight drawing. 


F17: There exists an infinite class of minimum weight drawable triangulations that 
cannot be realized as Delaunay triangulations (that is, for any triangulation T of the 
class, there does not exist a set P of points such that the Delaunay triangulation of P 
is isomorphic to T) [LLO02] . 


It is worth remarking that the study of the geometric differences between the minimum- 
weight and Delaunay triangulations of a given set of points in order to compute good 
approximations of the former has a long tradition (see, e.g., [Kir80, LK96, MZ79]); 
little is known about the combinatorial difference between Delaunay triangulations and 
minimum-weight triangulations. 


Open Problems 


Pl. Give a complete combinatorial characterization of Delaunay drawable triangula- 
tions. 

P2. Let T be a tree with maximum vertex degree at most twelve. Is there a polynomial 
time algorithm to decide whether T can be drawn as a Euclidean minimum spanning 
tree in 3D-space? If so, compute such a drawing. 
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P3. Define new families of minimum weight drawable triangulations. For example, 
characterize the class of triangulations with acyclic skeleton that admit a minimum 
weight drawing. 


P4. Investigate the combinatorial relationship between minimum weight and Delau- 
nay drawable triangulations. Are there any Delaunay drawable and minimum weight 
forbidden triangulations? 


P5. Further study the combinatorial structure of proximity graphs. For example, 
characterize the family of Gabriel drawable triangulations, that is, the family of those 
triangulations that admit a straight-line drawing where the angles of each triangular 
face are less than 7/2. 


10.3.3. Properties of Drawings and Bounds 


For various classes of graphs and drawing types, many universal/existential upper 
and lower bounds for specific drawing properties have been discovered. Such bounds 
typically exhibit tradeoffs between drawing properties. A universal bound applies to all 
the graphs of a given class. An existential bound applies to infinitely many graphs of the 
class. Whenever we give bounds on the area or edge length, we assume that the drawing 
is constrained by some resolution rule that prevents it from being arbitrarily scaled 
down, reduced by an arbitrary scaling (e.g., requiring a grid drawing, or stipulating a 
minimum unit distance between any two vertices). 


Properties of Drawings 


In computing graph drawings, we would like to take into account a variety of properties. 
For example, planarity and the display of symmetries are highly desirable in visualiza- 
tion applications. Or we may want to display trees and acyclic digraphs with upward 
drawings. In general, to avoid wasting valuable space on a page or a computer screen, 
it is important to keep the area of the drawing small. Moreover, it is typically desirable 
to maximize the angular resolution and to minimize the other measures. 


DEFINITIONS 

D49: The crossing number y of a drawing is its total number of edge-crossings. 
D50: The area of a drawing is the area of its convex hull. 

D51: The total edge length of a drawing is the sum of the lengths of the edges. 


D52: The number of bends of a polyline drawing is the total number of bends 
on the edges of a drawing. 


D53: The maximum number of bends of a polyline drawing is the maximum 
number of bends on any edge. 


D54: The angular resolution p in a polyline drawing is the smallest angle formed 
by any two edges or segments of edges, incident on the same vertex or bend. 


D55: The aspect ratio of a drawing is the ratio of the longest side to the shortest 
side of the smallest rectangle with horizontal and vertical sides covering the drawing. 
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EXAMPLES 


The need to satisfy different drawing properties at the same time leads to formalizing 
many graph drawing problems as multi-objective optimization problems (e.g., construct 
a drawing with minimum area and minimum number of crossings), so that tradeoffs are 
inherent in solving them. 


E2: Figure 10.3.2(a-b) below shows two drawings of K4, the complete graph on four 
vertices. The drawing of part (a) is planar, while the drawing of part (b) “maximizes 
symmetries.” It can be shown that no drawing of Ky, is optimal with respect to both 
criteria, i.e., the maximum number of symmetries cannot be achieved by a planar draw- 


ing. 
(a) (b) (ec) (d) 


Figure 10.3.2: (a-b) Tradeoff between planarity and symmetry in drawing Ky. (c- 
d) Tradeoff between planarity and upwardness in drawing an acyclic 
digraph G. 


E3: Figure 10.3.2(c-d) shows two drawings of the same acyclic digraph G. The drawing 
of part (c) is upward, while the drawing of part (d) is planar. It can be shown that 
there is no drawing of G that is both planar and upward. 


Bounds on the Area 


Tables 10.3.1-10.3.3 below summarize selected universal upper bounds and existential 
lower bounds on the area of drawings of graphs. In the tables, a is an arbitrary constant 
0<a< 1, band care fixed constants 1 < b < c, and € is an arbitrary positive constant. 
The abbreviations “PSL” and “PSLg” are used for “planar straight-line” and “planar 
straight-line grid,” respectively. 


In general, the effect of bends on the area requirement is dual. On one hand, bends 
occupy space and hence negatively affect the area. On the other hand, bends may help 
in routing edges without using additional space. 


Section 10.3. Drawings of Graphs 


1249 


Table 10.3.1: Universal upper and existential lower bounds on the area of 


trees. 
CLASS OF 
AREA 
GRAPHS DRAWING TYPE 
1 Fibonacci trees strictly upward PSLg n) O(n) 
2 AVL trees strictly upward PSLg n) O(n) 
3 erate strictly upward PSLg Q(n) O(n) 
4 binary trees PSLg Q(n) O(n) 
? upward order 
5 binary trees preserving PSlig Q(n log n) O(n log n) 
; order preserving LS 
6 binary trees PSLog Q(n) O(n"?) 
; : upward order 2 2 
7 binary trees preserving PSLOs Q(n*) O(n) 
. upward planar 
8 binary trees érhowoual rid QO(nloglogn) | O(nlog log n) 
9 arena planar polyline grid Q(n) OeloL een) 
eee gare order preserving ms pte Set 
P ee upward planar polyline 
10 binary trees suid onder preserans Q(n log n) O(n log n) 
11 ternary trees PSLog Q(n) OG?) 
; order preserving 2 2 
12 ternary trees PSLog Q(n*) O(n) 
a upward planar 
13 ternary trees Srbhosonal arid Q(n log n) O(nlogn) 
14 ternary trees RiBNerOr hos Bid Q(n) O(n) 
order preserving 
degree-O(n? ) 
I rooted trees peue a) ay 
degree-O(n*) upward planar polyline 
* rooted trees grid nH) On) 
17 rooted trees PSLg n) O(nlog n) 
18 rooted trees upward PSLg n) O(nlogn) 
: oe strictly upward order Lhe 
19 rooted trees pioceryitié PSL Q(n) O(n'T*) 
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Table 10.3.2: Universal upper and existential lower bounds on the area of 
directed trees. 


1 nase areca strictly upward PSLg Q(n log n) O(nlog n) 
directed binary strictly upward order ii *s 
a trees preserving PSLg oe) On) 
: : strictly upward order 
: nomad somery. preserving planar Q(n?) O(n?) 
trees : ; 
polyline grid 
4 directed trees strictly upward PSLg Q(n log n) O(nlogn) 
F : strictly upward order - Pa 
5 directed trees presorvai@ PSba Q(b") O(c”) 
strictly upward order 
6 directed trees preserving planar Q(n?) O(n?) 
polyline grid 


FACTS 


F18: Linear or almost-linear bounds on the area can be achieved for several families 
of trees; typically superlinear bounds are associated with order preserving drawings 
(Table 10.3.1). 


F19: No linear area lower bounds exist for upward drawings of directed trees (Ta- 
ble 10.3.2). Exponential lower bounds are known for the drawing conventions of rows 2 
and 5 of Table 10.3.2. 


F20: Almost linear area can be achieved for undirected outerplanar graphs (rows 1-3 
of Table 10.3.3) and for non-planar drawings of degree-4 graphs (row 18 of Table 10.3.3). 
Subquadratic area upper bounds and/or superlinear lower bounds are known for undi- 
rected SP-graphs (rows 6-7 of Table 10.3.3). Exponential lower bounds are known for 
upward drawings of directed acyclic graphs (rows 8 and 20 of Table 10.3.3). 


F21: A quadratic area bound is achieved only at the expense of a linear number of 
bends (row 22 of Table 10.3.3). 


See Table 10.3.6 below for tradeoffs between area and aspect ratio in drawings of trees. 


Bounds on the Angular Resolution 


Table 10.3.4 below summarizes selected universal lower bounds and existential upper 
bounds on the angular resolution of drawings of graphs. Here c is a fixed constant with 
c>l. 
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Table 10.3.3: Universal upper and existential lower bounds on the area of 
planar graphs and digraphs. 


CLASS OF 
AREA 
GRAPHS DRAWING TYPE 
outerplanar ; 1.48 
1 otaphs outerplanar SLg Q(n) O(n'**) 
degree-d 
2 outerplanar outerplanar SLg Q(n) O(dlog n) 
graphs 
outerplanar outerplanar polyline 
3 sions a Q(n) O(n log n) 
directed Ps eer ynes ere 
4 outerplanar a ane er one Q(b") 
SLg 
graphs 
directed Re ey ren ene 
5 outerplanar vee a ees ae a : Q(n?) 
Sesohs polyline gri 
series-parallel Vleet 2 
6 snaps PSLg Q(n2 ) O(n*) 
series-parallel : . Vlogn 3/2 
7 staple planar polyline grid Q(n2 ) O(n?/*) 
wea acer at upward planar 
8 eta embedding preserving Q(b") O(c”) 
digraphs 
SLg 
series-parallel 9 2 
9 disraphe upward PSLg Q(n*) O(n*) 
13 planar graphs planar polyline grid Q(n?) O(n?) 
. ; PSL (with angular bi 
14 planar graphs fespiution >) Q(cP") 
15 planar graphs PSLg Q(n?) O(n?) 
16 baconnectad PSL convex grid O(n?) O(n?) 
planar graphs 
17 planar graphs planar orthog grid O(n?) O(n?) 
planar degree-4 . 2 
18 Sraphs orthogonal grid Q(n log n) O(n log n) 
19 general graphs polyline grid Q(n + x) O((n + x)?) 
20 | up planar digraphs upward PSLg Q(b") O(c”) 
reduced planar upward PSLg 2 9 
a st-digraphs dominance ne) One) 
22 | up planar digraphs | up planar grid polyline Q(n?) O(n?) 
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Table 10.3.4: Universal lower bounds and existential upper bounds on angular 
resolution. 


CLASS OF GRAPHS DRAWING TYPE ANGULAR RESOLUTION 


outerplanar graph planar straight-line Q(5) O(5) 
series-parallel graph planar straight-line Q(z) O(4) 
general graph straight-line Q(z) o('$s4) 
planar graph straight-line Q(5) O(5) 
planar graph planar straight-line Q(4) O( og) 
planar graph planar straight-line (+) O(4) 
planar graph planar polyline Q(4) O(4) 


Bounds on the Number of Bends 


Table 10.3.5 summarizes selected universal upper bounds and existential lower bounds 
on the total and maximum number of bends in orthogonal drawings. Some bounds are 
stated for n > 5 or n > 7 because the maximum number of bends is at least 2 for A4 and 
at least 3 for the skeleton graph of an octahedron, in any planar orthogonal drawing. 


Table 10.3.5: Orthogonal drawings: universal upper bounds and existential 
lower bounds on the number of bends. Notes: ¢ n > 7; $n> 5. 


CLASS OF DRAWING TOTAL # BENDS MAX # BENDS 
GRAPHS TYPE 
deg-4 orthog >n < 2n+2 >2 <2 
planar deg-4 orthog planar >2n-2 <2n+2 >2 y) 
imbedded deg-4 orthog planar >2n-2 < 2n +2 > <3 
biconnected 
fiseddea deg orthog planar >2n-2 <2n+2 >3 <3 
triconnected 4 3 
tanaadeddes La orthog planar | > 3(n—1)+2 | <5n+4 >2 <2 
imbedded deg-3 ¢ | orthog planar > an +1 S sn +1 >1 <1 


Tradeoff Between Area and Aspect Ratio 


The ability to construct area-efficient drawings is essential in practical visualization 
applications, where screen space is at a premium. However, achieving small area is not 
enough: e.g., a drawing with high aspect ratio may not be conveniently placed on a 
workstation screen, even if it has modest area. Hence, it is important to keep the aspect 
ratio small. Ideally, one would like to obtain small area for any given aspect ratio in a 
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wide range. This would provide graphical user interfaces with the flexibility of fitting 
drawings into arbitrarily shaped windows. A variety of tradeoffs for the area and aspect 
ratio arise even when drawing graphs with a simple structure, such as trees. 


Table 10.3.6 below summarizes selected universal bounds that can be simultaneously 
achieved on the area and the aspect ratio of various types of drawings of trees. 


Table 10.3.6: Trees: universal upper bounds simultaneously achievable for 
area and aspect ratio. 


CLASS OF 

GRAPHS DRAWING TYPE AREA ASPECT RATIO 

binary tree PSLg O(n) [O(1), O(n*)] 

binary tree PSLg order preserving O(n log n) [O(1), O(n/ log n)] 

binary tree PSLg order preserving O(n log log n) O (ers) 

binary tree PSLog O(n log log n) low, O (ere )| 

binary tree up planar orthog grid O(nloglogn) | O(nlog log n/ log? n) 

binary tree upward PSLog O(n log n) [O(1), O(n/ log n)] 

deg-4 tree orthog grid O(n) O(1) 

orthog grid, leaves on 
deg-4 tree area tail O(n log n) O(1) 
rooted upward planar polyline F 

deg-O(n*) tree grid OO) OO) 

rooted tree aowend pee layered O(n?) O(1) 

grid 
rooted tree upward PSLg O(n log n) O(n/ log n) 


In Table 10.3.6, a is an arbitrary constant with 0 < a < 1, while ¢ is an arbitrary 
positive constant. The abbreviation “PSLog” is used for “planar straight-line orthgonal 
grid,” that is, a straight-line grid drawing where the edges are either horizontal or ver- 
tical straight-line segments. Only for a few cases there exist algorithms that guarantee 
efficient area performance and that can accept any user-specified aspect ratio in a given 
range. For such cases the aspect ratio in Table 10.3.6 is given as an interval. 


REMARKS 


R6: While upward planar straight-line drawings are the most natural way of visualizing 
rooted trees, the existing drawing techniques are unsatisfactory with respect to either the 
area requirement or the aspect ratio. The situation is similar for orthogonal drawings. 
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R7: Regarding polyline drawings, linear area can be achieved with a prescribed as- 
pect ratio. However, experiments show that this is done at the expense of a somehow 
aesthetically unappealing drawing. 


R8: For nonupward drawings of trees, linear area and optimal aspect ratio are possible 
for planar orthogonal drawings, and a small (logarithmic) amount of extra area is needed 
if the leaves are constrained to be on the convex hull of the drawing (e.g., pins on the 
boundary of a VLSI circuit). However, the nonupward drawing methods do not seem 
to yield aesthetically pleasing drawings, and are suited more for VLSI layout than for 
visualization applications. 


Tradeoff between Area and Angular Resolution 


Table 10.3.7 summarizes selected universal bounds that can be simultaneously achieved 
on the area and the angular resolution of drawings of graphs. Here b and c are fixed 
constants, b > 1 and c > 1. Universal lower bounds on the angular resolution exist 
that depend only on the degree of the graph. Also, substantially better bounds can be 
achieved by drawing a planar graph with bends or in a nonplanar way. 


Table 10.3.7: Universal upper bounds for area and lower bounds for angular 
resolution, simultaneously achievable. 


Seen DRAWING TYPE AREA ee oe 

trees planar straight-line O(n®) (4) 
planar graph straight-line O(d°n) Q( az) 
planar graph straight-line O(d3n) Q(5) 
planar graph | planar straight-line grid O(n?) Q(=5) 
planar graph planar straight-line O(b”) Q(<z) 
planar graph planar polyline grid O(n?) Q(5) 

FACTS 


F22: Any unordered tree has a planar straight-line drawing with perfect angular 
resolution! 


F23: There are ordered trees that require exponential area for any planar straight-line 
drawing having perfect angular resolution. 


Ordered preserving planar drawings of trees with perfect angular resolution and 
polynomial area can be obtained in the Lombardi drawing style. 


1A drawing has perfect angular resolution if for every vertex v the angle formed by any two consec- 
utive edges around v is FOE where d(v) is the degree of v. 
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Open Problems 


P6. Determine the area requirement of planar straight-line orthogonal drawings of 
binary and ternary trees. There are currently wide gaps between the known upper and 
lower bounds (Table 10.3.1 rows 6 and 11). 


P7. Determine the area requirement of (upward) planar straight-line drawings of trees. 
There is currently an O(logn) gap between the known upper and lower bounds (Ta- 
ble 10.3.1 rows 17 and 18). 


P8. Determine the area requirement of strictly upward planar order preserving straight- 
line drawings of rooted trees (Table 10.3.1 row 19). 


P9. Determine the area requirement of outerplanar straight-line grid drawings of outer- 
planar graphs. There is currently an O(n°-48) gap between the known upper and lower 
bounds (Table 10.3.3 row 1). 


P10. Determine the area requirement of planar straight-line grid drawings of series- 
parallel graphs. In particular it would be interesting to prove a subquadratic upper 
bound (Table 10.3.3 row 6). 


P11. Determine the area requirement of orthogonal (or, more generally, polyline) non- 
planar drawings of planar graphs. There is currently an O(log n) gap between the known 
upper and lower bounds (Table 10.3.3 row 18). 


P12. Close the gap between the 2(7) universal lower bound and the o('$s4) existen- 
tial upper bound on the angular resolution of straight-line drawings of general graphs 
(Table 10.3.4). 


P13. Close the gap between the Q(+,) universal lower bound and the O( 108.4 ) exis- 
tential upper bound on the angular resolution of planar straight-line drawings of planar 


graphs (Table 10.3.4). 


P14. Determine the best possible aspect ratio and area simultaneously achievable for 
(upward) planar straight-line and orthogonal drawings of trees (Table 10.3.6). 


10.3.4 Complexity of Graph Drawing Problems 


Tables 10.3.8-10.3.11 below summarize selected results on the time complexity of 
some fundamental graph drawing problems. 

It is interesting that apparently similar problems exhibit very different time com- 
plexities. For example, while planarity testing can be done in linear time, upward 
planarity testing is NP-hard. Note that, as illustrated in Figure 10.3.1(c-d), planarity 
and acyclicity are necessary but not sufficient conditions for upward planarity. While 
many efficient algorithms exist for constructing drawings of trees and planar graphs 
with good universal area bounds, exact area minimization for most types of drawings 
is NP-hard, even for trees. 
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Table 10.3.8: Time complexity of some fundamental graph drawing problems: 
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general graphs and digraphs. 


CLASS OF GRAPHS PROBLEM TIME COMPLEXITY 
general graph minimize crossings NP-hard 
minimize crossings in layered 
2-layered graph drawing with preassigned NP-hard 
order on one layer 
general graph maximum planar subgraph NP-hard 
test the existence of a 
general graph drawing where each edge is NP-hard 
crossed at most once 
planarity testing and 
general graph computing a planar Q(n) O(n) 
imbedding 
general graph maximal planar subgraph Q(n +m) O(n +m) 
general digraph upward planarity testing NP-hard 
imbedded digraph upward planarity testing Q(n) O(n?) 
biconnected : ‘ ae 4 
jchiee-pacallebaieaih upward planarity testing Q(n) O(n*) 
biconnected : : xi 2 
outenulanat upward planarity testing Q(n) O(n*) 
biconnected bipartite upward planarity testing Q(n) O(n) 
single-source digraph upward planarity testing Q(n) O(n) 
draw as the intersection 
general graph graph of a set of unit NP-hard 
diameter disks in the plane 
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Table 10.3.9: Time complexity of some fundamental graph drawing problems: 
planar graphs and digraphs. 


CLASS OF GRAPHS PROBLEM TIME COMPLEXITY 


planar straight-line drawing 


with prescribed edge lengths NP-hard 


planar graph 
planar straight-line drawing 


with max angular resolution NP-hard 


planar graph 
test the existence of a planar 
st-line drawing with 
imbedded graph prescribed angles betw pairs NP-hard 
of consecutive edges incident 
on a vertex 


test the existence of a planar 
st-line drawing with 
maximal planar graph | prescribed angles betw pairs Q(n) O(n) 
of consecutive edges incident 
on a vertex 


planar st-line grid drawing 
planar graph with O(n?) area and O(1/n?) Q(n) O(n) 
angular resolution 


planar polyline drawing with 
planar graph O(n?) area, O(n) bends, and Q(n) O(n) 
O(1/d) angular resolutions 


planar straight-line convex 
erid drawing with O(n?) area 


triconn planar graph and On) angular Q(n) O(n) 
resolution 
; planar st-line strictly convex 
triconn planar graph draws Q(n) O(n) 
up planar grid st-line 
ee tuced Diana dominance drawing with min Q(n) O(n) 


st-digraph ree 
up planar polyline grid 
drawing with O(n?) area & Q(n) O(n) 
O(n) bends 


upward planar 
digraph 
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Table 10.3.10: Time complexity of some fundamental graph drawing problems: 
planar graphs and digraphs. 


CLASS OF GRAPHS PROBLEM TIME COMPLEXITY 
planar orthogonal grid 
planar deg-4 graph drawing with min number of NP-hard 
bends 
: planar orthog grid drawing 
Dlahat-biconverted with min # bends and O(n?) Q(n) O(n° log n) 
deg-3 graph 
area 
planar biconnected planar orthog grid drawing 
deg-4 series-parallel | with min # bends and O(n?) Q(n) O(n*) 
graph area 
planar biconnected planar orthog grid drawing 
deg-3 series-parallel | with min # bends and O(n?) Q(n) O(n?) 
graph area 
planar orthog grid drawing 
imbedded deg-4 graph | with min # bends and O(n?) Q(n) O(n?) 
area 
planar orthog grid drawing 
planar deg-4 graph with O(n?) area and O(n) Q(n) O(n) 
bends 
test the existence of a PSLog = 
imbedded deg-4 graph drawing with rectangular Q(n) O( hen) 
faces 
test the existence of a PSLog 
planar deg-3 graph drawing with rectangular Q(n) O(n) 
faces 
ance entineeaeaiet test the existence of a planar 
PP eee orthog grid drawing with no Q(n) O(n) 
graph 
bends 
planar orthogonal planar orthog grid drawing NPchard 
representation with minimum area 
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Table 10.3.11: Time complexity of some fundamental graph drawing problems: 


trees. 
CLASS OF 
TIME COMPLEXITY 
GRAPHS PROBLEM 
fats draw as the Euclidean min spanning NP-hard 
tree of a set of points in the plane 
ewreed ted minimize area in planar orthogonal NP-hard 
grid drawing 
detected minimize total/ maximum edge length NP chard 
in planar orthogonal grid drawing 
minimize area in a planar st-line up 
rooted tree layered grid drawing that displays NP-hard 


symmetries and isom’s of subtrees 


minimize area in a planar 
straight-line up layered drawing that 


ted t . ; ; 
a rea displays symmetries and isom’s of 


Q(n) O(n*),k>1 


subtrees 
binary tree minimize area in hv-drawing Q(n) O(nv/n log n) 
planar straight-line up layered grid 
peor Ge drawing with O(n?) area am) OM) 
eoteA Ge planar polyline up grid drawing with O(n) O(n) 


O(n) area 
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Open Problems 


P15. Reduce the time complexity of upward planarity testing for imbedded digraphs 
(currently O(n”)), or prove a superlinear lower bound (Table 10.3.8). 

P16. Reduce the time complexity of upward planarity testing for series-parallel and 
outerplanar digraphs (currently O(n*) and O(n?), respectively), or prove a superlinear 
lower bound (Table 10.3.8). 

P17. Reduce the time complexity of bend minimization for planar orthogonal draw- 
ings of imbedded graphs (currently O(n3/?)), or prove a superlinear lower bound (Ta- 
ble 10.3.9). 

P18. Reduce the time complexity of bend minimization for planar orthogonal draw- 
ings of several families of graphs (deg-3 graphs, deg-3 and deg-4 series-parallel graphs, 
imbedded deg-4 graphs) (Table 10.3.9). 

P19. Determine the time complexity of testing the existence a planar orthogonal 
straight-line grid drawing with rectangular faces for planar deg-4 graphs (Table 10.3.10). 
P20. Reduce the time complexity of testing the existence a planar orthogonal grid 
drawing with no bends for deg-4 series-parallel graphs. The current bound is O(n°) 
deriving from bend minimization (Table 10.3.10). 


10.3.5 Example of a Graph Drawing Algorithm 


In this subsection we outline the algorithm by one of the authors [Tam87] for comput- 
ing, for an imbedded degree-4 graph G, a planar orthogonal grid drawing with minimum 
number of bends and using O(n”) area (see Table 10.3.7). This algorithm is the core of 
a practical drawing algorithm for general graphs (see §10.3.6 and Figure 10.3.3(d)). 


Graph Drawing Algorithm The algorithm consists of two main phases: 
1. Computation of an orthogonal representation for G, where only the bends and the 
angles of the orthogonal drawing are defined. 
2. Assignment of integer lengths to the segments of the orthogonal representation. 


REMARKS 


R9: Phase 1 uses a transformation into a network flow problem (Figure 10.3.3(a-c)), 
where each unit of flow is associated with a right angle in the orthogonal drawing. 
Hence, angles are viewed as a commodity that is produced by the vertices, transported 
across faces by the edges through their bends, and eventually consumed by the faces. 


R10: From the imbedded graph G we construct a flow network N as follows. The 
nodes of network N are the vertices and faces of G. Let deg(f) denote the number of 
edges of the circuit bounding face f. Each vertex v supplies o(v) = 4 units of flow, and 
each face f consumes 7(f) units of flow, where 


(f) 2deg(f)—4 if f is an internal face 
r(f)= 
2deg(f)+4 if f is the external face 


By Euler’s formula, 5>,,¢(v) = Sep; i.e., the total supply is equal to the total 
consumption. 


R11: Network N has two types of arcs: 
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e arcs of the type (v, f), where f is a face incident on vertex v; the flow in (v, f) 
represents the angle at vertex v in face f, and has lower bound 1, upper bound 4, 
and cost 0; 


e arcs of the type (f,g), where face f shares an edge e with face g; the flow in (f, g) 
represents the number of bends along edge e with the right angle inside face f, 
and has lower bound 0, upper bound +co, and cost 1. 


R12: The conservation of flow at the vertices expresses the fact that the sum of the 
angles around a vertex is equal to 27. The conservation of flow at the faces expresses 
the fact that the sum of the angles at the vertices and bends of an internal face is equal 
to m(p— 2), where p is the number of such angles. For the external face, the above sum 
is equal to m(p + 2). 


R13: It can be shown that every feasible flow ¢ in network N corresponds to an 
admissible orthogonal representation for graph G, whose number of bends is equal to 
the cost of flow ¢. Hence, an orthogonal representation for G with the minimum number 
of bends can be computed from a minimum cost flow in G. This flow can be constructed 
in O(n? logn) time with standard flow-augmentation methods. 


R14: Phase 2 uses a simple compaction strategy derived from VLSI layout, where 
the lengths of the horizontal and vertical segments are computed independently after a 
preliminary refinement of the orthogonal representation that decomposes each face into 
rectangles. The resulting drawing is shown in Figure 10.3.3(d). 


10.3.6 Techniques for Drawing Graphs 


In this subsection we outline some of the most successful techniques that have been 
devised for drawing general graphs. 


Planarization 


The planarization approach is motivated by the availability of many efficient and well- 
analyzed drawing algorithms for planar graphs (see Table 10.3.9). If the graph is non- 
planar, it is transformed into a planar graph by means of a preliminary planarization 
step that replaces each crossing with a fictitious vertex. The planarization approach 
consists of two main steps: in the first step a maximal planar subgraph G’ of the input 
graph G is computed; in the second step, all the edges of G that are not in G’ are added 
to G’ and the crossings formed by each added edge are replaced with dummy vertices. 
Clearly when adding an edge one wants to produce as few crossings as possible. The 
two optimization problems arising in the two steps of the planarization approach, i.e., 
the maximum planar subgraph problem and the edge insertion problem, are NP-hard. 
Hence, existing planarization algorithms use heuristics. The best available heuristic for 
the maximum planar subgraph problem is described in [JM96]. This method has a solid 
theoretical foundation in polyhedral combinatorics, and achieves good results in prac- 
tice. A sophisticated algorithm for edge insertion (that insert each edge minimizing the 
number of crossings over all possible imbeddings of the planar subgraph) is described 
in [GMW05]. See also [BCG] for more references. 
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(a) 


(c) 


(b) 


(d) 


Figure 10.3.3: (a) Imbedded graph G. (b) Minimum cost flow in network N: the 
flow is shown next to each arc; arcs with zero flow are omitted; arcs 
with unit cost are drawn with thick lines; a face f is represented by 
a box labeled with r(f). (c) Planar orthogonal grid drawing of G 
with minimum number of bends. (d) Orthogonal grid drawing of a 
nonplanar graph produced by a drawing method for general graphs 
based on the algorithm of this subsection. 


A successful drawing algorithm based on the planarization approach and a bend- 
minimization method [Tam87] is described in [TDB88] (Figure 10.3.3(d) was generated 
by this algorithm). It has been widely used in software visualization systems. 
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Layering 


The layering approach for constructing polyline drawings of directed graphs transforms 
the digraph into a layered digraph and then constructs a layered drawing. A typical 
algorithm based on the layering approach consists of the following main steps: 


1. Assign each vertex to a layer, with the goal of maximizing the number of edges 
oriented upward. 

2. Insert fictitious vertices along the edges that cross layers, so that each edge in the 
resulting digraph connects vertices in consecutive layers. (The fictitious vertices 
will be displayed as bends in the final drawing.) 

3. Permute the vertices on each layer with the goal of minimizing crossings. 

4. Adjust the positions of the vertices in each layer with the goal of distributing the 
vertices uniformly and minimizing the number of bends. 


Most of the subproblems involved in the various steps are NP-hard; hence heuristics 
must be used. The layering approach was pioneered by Sugiyama et al. [STT81] and 
since then, a lot of research has been devoted to all optimization problems defined in 
each of the four steps above (see, e.g., [ELS93, GKNV93, TNB04, HN02, JM97, Nag05, 
NY04, EK86, EW94, MSM99, BWZ10, BK02, BBBH10, C@¢MW11, CGMW10]). See 
also [HN] for more references. 


Physical Simulation 


This approach uses a physical model where the vertices and edges of the graph are viewed 
as objects subject to various forces. Starting from an initial random configuration, the 
physical system evolves into a final configuration of minimum energy, which yields the 
drawing. Rather than solving a system of differential equations, the evolution of the 
system is usually simulated using numerical methods (e.g., at each step, the forces are 
computed and corresponding incremental displacements of the vertices are performed). 
Drawing algorithms based on the physical simulation approach are often able to detect 
and display symmetries in the graph. However, their running time is typically high. 
The physical simulation approach was pioneered in [Ead84, KS80]. Sophisticated devel- 
opments and applications include [DH96, EH00, FR91, KK89, HK02, GGK04, GKN05, 
BPO7]. See also [Kob] for additional references. 


10.3.7 Selected Topics 


This subsection presents an overview of selected areas of graph drawing that have 
attracted increasing attention. 


Point-Set Embeddings and Universal Point Sets 


In the “typical” graph drawing setting, algorithms compute a layout of the graph by 
suitably choosing vertex locations in order to satisfy drawing requirements or to optimize 
aesthetic criteria. In some applications, however, one wants to lay out a graph onto pre- 
assigned vertex locations. A point-set embedding of a graph G to a set of points S is a 
drawing of G where the vertices are represented as points of S. 
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DEFINITIONS 


D56: Let G=(V, EF) bea graph. A k-coloring of G (k > 0) is a partition of V into 
k sets {V,Vo,...,Ve}. A graph G with a k-coloring is called a k-colored graph. A 
1-colored graph is simply called a graph. 


D57: Let S bea set of points in the plane. A k-coloring of S (k > 0) is a partition of 
S into k sets {$1, So,...,.S,}. A set of points S with a k-coloring is called a k-colored 
point set. A 1-colored set of points is simply called a point set. 


D58: Let G be a k-colored graph and let S be a k colored point set. G and S are 
compatible if |V;| < |.S;| for every i = 1,2,...,k. 


D59: Let G = (V,E) be a k-colored graph, let S be a k-colored point set, and let D 
be a type of drawing (planar, straight-line, orthogonal, etc.). A k-colored point-set 
embedding of G to S of type D is a drawing I of G of type D where each vertex 
v € V; is represented in I by a point of $;. A 1-colored point-set embedding of G to S 
of type D is simply called a point-set embedding of G to S of type D. Also, when the 
type of drawing is clear from the context we omit the specification “of type” D. 


D60: Let F be a family of k-colored graphs and let D be a type of drawing. A k-colored 
point set S is universal for F for drawings of type D if every graph G € F admits a 
k-colored point-set embedding to S' of type D. 


D61: An h-bend drawing (h > 0) is a polyline drawing where each edge has at most 
h bends. 


REMARKS 


R15: The definition of k-colored graph given above allows adjacent vertices to have 
the same color. 


R16: In the literature about point-set embeddability different versions of the problem 
have been investigated that are specializations of the general definition given above. 
Typically, the number of points considered is equal to the number of vertices of the 
input graph. Concerning the number of colors k, typical values considered are 1 or n. 
In the first case any vertex can be mapped to any point, while in the second one the 
coloring defines a one-to-one mapping between the vertices and the points. Values of k 
between 1 and n define a mapping of groups of vertices to group of points. Drawings 
are often required to be planar. Straight-line, polyline, and orthogonal drawings are 
those most considered. Straight-line and polyline have been studied also in the upward 
version. When studying polyline point-set embeddings one typically wants to keep the 
number of bends per edge as small as possible. 


R17: Universal point sets have been typically studied in the version with k = 1. When 
studying universal point sets one is typically interested in the size of a universal point 
set for a family and for a type of drawing. 


FACTS 


F24: Deciding whether an n-vertex planar graph admits a planar straight-line point-set 
embedding on a point set of size n is NP-complete [Cab06]. 
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F25: An n-vertex planar graph admits a planar straight-line point-set embedding on 
every point set of size n in general position if and only if it is outerplanar [GMPP91, 
CU96]. 


F26: A planar straight-line point-set embedding of an n-vertex tree on any point set 
of size n in general position can be computed in O(n log n) [BMS97]. 


F27: A planar straight-line point-set embedding of an n-vertex outerplanar graph on 
any point set of size n in general position can be computed in O(n log® n) [Bos02]. 


F28: A planar 2-bend point-set embedding of an n-vertex planar graph on any point 
set of size n can be computed in O(n logn) [KW02]. 


F29: There exists an n-vertex planar graph G and a point set S of size n such that 
in every polyline point-set embedding of G on S' there exists at least one edge with two 
bends [KW02]. 


F30: A planar n-colored O(n)-bend point-set embedding of an n-vertex n-colored 
planar graph on any n-colored point set of size n in general position can be computed 
in O(n?) [PWO1]. 


F31: There exists an n-vertex planar graph G and a point set S' of size n such that in 
every polyline n-colored point-set embedding of G on S$, almost surely there exist Q(n) 
edges with Q(n) bends [PW01, BDL08, Gor12). 


F32: For every 1 < k <n, a planar k-colored O(n)-bend point-set embedding of an n- 
vertex planar graph on any k-colored point set of size n can be computed in O(n? log n) 
[BDLO8}. 


F33: For every 1 < k < n, there exists an n-vertex k-colored planar graph G and a 
k-colored point set S' of size n such that in every polyline k-colored point-set embedding 
of G on S, there exist Q(n) edges with Q(n) bends [BDLO8]. 


F34: A planar 2-colored 5-bend point-set embedding of an n-vertex outerplanar graph 
on any 2-colored point set of size n can be computed in O(nlogn) time [DDL*08]. 


F35: For every positive integer h > 0, there exists an n-vertex 3-colored planar graph 
G (with n = O(h?)) and a 3-colored point set S$ of size n such that in every polyline 
3-colored point-set embedding of G on S, there exists at least one edge with more than 
h bends [DDLT 08}. 


F36: Deciding whether an n-vertex planar graph admits a planar orthogonal point-set 
embedding on a point set of size n is NP-complete [KKRW10]. 


F37: Deciding whether an n-vertex n-colored perfect matching admits a planar n- 
colored orthogonal grid point-set embedding on an n-colored point set of size n is NP- 
complete [KKRW10]. 


F38: Deciding whether an n-vertex n-colored perfect matching admits a planar n- 
colored orthogonal grid point-set embedding on an n-colored point set of size n is NP- 
complete [KKRW10]. 
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F39: It is possible to decide in O(n*) time whether an n-vertex n-colored perfect 
matching admits a planar n-colored orthogonal point-set embedding on an n-colored 
point set of size n [KKRW10]. 


F40: An upward planar 1-bend point-set embedding of an n-vertex series-parallel 
digraph on any point set of size n can be computed in O(n logn) [DDLWO6]. 


F41: An upward planar 2-bend point-set embedding of an n-vertex planar digraph on 
any point set of size n can be computed in O(nlogn) [GLMS07]. 


F42: No biconnected digraph with n > 4 vertices admits an upward planar straight-line 
point-set embedding on every convex point set of size n [BDD* 10}. 


F43: For every k > 3, there exists a 3k-vertex upward planar digraph that admits an 
upward planar straight-line point-set embedding on every convex point set of size 3k 
but not on every point set in general position of size 3k. [AFG*1]]. 


F44: For every odd n > 5, there exists an n-vertex directed tree T and a convex point 
set S' of size n such that T does not admit an upward planar straight-line point-set 
embedding on S. [BDD*10]. 


F45: Every n-vertex directed caterpillar admits an upward straight-line point-set em- 
bedding on every convex point set of size n [AFGT11]. 


F46: Every universal point set for straight-line planar drawings of n-vertex planar 
graphs has size at least 1.235n [Kur04]. 


F47: There exists a universal point set of size Bn? for planar straight-line drawings of 
n-vertex planar graphs [Bra0g]. 

logn 
log log n 
drawings of n-vertex simply-nested planar graphs [ADK* 12). 


F48: There exists a universal point set of size O(n( )?) for planar straight-line 


F49: There exists a universal point set of size n for planar 1-bend drawings of n-vertex 
planar graphs [ELLW10]. 


F50: There exists a universal point set of size 10n — 18 for planar 3-bend drawings of 
n-vertex planar graphs, when bends are required to be drawn at points of the point set 
[DEL* 12]. 


F51: There exists a universal point set of size O(nlogn) for planar 2-bend drawings 
of n-vertex planar graphs, when bends are required to be drawn at points of the point 
set [DEL* 12]. 

F52: There exists a universal point set of size O( een) for planar 1-bend drawings of 
n-vertex planar graphs, when bends are required to be drawn at points of the point set 
[DEL* 12]. 


F53: For every 1 < k <n, there exists a universal point set of size kn — k? + 1 for 
planar (3k + 7)-bend drawings of n-vertex k-colored planar graphs [DLT10]. 


F54: For every 1 < k <n, there exists a universal point set of size O(kn?) x O(kn?) 
for planar (6k + 5)-bend drawings of n-vertex k-colored planar graphs [DLT 10]. 
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F55: There exists a universal point set of size n? —2n +2 for planar 1-bend orthogonal 
drawings of n-vertex degree-4 trees [DFF* 12]. 


F56: There exists a universal point set of size 4n for planar geodesic orthogonal? 
drawings of n-vertex degree-4 trees [DFF* 12]. 


F57: There exists a universal point set of size n for planar geodesic orthogonal drawings 
of n-vertex degree-3 trees [DFF* 12]. 


F58: There exists a universal point set of size O(n”) for upward planar straight-line 
drawings of n-vertex directed paths [Mch12]. 


Simultaneous Embeddings 


In some application areas one has different types of relations defined over a set of 
elements. For example, in social network analysis different relationships among the same 
set of people can be studied. Analogously in biology, different algorithms can produce 
different phylogenetic trees on the same set of organisms. In terms of graphs we have 
two (or more) graphs with the same vertex set and different edge sets. In this situation 
it may be desirable to simultaneously visualize two (or more) drawings of the two (or 
more) graphs under consideration. A simultaneous embedding of & graphs (k > 1) 
consists of k drawings (one for each graph) such that every vertex is represented by the 
same point in all drawings. The typical requirement for a simultaneous embedding is 
that each single drawing is planar, while the union of all drawings may not be planar. 
We report below basic definitions and facts. See also [BKR] for more references. 


DEFINITIONS 


D62: Let G; = (V,F;) (¢=1,2,...,k) bea set of k planar graphs. A simultaneous 
embedding of G,,G2,...Gx is a set of k drawings I; of G; such that each vertex v is 
represented by the same point of the plane in all T;. 


D63: A simultaneous embedding with fixed edges is a simultaneous embedding 
where each edge e € E; 1 E; (1 < i,j < k) is drawn in the same way (i.e., by the same 
curve) in T; and T). 


D64: A simultaneous geometric embedding is a simultaneous embedding such 
that every T; (1 <i <k&) is a straight-line drawing. 
REMARKS 


R18: A simultaneous geometric embedding is also a simultaneous embedding with fixed 
edges, and a simultaneous embedding with fixed edges is also a simultaneous embedding. 


R19: Most of the research about simultaneous embeddings concentrated on studying 
whether two (or more) graphs from some families can always be simultaneous embedded 
or not. 


2In a geodesic orthogonal drawing the length of each edge is equal to the Manhattan distance of its 
endpoints. 
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FACTS 


F59: There exist a planar graph and an outerplanar graph that do not admit a 
simultaneous geometric embedding. [BCDt07] 


F60: There exist a planar graph and a path that do not admit a simultaneous geometric 
embedding. [BCD*07| 


F61: There exist a planar graph and a matching that do not admit a simultaneous 
geometric embedding. [CvKL* 10] 


F62: There exist two outerplanar graphs that do not admit a simultaneous geometric 
embedding. [BCD* 07] 


F63: There exists two trees that do not admit a simultaneous geometric embedding. 
[GKV09} 


F64: There exists a tree and a path that do not admit a simultaneous geometric 
embedding. [AGKN12] 


F65: There exist three paths that do not admit a simultaneous geometric embedding. 
[BCD* 07] 


F66: Every pair consisting of a tree and a matching both with n vertices admits a 
simultaneous geometric embedding. [CvKL* 10] 


F67: Every pair of caterpillars? with n vertices admits a simultaneous geometric 
embedding on an integer grid of size 3n x 3n. [BCD*07] 


F68: Every pair consisting of a caterpillar and a path both with n vertices admits a 
simultaneous geometric embedding on an integer grid of size n x 2n. [BCD*07] 


F69: Every pair of paths with n vertices admits a simultaneous geometric embedding 
on an integer grid of size n x n. [BCDtT07] 


F70: Every pair of cycles with n vertices admits a simultaneous geometric embedding 
on an integer grid of size 4n x 4n. [BCDT07| 


F71: For any fixed k > 1, deciding whether & planar graphs admit a simultaneous 
geometric embedding is NP-hard. [EBGJ*08] 


F72: For any fixed k > 18, deciding whether & planar graphs, each with a fixed 
imbedding, admit a simultaneous geometric embedding is NP-complete. [ADF 11] 


F73: There exist two outerplanar graphs that do not admit a simultaneous embedding 
with fixed edges. [Fra07| 


F74: Every pair consisting of a tree and a path both with n vertices admits a simulta- 
neous embedding with fixed edges on an integer grid of size O(n) x O(n”). The drawing 
of the path is a straight-line drawing, while in the drawing of the tree each edge has at 
most one bend. [EK05] 


3A caterpillar is a tree such that after removing all leaves we are left with a path. 
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F75: Every pair consisting of an outerplanar graph and a path both with n vertices 
admits a simultaneous embedding with fixed edges on an integer grid of size O(n) x 
O(n?). The drawing of the path is a straight-line drawing, while in the drawing of the 
outerplanar graph each edge has at most one bend. [DL07| 


F76: Every pair consisting of an outerplanar graph and a cycle both with n vertices 
admits a simultaneous embedding with fixed edges on an integer grid of size O(n?) x 
O(n?). In both drawings each edge has at most one bend. [DLO7] 


F77: Every pair consisting of a planar graph and a tree both with n vertices admits a 
simultaneous embedding with fixed edges. [Fra07] 


F78: For any fixed k > 2, deciding whether & planar graphs admit a simultaneous 
embedding with fixed edges is NP-complete. [GJPT06] 


F79: It is possible to decide in O(n) time whether two n-vertex planar graphs ad- 
mit a simultaneous embedding with fixed edges if one of them has a fixed imbedding. 
[ADF* 10] 


F80: It is possible to decide in O(n) time whether two n-vertex planar graphs admit 
a simultaneous embedding with fixed edges if their intersection is a biconnected graph. 
[HJL10} 


F81: It is possible to decide in O(n?) time whether three n-vertex planar graphs admit 
a simultaneous embedding with fixed edges if each of them has a fixed imbedding. 
[ADF 11] 


F82: For any fixed k > 14, deciding whether k& planar graphs, each with a fixed 
imbedding, admit a simultaneous embedding with fixed edges is NP-complete. [ADF 11] 


F83: Every pair of planar graphs with n vertices admits a simultaneous embedding 
on an integer grid of size O(n?) x O(n”). In both drawings each edge has at most two 
bends. [DL07| 


F84: Every pair of trees with n vertices admits a simultaneous embedding on an integer 
grid of size O(n?) x O(n”). In both drawings each edge has at most one bend. [EK05] 


Lombardi Drawings 


Inspired by the work of the American artist Mark Lombardi, Duncan et al. introduced 
and studied Lombardi drawings of graphs. In a Lombardi drawing of a graph edges 
are drawn with circular arcs and perfect angular resolution, i.e., consecutive edges are 
equiangularly spaced around each vertex. 


DEFINITIONS 


D65: A Lombardi drawing of a graph G is a drawing of G in the plane in which 
vertices are represented as points, edges are represented as line segments or circular arcs 
between their endpoints, and every vertex has perfect angular resolution. 


D66: A circular Lombardi drawing is a Lombardi drawing where all vertices lie on 
a circle. 
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D67: A k-circular Lombardi drawing is a Lombardi drawing where all vertices lie 
on k concentric circles (k > 1). 


D68: A k-Lombardi drawing (k > 1) of a graph G is a drawing of G in the plane 
in which vertices are represented as points, edges are represented as a sequence of line 
segments or circular arcs between their endpoints, and every vertex has perfect angular 
resolution. A k-Lombardi drawing is smooth if every edge is continuously differentiable, 


i.e., no edge in the drawing has a sharp bend. A non-smooth k-Lombardi drawing is 
said to be pointed. 


FACTS 


F85: Every ordered tree with n vertices admits a planar order preserving Lombardi 
drawing in area O(n3). [DEG*12b] 


F86: Every bipartite d-regular graph with n vertices admits a circular Lombardi 
drawing that can be constructed in O(dn log d) time. [DEG*12b] 


F87: Every 4k-regular graph with n vertices admits a circular Lombardi drawing that 
can be constructed in O(knlogk) time. [DEG*12b] 


F88: Every d-regular graph of odd degree with n vertices admits a circular Lombardi 
drawing if and only if has a perfect matching. When such a drawing exists it can be 


constructed in O(dn*°) time. [DEG*12b] 


F89: Every 3-regular bridgeless graph with n vertices admits a circular Lombardi 
drawing that can be constructed in O(n log® nlog log n) time. [DEG*12b] 


F90: When d= 2 (mod 4), it is NP-complete to test whether a d-regular graph has 
a circular Lombardi drawing. [DEG*12b] 


F91: Every outerplanar series-parallel graph admits a Lombardi drawing. [DEG*12b] 
F92: There exist planar graphs that do not admit planar Lombardi drawings. [DEG*12b] 


F93: Every Halin graph with n vertices admits a planar Lombardi drawing that can 
be computed in O(n) time. [DEG*12b] 


F94: There exist graphs that do not admit Lombardi drawings. [DEG* 12a] 
F95: Every graph has a smooth 2-Lombardi drawing. [DEG* 12a] 


F96: Every planar graph of maximum degree 3 has a smooth planar 2-Lombardi 
drawing. [DEGT 12a] 


F97: Every planar graph has a smooth planar 3-Lombardi drawing. [DEGT 12a] 


F98: Every planar graph has a pointed planar 2-Lombardi drawing. [DEG* 12a] 
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Drawings with Large Crossing Angles 


One of the most desirable properties when drawing a graph is planarity. Planar drawings 
are easily readable and aesthetically pleasant. This is not only suggested by intuition 
but also confirmed by experiments with end-users [PCA02, Pur00, WPCM02]. Unfor- 
tunately, since not all graphs are planar, crossings are often unavoidable. Some recent 
cognitive experiments [Hua07, HHE08], however, showed that if edges cross forming 
“large” angles, then crossings do not inhibit human performance in reading the graph. 
Based on this experiment an emerging research line in graph drawing studies non-planar 
drawings of graphs with large angles at crossings. We report below basic definitions and 
facts. See also [DL12] for more references. 


DEFINITIONS 


D69: A RAC drawing (Right Angle Crossing drawing) is a drawing of a graph 
where any two crossing edges form a 5 angle. 


D70: A 2-layer RAC drawing is a straight-line drawing of a bipartite graph such 
that each vertex partition set is drawn on a distinct horizontal line. 


D71: An ACE, drawing is a drawing of a graph where any two crossing edges form 
an angle that is exactly a (0 <a < 4). 


D72: An ACL, drawing is a drawing of a graph where any two crossing edges form 
an angle that is at least a (0 < a < 4). 


FACTS 


F99: Every straight-line RAC drawing with n vertices has at most 4n—10 edges. Also, 
for any k > 3 there exists a graph with n = 3k — 5 vertices and 4n — 10 edges that 
admits a straight-line RAC drawing [DEL11]. 


F100: Every straight-line RAC drawing with n vertices and 4n— 10 edges is 1-planar.* 
Also, for any k > 0 there exists a 1-planar graph with n = 8 + 4k vertices and 4n — 10 
edges that does not admit a RAC drawing [EL12]. 


F101: Deciding whether a graph admits a straight-line RAC drawing is NP-hard. 
[ABS11b] 


F102: Deciding whether a digraph admits a straight-line upward RAC drawing is 
NP-hard. [ACD*11] 


F103: There exists a O(n)-time algorithm to decide whether an n-vertex bipartite 
graph admits a 2-layer RAC drawing. If the drawing exists it can be computed in O(n) 
time. [DDEL11] 


F104: For any given k > 0, deciding whether a bipartite graph has a subgraph with 
at least k edges that admits a 2-layer RAC drawing is NP-hard. [DDEL11] 


F105: <A complete bipartite graph Kn,n. (m1 < ne) admits a straight-line RAC 
drawing if either n; < 2, or nj = 3 and ng < 4. [DEL10] 


4A 1-planar graph is a graph that admits a drawing where each edge is crossed at most once. 
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F106: There exists a planar graph G such that any straight-line RAC drawing of G 
requires Q(n?) area. [ACD*11] 


F107: A RAC drawing with n vertices and at most one bend per edge has at most 
6.5n — 13 edges. [AF Kt 12] 


F108: A RAC drawing with n vertices and at most two bends per edge has at most 
74.2n edges. [AFK*12] 


F109: Every graph admits a RAC drawing with at most three bends per edge and 
O(n*) area. [DEL11] 


F110: Every graph admits a RAC drawing with at most four bends per edge and 
O(n?) area. [DDLM11] 


F111: Every n-vertex graph with maximum vertex degree six admits a RAC drawing 
with at most two bends per edge and O(n”) area. [ACD*11] 


F112: Every n-vertex graph with maximum vertex degree three admits a RAC drawing 
with at most one bend per edge and O(n?) area. [ACD*11] 


F113: A straight-line ACL, drawing with n vertices has at most *(3n — 6) edges. 
[DGMW11] 


F114: A straight-line ACE, drawing with n vertices has at most 3(3n — 6) edges. 
[AFT 11] 


F115: An ACE,, drawing with n vertices and at most one bend per edge has at most 
27n edges. [AFT11] 


F116: An ACE, drawing with n vertices and at most two bends per edge has at most 
A77n edges. [AFT11] 


F117: Every graph with n vertices admits an ACL, drawing with at most one bend 
per edge and area O(n”). [DDLM11] 


REMARK 


R20: We conclude this section by mentioning that recently RAC simultaneous embed- 
dings, i.e., simultaneous embedding where crossings between edges of the two different 
graphs form right angles, have been introduced and studied by Argyriou et al. [ABKS12] 


Drawings with Few Slopes 


In a paper by Dujmovié et al. [DESW07] a new aesthetic requirement is introduced for 
straight-line drawings of graphs: the minimization of the number of distinct segment 
slopes used in the drawing. The idea is that using few slopes, especially if chosen from a 
“nice” set of slopes, should increase the readability. As an example, consider orthogonal 
drawings where only two “nice” slopes are used: the horizontal one and the vertical one. 
The study of straight-line drawings using few slopes is related to the study of the slope 
number of a graph introduced by Wade and Chu [WC94]. The slope number of a graph 
G is the minimum number of segment slopes in any straight-line drawing of G. It is 
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immediate to see that the slope number of G is at least [¢] and at most m, where d 
is the maximum vertex degree and m is the number of edges of G. More recently, the 
study of polyline drawings of graphs with the minimum number of slopes has also been 
considered. 


DEFINITIONS 


D73: Let T be a polyline drawing of a graph G. Let s be a segment of I and let @ be 
the straight-line containing s. The slope of s is the angle swept from the X-axis in the 
anticlockwise direction to @ (and is thus a value in [0,7)). 


D74: Let G bea graph. The slope number of G is the minimum number of slopes 
needed to construct any straight-line drawing of G. 


D75: Let G be a planar graph. The planar slope number of G is the minimum 
number of slopes needed to construct any planar straight-line drawing of G. 


D76: Let G be a plane graph, i.e., a planar graph with a fixed planar imbedding. The 
plane slope number of G is the minimum number of slopes needed to construct any 
planar straight-line drawing of G that preserves the given planar imbedding. 


FACTS 
F118: The plane slope number of every tree with maximum degree A is [4] . [DESW07]| 


F119: The plane slope number of every maximal outerplanar graph with n vertices is 
at most n. [DESW07] 


F120: The (outer)planar slope number of every outerplanar graph with maximum 
vertex degree d vertices is at most d— 1. For every d > 4 there exists an outerplanar 
graph of maximum vertex degree d whose (outer)planar slope number is at least d— 1. 
[KMW12] 


F121: The plane slope number of every n-vertex plane 2-tree is at most 2n — 3. For 
every n > 3 there exists an n-vertex plane 2-tree whose plane slope number is at least 
2n — 3. [DESW07| 


F122: The planar slope number of every n-vertex 2-tree is at most $n. [DESWO7] 
F123: The plane slope number of every n-vertex plane 3-tree is at most 2n — 2. For 
every n > 4 there exists an n-vertex plane 3-tree whose plane slope number is at least 


2n — 2. [DESWO7] 


F124: The planar slope number of every n-vertex planar 3-tree of maximum vertex 
degree d is at most d°. [JJK*] 


F125: The plane slope number of every n-vertex plane graph is at most 2n — 10. 
[DESW07]| 


F126: For every d > 5 there exist graphs with maximum vertex degree d whose slope 
number is arbitrarily large. [PP06, DSW07] 
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F127: The slope number of every graph with maximum vertex degree 3 is at most 4. 
[MP 12] 


F128: The planar slope number of every planar graph with maximum vertex degree d 
is at most 2°, [KPP11] 


F129: Every planar graph with maximum vertex degree d admits a polyline drawing 
with at most 1 bend per edge using at most 2d slopes. [KPP11] 


F130: Every planar graph with maximum vertex degree d > 3 admits a polyline 
drawing with at most 2 bends per edge using at most [¢] slopes. The only exception 
is the graph formed by the edges of an octahedron, which is 4-regular, but requires 3 


slopes. [KPP11] 


Witness Proximity and Approximate Proximity Drawings 


In a witness proximity drawing we look at a set of points that represent the vertices 
and at a set of points that play the role of the witnesses. The existence/absence of an 
edge in the drawing depends on the location of the witness points (the set of witness 
points and the set of points representing the vertices of the graph in a drawing may not 
coincide). 


DEFINITIONS 


D77: Ina positive witness proximity drawing I two vertices u,v are adjacent if 
and only if the proximity region of u and v contains at least one vertex that belongs to 
the set of witness points. 


D78: Ina negative witness proximity drawing u and v are adjacent if and only 
if their region of influence does not contain any of the witness points (it may, however, 
contain other vertices of the graph that are not witnesses). 


FACT 


F131: Witness proximity has been introduced and studied in a series of papers by 
Aronov, Dulieu, and Hurtado [ADH, ADH11la, ADH11b]. 


Proximity drawability imposes severe restrictions on the families of the representable 
graphs. For example, a tree can be realized as a minimum spanning tree only if its vertex 
degree is at most six. In order to overcome restrictions on the combinatorial structure 
of the drawable graphs, recent papers study straight-line drawings of graphs that are 
“good approximations” of proximity drawings. 


DEFINITIONS 


D79: A (1+¢)-EMST drawing is a planar straight-line drawing of a tree such that, 
for any fixed € > 0, the distance between any two vertices is at least Th the length of 
the longest edge in the path connecting them. 


D80: Let D be a disk with center c and radius r, and let ¢; and €2 be two nonnegative 
real numbers. The €1-shrunk disk of D is the disk centered at c and having radius arose 
the €2-expanded disk of D is the disk centered at c and having radius (1+ €2)r. An 
(€1,€2)-proximity drawing is a planar straight-line drawing where the proximity region 
of two adjacent vertices is defined by using ¢,-shrunk disks, while the region of influence 
of two nonadjacent vertices uses €9-expanded disks. 
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FACTS 


F132: in [DDLM10, DDLM12a] it is shown that every tree T has a (1 + ¢)-EMST 
drawing for any given ¢ > 0 and that this drawing can be computed in linear time in 
the real RAM model of computation. 


F133: In [EGK*12] it is proved that one can arbitrarily approximate a proximity 
drawing of any planar graph for some of the most studied definitions of proximity. 
Namely, it is shown that for any positive values of €,,€2 an embedded planar graph 
admits an (€1,€2)-Gabriel drawing and an (€1, €2)-Delaunay drawing and an (€1, €2)-G- 
drawing ( 1 < 6 < co) that preserve the given embedding. 


REMARKS 


R21: The previous results are, in a sense, tight since for each of the above types 
of proximity rules there are embedded planar graphs that do not have an embedding 
preserving (€1,€2)-proximity drawing with either ¢; = 0 or e2 = 0. 


R22: Approximate proximity drawings generalize weak proximity drawings. Namely, 
an (€1,€2)-proximity drawing is a weak proximity drawing if ¢; = 0 and €2 = on. 


Cluster Planarity 


When the graph to be displayed consists of hundreds or thousands of vertices, a complete 
visualization of the data is typically not effective for the user. To handle large graphs and 
their visualizations, a lot of attention has been devoted in the last decade to the study 
of graphs whose vertices are organized into a hierarchy of clusters. Making clusters with 
the vertices helps in the analysis of complex relational data, since it makes it possible to 
explore large graphs at different levels of detail by collapsing or expanding the clusters. 


DEFINITIONS 


D81: A clustered graph C = (G,T), also called c-graph, consists of an undirected 
graph G and a rooted tree T called an inclusion tree of C, such that: (i) the leaves 
of T are the vertices of G; (ii) each internal node yz of T has at least two children and 
represents a cluster V(,1) of the vertices of G that are leaves of the subtree rooted at wy. 


D82: A c-planar drawing (resp. an oc-planar drawing) of a c-graph C is a 
visualization of C' in the plane such that each vertex v of G is drawn as a point p(v), 
each edge of G is drawn as a simple Jordan arc, and each node yp of T is drawn as 
a simple closed region R(j) according to the following rules: (1) R() contains the 
drawing of G(m). (2) If V(u) Cc Viv), R(v) contains R(p). (3) If v ¢ V(p), p(v) is 
outside R(j). (4) There is no edge crossing, i.e., any two edges of G never cross. (5) 
There is no edge-region crossing, i.e., there is no edge of G that crosses the boundary of 
a region R(w) twice. 


D83: A c-graph is c-planar if it admits a c-planar drawing 
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FACTS 


F134: The problem of testing whether a c-graph is c-planar was first introduced in a 
paper by Feng, Cohen, and Eades [FCE95b] that inspired and motivated a sequence of 
papers on this topic. Feng et al. [FCE95b] describe a quadratic-time c-planarity testing 
algorithm for clustered graphs where each cluster induces a connected subgraph. 


F135: Linear-time testing algorithms for the same class of clustered graphs as the one 
studied by Feng, Cohen, and Eades are described in [CDFT08, Dah98, DKMO6]. 


REMARK 


R23: Feng et al. leave as open the problem of testing a c-graph for c-planarity when 
clusters can induce non-connected subgraphs; the time complexity of this problem is 
still unknown. 


FACTS 


F136: Special cases for which polynomial-time testing algorithms exist have been 
described in the literature, such as [DF09, JKK*09]. 


F137: The relationship between planarity and c-planarity has been studied in [CW06] 
and a planarization algorithm for c-graphs that are not c-planar is described in [DDM02]. 


F138: Assuming that a clustered graph C is c-planar, several algorithms and bounds 
are known for constructing c-planar drawings of C. [DDF09, EFN99, NKO7]| 


F139: Every c-planar graph admits a straight-line convex drawing [EFLNO6], even if 
the shape of each cluster is fixed in advance [AFK11]. Straight-line convex drawings 
might require exponential area [FCE95a]. 


F140: Extensions of the studies above to the case of a clustered graph with clusters 
that can partially intersect can be found in [DGL08, OS07]. 


10.3.8 Sources and Related Material 


Several books devoted to graph drawing have been published [Kam89, DETT99, 
JM03, KW01, NRO4, Sug02]. The proceedings of the annual Symposium on Graph 
Drawing are published by Springer-Verlag in the Lecture Notes in Computer Science 
series (volumes 7034, 6502, 5849, 5417, 4875, 4372, 3843, 3383, 2912, 2528, 2265, 1984, 
1731, 1547, 1353, 1190, 1027, 894). 


REMARKS 


R24: Special issues devoted to graph drawing have appeared in Algorithmica (vol. 
16, no. 1, 1996), Computational Geometry: Theory and Applications (vol. 9, no. 1-2, 
1998), the Journal of Visual Languages and Computing (vol. 6, 1995), International 
Journal of Foundations of Computer Science (vol. 17, no. 5, 2006), and the Journal of 
Graph Algorithms and Applications (vol. 3, no. 4, 1999; vol. 4, no. 3, 2000; vol. 6, no. 
1, 2002; vol. 6, no. 3, 2002; vol. 7, no. 4, 2003; vol. 8, no. 2, 2004; vol. 9, no. 1, 2005; 
vol. 9, no. 3, 2005; vol. 11, no. 2, 2007; vol. 12, no. 1, 2008; vol. 13, no. 3, 2009; vol. 
14, no. 1, 2010; vol. 15, no. 1, 2011; vol. 16, no. 1, 2012). 


Section 10.3. Drawings of Graphs 1277 


R25: Sites with pointers to graph drawing resources and tools include the WWW 
page maintained by Tamassia (http://www.cs.brown.edu/people/rt/gd.htm1), the 
WWW page maintained by Brandes (http: //graphdrawing.org/), the Graph drawing 
e-print archive (http://gdea.informatik.uni-koeln.de/), and the Graph-Archive 
(http://www. graph-archive.org/doku. php). 
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INTRODUCTION 


In this section we demonstrate algorithms for the recursively defined classes of trees, 
series-parallel graphs, treewidth-k graphs, cographs, cliquewidth-k graphs, and k-HB 
graphs. For convenience, definitions of these graph classes, which appear in §2.4 (among 
others given there), are repeated here. Our emphasis in this section is on the solution 
technique for each graph class and less so on the variety of problems solvable within 
each class per se. Accordingly, for each graph class, we will consider the problem of 
finding an independent set in a given graph (§5.3); where it is natural to do so, some 
additional problems are solved as well. For a much broader accounting using this same 
format and one that includes a host of other problems such as clique, dominating set, 
vertex coloring, matching, hamiltonian cycle, hamiltonian path, and others, the reader 
is directed to [BoPaTo08}. 


DEFINITIONS 


D1: A recursively constructed graph class is defined by a set (usually finite) of 
primitive or base graphs, in addition to one or more operations (called composition 
rules) that compose larger graphs from smaller subgraphs. Each operation involves 
fusing specific vertices from each subgraph or adding new edges between specific vertices 
from each subgraph. 
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D2: Each graph in a recursive class has a corresponding decomposition tree that 
shows how to build it from base graphs. 


D3: For a graph G, a set S of vertices is an independent set if no two vertices in S 
are adjacent. 


Algorithm Design Strategy 


Efficient algorithms for problems restricted to recursively constructed graph classes 
typically employ a dynamic programming approach as follows: first solve the problem 
on the base graphs defined for the given class; then combine the solutions for subgraphs 
into a solution for a larger graph that is formed by the specific composition rules that 
govern construction of members in the class. A linear-time algorithm is achieved by 
determining a finite number of equivalence classes that correspond to each node in a 
member graph’s decomposition tree. The number of such equivalence classes is constant 
with respect to the size of the input graph, but may depend upon a parameter (k) 
associated with the class. A polynomial algorithm can often be created if the number of 
equivalence classes required for the problem grows only polynomially with input graph 
size. Also key is that a graph’s decomposition tree be given or be computable efficiently. 


NOTATION: In the descriptions of the algorithms in this section, we use G.x to denote an 
attribute of a given graph G. When we write G.c = mazimum-cardinality independent 
set, then G.x carries with it two pieces of information: the size of a maximum-cardinality 
independent set, and one particular instance of such a set. Moreover, these two pieces 
of information are carried forward in computations and assignments involving G.x. 


REMARK 


Ri: When the simplest version of a problem (cardinality or existence) can be solved 
using a dynamic programming approach, then other more complicated versions (involv- 
ing vertex or edge weights, counting, bottleneck, min-max, etc.) can generally also be 
routinely solved. Following, we begin with the simple recursive class of trees. 


10.4.1 Algorithms on Trees 


DEFINITION 


D4: The graph with a single vertex r (and no edges) is a tree with root r (the sole 
base graph). Let (G,r) denote a tree with root r. Then (Gi,11) © (G2,r2) is a tree 
formed by taking the disjoint union of G; and G2 and adding an edge (11, r2). The root 
of this new tree is r = 1}. 


TERMINOLOGY NOTE: Technically, the pairs (Gr) in Definition D4 denote rooted trees. 
However, the specification of distinguished vertices r; and rz (and hence r) is relevant 
here only as a vehicle in the recursive construction. 


NOTATION: Given any tree (or subtree) G, the designated root is denoted by root|G]. 
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Maximum-Cardinality Independent Set in a Tree 


FACTS 


F1: Any independent set of vertices either includes root[G] or does not. Whether or 
not independent sets of two trees G1,Gz2 can be combined into an independent set of 
the tree G, 6 Gz depends only on these inclusions. 


NOTATION: The following notation is used in the description of Algorithm 10.4.1 below. 


e G.a = max-cardinality independent set that includes root[G]. 
e G.b = max-cardinality independent set that excludes root[G]. 


e G.c = max-cardinality independent set. 


F2: The following multiplication table suffices to describe all possible types of out- 
comes from the composition G; 6 Gz. In the table, rather than displaying G.a and 
G.b we simply specify a and b, respectively. The row by column product assumes the 
convention where subgraph (i.e., subtree) G, is on the left and subtree Gz is on the 
right. 


®B}lalb 
a 
b |b 


REMARK 


R2: The values for G.a, G.b, and G.c are known trivially for the base graph — a 
single-vertex tree (which is its own root). For composed graphs, the values may be 
computed via O(1) additions and comparisons across the outcomes in the table. There 
is only one product producing a possible member of G.a while G.b may be produced 
from a pair of possible products; the maximum of these yields the desired G.b. The final 
step in Algorithm 10.4.1 computes G.c, which, at the root of the decomposition tree, is 
the solution. 


Algorithm 10.4.1: Maximum-Cardinality Independent Set in a Tree 


Input:  TreeG=(V,E). 
Output: G.c (size and instance of a max-cardinality independent set in G). 
If |V| =1 
Gal 
G.b<—0 
Else 
IfG=G,@G2 
G.a< G\.a+ Go.b 
G.b © max {G,.b + Go.a, G,.b + G2.b} 
G.c © max {G.a, G.b} 


COMPUTATIONAL NOTE: ‘The decomposition tree for trees is easy to determine and, 
accordingly, can be assumed to be part of the instance. 
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Maximum-Weight Independent Set in a Tree 


Here we assume that each vertex in the graph is assigned a weight. Switching to a 
weighted version of the independent set problem is straightforward. 


NOTATION: The notation used in the Algorithm 10.4.2 is as follows: 
e G.d = max-weight independent set containing root[G]. 
e G.e = max-weight independent set without root|G]. 


e G.f = max-weight independent set. 


Algorithm 10.4.2: Maximum-Weight Independent Set in a Tree 


Input: TreeG=(V,E). 
Output: G.f (weight and instance of a max-weight independent set in G). 
If |V| =1 
G.d © weight (root[G]) 
G.e<« 0 
Else 
IfG=G,6G2 
G.d + G,.d + Go.e 
G.e + max {G}.e + Go.d, Gi.e + Go.e} 
G.f <— max {G.d,G.e} 


EXAMPLE 


El: Consider the tree T shown in Figure 10.4.1. Vertices are labeled t,u,...,z 
and beside each label is a vertex weight. Algorithms 10.4.1 and 10.4.2 are applied and 
the computations are summarized by the listing on the right. The 6-tuples aligned 
with each composed subgraph, G;,, correspond to values [G.a, G.b, G.c, G.d, G.e, G.f]. 
The maximum cardinality and maximum weight of any independent set is G.c = 5 
and G.f = 16, respectively; these are read from the computation for G 3. Standard 
backtracking can be applied to determine that the explicit solutions are sets {t, v, w, y, 2} 
and {u, y, z}, respectively. 


REMARKS 


R3: Many other problems such as variations of the vertex cover, dominating set, 
matching, or longest path problems could have been selected to represent the basic 
computation on trees (cf. [BoPaTo92], [BoPaTo08]). Note that some problems such as 
minimum bandwidth [GaGrJoKn78] are NP-complete on trees. 


R4: A number of problems are trivial when restricted to trees. For example, any 
tree with at least 2 vertices has maximum clique size of 2 and chromatic number 2; 
otherwise these are both 1. Also, no tree can contain a hamiltonian cycle, and a tree 
has a hamiltonian path if and only if it is a path. 
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v,3 y,5 
u,7 t,1 2,6 
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Go = {u} (1; 054, 7. 05°, 
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Gg G4 Gio G4 = {w} [1, 0, 1, 2, 0, 2 
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U t << uU t x 
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Figure 10.4.1: Max-cardinality and max-weight independent sets in a tree. 


R5: ~The definition and number of equivalence classes that are required to solve a 
given problem depend on both the graph class and the problem to be solved. The effect 
of graph class is demonstrated throughout in subsequent subsections of this section. 
To provide a sense of dependence on the problem, let the distance in connected graph 
G = (V, E) between U C V and W C V be the shortest (edge) length of a path from any 
u € U to any w € W, and let a K-independent set be a subset of V containing no two 
distinct vertices at distance « or less. Finding a maximum-cardinality «-independent set 
requires «+2 equivalence classes G.c and G.i: i =0...4, where: G.c is the maximum- 
cardinality «-independent sets; G.« denotes maximum-cardinality «-independent sets 
at distance at least & to the root; Gi: 7 = 0...«% —1 denotes maximum-cardinality 
«-independent sets at distance 7 to the root; multiplication table entry G1.i, G2.7 = min 
{i,j +1} ift+ 9 > « and null otherwise. Note that the resulting algorithm may be 
superlinear if « is not O(1). 


10.4.2 Algorithms on Series-Parallel Graphs 


Every series-parallel graph can be composed from single edges using only the three 
composition rules given in Definition D5 below. For illustrations of these rules, see 
§2.4.1. 


DEFINITION 


D5: A series-parallel graph with distinguished terminals | and r is denoted 
(G,l,r) and is defined recursively as follows: 
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(base graph) The graph consisting of a single edge (v, v2) is a series-parallel graph 
(G,l,r) with | = v, and r = vp. 


e The series operation (Gi,11,71) ©s (Go, l2,r2) forms a series-parallel graph by 
identifying r, with Jj. The terminals of the new graph are J, and ro. 


e The parallel operation (Gi, 1,11) ©p (G2, le, r2) forms a series-parallel graph by 
identifying 1; with lz and r; with rg. The terminals of the new graph are J, and 
T1- 


The jackknife operation (Gi,l1,71) ©; (G2, l2,r2) forms a series-parallel graph 
by identifying r, with l2; the new terminals are J; and rj. 


REMARK 


R6:  Series-parallel graphs are recognizable, and their decomposition trees can be con- 
structed in linear time (see §2.4.3). This leads to fast, often linear-time, dynamic pro- 
gramming algorithms for many problems when instances are confined to series-parallel 
graphs. 


Maximum-Cardinality Independent Set in a Series-Parallel Graph 


In the case of trees, only a single point of composition involving constituent subtrees, 
the root vertex, was relevant; a series-parallel graph G has two such points, its terminal 
vertices. These are denoted le ft|G] and right[G] in the following description. 


NOTATION: Algorithm 10.4.3 uses the following notation: 
e G.a = max cardinality independent set containing both le ft/G] and right[G]. 
e G.b = max cardinality independent set with left[G] but not right[G]. 
e G.c = max cardinality independent set with right[G] but not le ft[G]. 
e G.d = max cardinality independent set with neither left{G] nor right[G]. 


e G.e = max cardinality independent set. 


Multiplication Tables for Series, Parallel, and Jacknife Opera- 
tions 


©, |al|bic|d Op |a}|bjc}d O;|a}lbjycid 
a |b aja ala 
a |b b b |b 
c |cl/d c c Go|? |G 
d c|d d d d dj|d 
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Algorithm 10.4.3: Maximum-Cardinality Independent Set 
in a Series-Parallel Graph 


Input: — Series-parallel graph G = (V, E). 
Output: G.e (size and instance of a max-cardinality independent set in G). 
If |E|=1 
[G.a, G.b, G.c, G.d] <— [—o0, 1, 1, 0] 
Else 
If G=G1 Os Ge 
G.a + max {G,.a+ Go.a—1,G1.b+ Go.c} 
G.b © max {G.a+ G2.b — 1,G1.b + Go.d} 
G.cé max {G1.c+ Gg.a—1,G1.d+ Go.c} 
G.d «+ max {Gi.c + Go.b-—1,Gy.d+ G2.d} 


Else 
If G= G1 Op Go 
G.a<¢ G).a+ Go.a— 2 
G.b< G).b+ Go.b—-1 
G.c& Gy.c+ Go.c—1 
G.d¢+ G,.d+ Go.d 
Else 


IfG=G, OF Go 
G.a+ G,.a + max {Go.a,G2.b} — 1 
G.b eH G,.b + max {Go.c, G2.d} 
G.ct Gy.c + max {G2.a,G2.b} — 1 
G.d< G.d + Max {Go.c, G2.d} 
G.e + max {G.a, G.b, G.c, G.d} 


COMPUTATIONAL NOTE: Subtraction of values 2 and 1 in the respective computational 
expressions above avoids multiple counting when terminal vertices are fused. 


EXAMPLE 


E2: Algorithm 10.4.3 is demonstrated on the series-parallel graph G given to the 
left in Figure 10.4.2. Vertices are labeled in G as shown and to the right, the explicit 
computation is summarized. The 4-tuples exhibit values for G.a,G.b,G.c, and G.d. 
From the last computation, it follows that either G.b or G.d produces an optimum. In 
the first case, the set is {z,v,w}, while in the second, we have {y, v, w}. 


Subgraph 
Gi = (z,y) [—o0, 1, 1,0 
Go = (y, 2) [—oo, 1, 1,0 
. 7 G3 = (z,2) [—oo, 1, 1,0 
‘ G4 = (2, v) [—oo, 1,1,0 
Se Gs = (#,w) [—oo, 1, 1,0 
Ge = G1 Os Go [2, , i 
7] Ww G7 = G6 Op G3 [-o0, 1,1,1 
G Gg = G7 Oj Ga [—o0, 2,1,2 
Gg = Gg Oj Gs [—o0, 3,1,3 


Figure 10.4.2: Max-cardinality of an independent set in a series-parallel graph. 
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FACTS 


F3: Other problems solvable in linear time on series-parallel graphs include variations 
of clique, dominating set, matching, hamiltonian path, and hamiltonian cycle. Indeed, 
a series-parallel graph can have at most one hamiltonian cycle (cf. [Sy83]). 


F4: A series-parallel graph has chromatic number 3 if it is not bipartite; otherwise it 
has chromatic number 2 (because it has at least one edge). 


REMARKS 


R7: Following Fact F3, solving the traveling salesman problem (cf. §4.6) on series- 
parallel graphs reduces to deciding hamiltonicity. 


R8: Following Fact F4, the chromatic number for a series-parallel graph can be deter- 
mined in linear time by using depth-first search to simply test for the existence of an 
odd cycle. 


R9: In some problem settings, the jackknife operation can simply be neglected. For 
example, a hamiltonian graph must be 2-connected (i.e., no cut-vertices) but the jack- 
knife operation destroys this property; hence, the jackknife operation is not relevant in 
this case. On the other hand, if the aim is deciding the existence of a hamiltonian path, 
the jackknife operation is relevant. 


R10: Solutions to numerous other problems on series-parallel graphs follow the ma- 
chinery demonstrated by Algorithm 10.4.3. In addition to the references already cited in 


Remark R2, a good basic source dealing with problems such as vertex cover, maximum 
eulerian subgraph, Steiner subgraph, edge-covering, etc. is [Ri85]. 


10.4.3. Algorithms on Treewidth-s/ Graphs 


DEFINITIONS 


D6: A tree-decomposition of a graph G = (V, E) is a pair ({X; | i € I}, T), where 
{X; | 7 € I} is a family of subsets of V, and T is a tree with vertex set J such that: 


© Uier Xi =V, 

e for all edges (x,y) € E there is an element i € I with x,y € X;, and 

e for all triples i,j,k € I, if j is on the path from 7 to k in T, then X; (| Xp C Xj. 
D7: The width of a given tree-decomposition is measured as max j¢7{|X;| — 1}. 


D8: The treewidth of a graph G is the minimum width taken over all tree-decompositions 
of G. 


D9: A graph G is a treewidth-k graph if it has treewidth no greater than k. 
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FACT 


F5: Every treewidth-k graph has a tree-decomposition T such that T is a rooted binary 
tree [Sc89]. We write 
(G,X) = (G1, X1) ® (Ga, X2) 


where X C V is the set of vertices of G associated with root|T], and graphs G; and G2 
have tree-decompositions given by the left and right subtrees of T. This is enough to 
produce linear-time dynamic programming algorithms for many problems on treewidth- 
k, graphs, because each |X| < k +1. 


Maximum-Cardinality Independent Set in a Treewidth-k Graph 


NOTATION: For a graph G, let binary tree T be a tree-decomposition of G and let X C V 
be the set of vertices of G associated with root/T]. Also, 


e G[S] = max-cardinality independent set that contains S C X but not X — S. 


e G.max = max-cardinality independent set. 


Algorithm 10.4.4: Maximum-Cardinality Independent Set ina 
Treewidth-k Graph 


Input: Treewidth-k graph G = (V, E). 
Output: G.maz (size and instance of a max-cardinality independent set in G). 


If xX =V 
For all SC X 
If S contains two adjacent vertices 
G[S] < —oo 
Else 
G[S] < |S| 
Else 
If (G, X) = (Gi, X1) ® (Go, X2) 
For all SC X 
If S contains two adjacent vertices 
G[S] < —oo 
Else 


G[S] — max{Gi[Si] + G2[S2] —|519 S| —|S2NS|+ |S]: 
Si C X41, 82 C X2,51NX =SNX1,52NX =SNX2} 
G.maxz + max{G[S]:S Cc xX} 


EXAMPLE 


E3: Algorithm 10.4.4 is demonstrated on the treewidth-2 graph G shown in Figure 
10.4.3. T is a binary rooted tree-decomposition of G, and each 8-tuple exhibits values 
for G[S] for each S C X. However, only values larger than —oo within each 8-tuple are 
shown as the computation progresses. The maximum independent set has size 4, and 
the explicit solution is {a,c, e, g}. 


1300 Chapter 10. Graphs in Computer Science 
Subgraph 


(G1, X1 = {a,}, f}) 
0 — 0, {a} — 1, {0} — 1, {f} — 1, {, f} — 2] 


(G2, X2 = {b, c, d}) 
0 — 0, {b} — 1, {c} — 1, {d} > ]] 


(G3, X3 = {d,e, f}) 
(0 > 0, {d} — 1, fe} + 1, {f} - ]] 


(G4, X4 = {b, d, f}) = G2 @G3 
@ — 2, {b} — 2, {d} > 1, {f} — 2, {6, f} > 2] 


(Gs, X5 = {b, f, 9}) =G) @ G4 
0 — 3, {b} — 2, {f} — 2, {g} — 4, {b, f} — 2] 


Figure 10.4.3: Maximum-cardinality independent set in a treewidth-2 graph. 


REMARK 


R11: Many other problems including variations of independent set, dominating set, 
clique, m-vertex coloring (for arbitrary, fixed m), matching, and hamiltonian cycle/path 
problems can be solved in linear time for treewidth-k graphs (cf. [BoPaTo92]). 


Monadic Second-Order Logic Expressions for a Graph 


NOTATION: Let variables v; denote a vertex with domain V, e; denote an edge with 
domain E, V; denote a vertex set with domain 2” (subsets of V), and E; denote an 
edge set with domain 2”. 


DEFINITION 


D10: Monadic second-order logic (MSOL) for a graph G = (V, E) is a predicate 
calculus language in which predicates are constructed recursively as follows: 
e MSOL contains primitive predicates such as v4, = vj, ui € Vj, e, € Ej, and 
Incident (v;, €;). 
e If P and Q are MSOL predicates then each of (=P), (PA Q), and (P V Q) is also 
a MSOL predicate. 
e If P is a MSOL predicate and « is any variable, then (Sx)(P) and (Vx)(P) are 
also MSOL predicates. 


EXAMPLE 
E4: Some simple MSOL predicates are listed below. 


P+>Qs-PVQ 

PHQs(P>Q)A(Q-P) 

e; =e; > (Vv1) (Incident(v1, e;) + Incident(v1, e;)) 

Adjacent(v;, vj) = 7(vi = vj) A (de1) (Incident(v;, e1) A Incident(v;, e1)) 
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MSOL-Expressible Graph Problems 


Many important graph problems can be expressed in MSOL (cf. [Co90], [ArLaSe91], 
[BoPaTo91], [BoPaTo92], [CoMo93]). Here is a sampling of several such problems. 


IndependentSet(V1) <= (Vv2) (Vvs) ((ve € Vi A v3 € Vi) > 7 Adjacent(va, v3)) 
Clique(Vi) = (Vv2) (Vv3) ((v2 € Vi A v3 € Vi) > Adjacent(v2, v3)) 
DominatingSet(V) = (Vv2) (v2 € Vi V (Avs) (v3 € Vi A Adjacent(v2, v3))) 
VertexColorable, (Vi, ..., Vn) = (Vvo) (vo € Vi V..-V vo € Vm) 
A IndependentSet(V1) A...A IndependentSet (Vim) 
Matching(E,) 4 (Veo) (Ves) ((e2 € EF; Aes € Ey A 7 (e2 = e3)) —-> 
=(Av4) (Incident(v4, e2) A Incident(v4, e3))) 
Connected(E; ) a (VV2) (VVs3) (4 dv4) (v4 € Va) Va (Avs) (v5 € V3) V 
v6) (> (ve € V2) A 7 (ve € V3)) V 
Jez) (Avg) Avg) (e7 € FE, A vg € Vo A vo € V3 A 
Incident(vg, e7) A Incident(vg9, e7))) 
HamCycle(E,) < Connected(E,) A (Vv2) (de3) (des) (eg € Ex A e4 € Ey A 
a(e3 = eg) A Incident(ve, es) A Incident(v2, e4) A 
(Ves) ((e5 EE, A Incident(v2, es )) = (e5 =e3 Veé5 = e€4))) 
HamPath(E1) <= Connected(E,) A (Vv2) (de3) (des) (e3 € Er A e4 € Ey A 
Incident(v2, e3) A Incident(v2, e1) A 
(Ves) ((e5 € Ey A Incident(ve, e5)) 4 (e5 = eg V e5 = e4)) A 
(Sve) (de7) (Veg) ((eg € Ex A Incident(ve, eg)) + es = e7)) 


ae 


— 


— 


— 


=~ 


FACTS 


F6: Every MSOL-expressible problem can be solved in linear time for treewidth-k 
graphs [Co90], [ArLaSe91], [BoPaTo92], [CoMo93]. Moreover, this is the case for many 
variations of each MSOL problem, including existence, minimum or maximum cardinal- 
ity, minimum or maximum total weight, minimum-maximal or maximum-minimal sets, 
bottleneck weight, and counting. 


F7: Once a problem is expressed in MSOL, a linear-time dynamic programming 
algorithm can be created mechanically [BoPaTo92]. 


F8: The chromatic number problem (§5.1) for treewidth-& graphs is solvable in linear 
time, because every treewidth-k graph possesses a vertex coloring with at most k + 1 
colors. 


F9: For some problems, a MSOL expression cannot be written and a linear-time 
algorithm cannot be found. In these cases it may still be possible to develop a linear- 
time algorithm via an extension to MSOL [BoPaTo92], or to develop a polynomial-time 
algorithm. Polynomial time is achieved by constructing a polynomial-size data structure 
that corresponds to each node in the tree decomposition (see Remark R1; also see [Bo95] 
and [BoPaTo08}). 


REMARK 


R12: The literature contains hundreds of linear-time algorithms for problems on 
trees, series-parallel graphs, treewidth-k graphs, and related classes. Many of these 
algorithms are predicted by the results cited in Fact F6. For example, all of the linear- 
time algorithms given in [BoPaTo08] for problems on trees, series-parallel graphs, and 
treewidth-k graphs are predicted by these results. 
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Two Open Problems 


The hidden constant in the running time of a mechanically created algorithm can grow 
superexponentially with the number of quantifiers 4 and V present in the formula, so 
Fact F7 is impractical for complex formulas. Ad hoc methods often suffice to design an 
equivalent linear-time dynamic programming algorithm with a small hidden constant, 
but a computationally practical algorithm remains elusive. However, progress has been 
made for certain kinds of problems and graph classes [K198], [SaHuTaOg00], [Ka01], 
[KIMoSc02]. 


Open Problem 1. Determine an optimally efficient algorithm to create a linear time 
dynamic programming algorithm given an MSOL expression. 


Open Problem 2. Determine a procedure that given an MSOL expression produces a 
linear-time dynamic programming algorithm with minimum hidden constant. 


REMARKS 


R13: Chromatic index can be solved on treewidth-k graphs in polynomial time, by 
constructing a polynomial-size data structure that corresponds to each node in the tree 
decomposition (cf. [Bo90]). More recently, a linear-time algorithm has been developed 
for this chromatic index problem (cf. [ZhNaNi96}). 


R14: Algorithms on treewidth-k graphs can be adapted to solve the same problems on 
related classes such as Halin graphs, partial k-trees, bandwidth-k graphs, pathwidth-k 
graphs, branchwidth-k graphs, and k-terminal graphs (cf. [Wi87], [WiHe88]). 


10.4.4 Algorithms on Cographs 


One can produce linear-time dynamic programming algorithms for problems related to 
cographs. Here, in addition to the maximum-cardinality independent-set problem, we 
present algorithms for several other problems. 


Algorithm 10.4.5: Maximum-Cardinality Independent Set 
in a Cograph 


Input: | Cograph G = (V, E). 
Output: Gi (size and instance of a max-cardinality independent set in G). 
If |V| =1 
Gicl 
Else 
IfG=G,UG2 
Gic Grit Goi 
Else 
[fG= Gi x Go 
Gic max{G}.1, G2.i} 


DEFINITIONS 
D11: A clique in a graph is a maximal set of pairwise adjacent vertices (§5.3). 


D12: The chromatic number of a graph G is the minimum number of colors that 
can be used to color the vertices of G so that no two adjacent vertices get the same 
color (§5.1). 
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D13: <A dominating set in a graph G is a subset S of vertices such that every vertex 
in G is either in S or adjacent to a vertex in S (89.2). 


D114: A matching in a graph is a set of edges no two of which have an endpoint in 
common. 


D15: A cograph is defined recursively as follows: 
e (base graph) A graph with a single vertex is a cograph. 
e If G; and Gp are cographs, then the disjoint union G'; U G2 is a cograph. 
e If G; and G2 are cographs, then the cross-product G, x G2 is a cograph, which is 


formed by taking the union of G; and G2 and adding all edges (v1, v2) where v1 
is in G, and v9 is in Go. 


Three More Algorithms for Cographs 


Algorithm 10.4.6 solves both the maximum-clique and chromatic number problems in a 
cograph. 


Algorithm 10.4.6: Maximum-Clique and Chromatic Number 
in a Cograph 


Input: Cograph G = (V, £). 
Output: G.c (maximum clique in G, which is equal to chromatic number of G). 
If |V| =1 
G.ce+ 1 
Else 
If G=GiUG2 
G.c + max{G}.c, G2.c} 
Else 
If G=G1 x Ge 
G.ce Gi.c+ Go.c 


Algorithm 10.4.7: Minimum-Cardinality Dominating Set 
in a Cograph 


Input: | Cograph G = (V, E). 
Output: G.d (size and instance of a min-cardinality dominating set in G). 
If |V| =1 
Gd 1 
Else 
If G=G,UG, 
Else 
IfG= G, x Go 
G.d © min{G.d, Ga.d, 2} 
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Algorithm 10.4.8: Maximum-Cardinality Matching in a 
Cograph 


Input: | Cograph G = (V, E). 
Output: G.m (size and instance of a max-cardinality matching in G). 


If |V|=1 
G.m< 0 
Else 
If G=G,UG»2 


G.m + G1.m+Go.m 
Else 
IfG= Gy x Go 
G.m <= min{G.m 4+ |Va|, Goam+ |Vil, |(Yi| + |Vel) /2]} 


REMARKS 


R15: The right-hand side of the final assignment in Algorithm 10.4.8 is obtained by 
simplifying the more straightforward but less efficient formula below, wherein k denotes 
the number of matching edges with one endpoint in each of the subgraphs G; and Go. 


{k + min {G.m, |(|Vi| — #)/2]} + min {G2.m, |(|Va| — k)/2\}} 


max 
0<k<min{|Vi|,|V2|} 


R16: The hamiltonian cycle and hamiltonian path problems can be solved in linear 
time on cographs. Weighted versions of the independent set, clique, and dominating set 
problems are also solvable in linear time on cographs by extending Algorithms 10.4.5 
through 10.4.7. However, weighted versions of the matching and hamiltonian problems 
do not appear to be solvable in linear time on cographs (although they are solvable in 
polynomial time). Intuitively, the reason is that the cross product operation adds too 
many edges, where each edge potentially has a different weight. 


EXAMPLE 


E5: Algorithms 10.4.5 through 10.4.8 are demonstrated on the cograph G shown 
in Figure 10.4.4. T denotes the tree decomposition of G, and each 4-tuple exhibits 
values for G.i,G.c,G.d, and G.m. The maximum independent set has size 3, for ex- 
ample {a,b,c}. The maximum clique has size 5, given by {c,d,e,g,h}. The minimum 
dominating set has size 2, for example {a, f}. The maximum matching has size 4, for 


example {(a, f); (0, 9); te; h), (d, e)}. 
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Subgraph 
G a Gi = {a} 10) 
b a G2 > {b} oR Fei 
Cc g G3 = {ce} 1,1, 1,0] 
a h G4 = {d} 1 ae Ue WG 
e Gs = {e} 1,1, 1, 0} 
Ge = {f} 1,1,1,0 
Gz = {g} 1,1,1,0] 
Gg = {h} 1,1, 1,0] 
Gg = G1 UGo 2,1, 2,0] 
Gio = G3 x G4 125 dll 
Gi1 = G7 xX Gg 12 de 
Gi2 = Gio x G5 1,3, oe 1] 
Gi3 = Gg UG11 Dynes 
Gia = Go UG12 3,3, 3, 1] 
Gi5 = Gi3 X G4 3,5, 2,4| 


abcdefagh 


Figure 10.4.4: Illustrating Algorithms 10.4.5 through 10.4.8. 


10.4.5 Algorithms on Cliquewidth-k Graphs 


DEFINITION 


D16: Let [k] denote the set of integers {1,2,...,k}. A cliquewidth-k graph is 
defined recursively as follows: 


e (base graph) Any graph G with V(G) = {v} and I(v) € [k] is a cliquewidth-k 
graph. 


e If G; and G are cliquewidth-k graphs and i, 7 € [k] with i 4 7, then: 


— The disjoint union G' U G2 is a cliquewidth-k graph. 

— The graph (G1);xj is a cliquewidth-k graph, where (G1); is formed from 
G, by adding all edges (v1, v2) such that I(v,) = 7% and I(v2) = 7. 

— The graph (G);-,; is a cliquewidth-k graph, where (G'1),;-,; is formed from 
G1 by switching all vertices with label 7 to label 7. 


Maximum-Cardinality Independent Set of a Cliquewidth-k Graph 


NOTATION: Algorithm 10.4.9 below uses the following notation: 


e G[S] = max-cardinality independent set that contains only labels from S C [k]. 


e G.max = max-cardinality independent set. 
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Algorithm 10.4.9: Maximum-Cardinality Independent Set in a 
Cliquewidth-k Graph 


Input: — Cliquewidth-k graph G = (V, E). 
Output: G.maz (size and instance of a max-cardinality independent set in G). 
If V = {v} 
For all S' C [k] 
Iflwv)eS 
G[Ss] <1 
Else 
G[S] — 0 
Else 
If G=G,UG, 
For all S' C [k] 
G[S] — G4[S] + G2[S] 
Else 
IfG= (Gi)ixj 
For all S' C [k] 
G[S] <— max{Gi[S — {a}], GilS — {7}]} 
Else 
IfG= (Cpe 
For all SC [k] 
Ifyjes 
G[S] — Gi[S U {i}] 
Else 
G[S] — Gi[S — {2}] 
G.max + max{G[S] : SC [k]} 


EXAMPLE 


E6: Algorithm 10.4.9 is demonstrated on the cliquewidth-3 graph G shown in Figure 
10.4.5. T denotes the decomposition tree of G. Each 8-tuple consists of G[0], G[{1}], 
G[{2}], G[{3}], G[{1, 23], G[{1, 3}], G42, 3}], and G[{1, 2,3}]. The maximum indepen- 
dent set has size 3, given by either {a,c,e} or {b,d, f}. 


A Subset of the MSOL Expressions for a Graph 


Many problems including variations of independent set, dominating set, clique, and 
m-vertex colorability (for any fixed m) can be solved in linear time on cliquewidth-k 
graphs, provided that a decomposition tree is known. These problems are all expressible 
using a certain subset of the MSOL expressions. 


DEFINITION 


D17: The MSOL’ set of expressions for a graph G = (V, FE) is the subset of MSOL 
expressions restricted to variables v; with domain V, e; with domain E, and V; with 
domain 2”. The MSOL/ set contains primitive predicates such as v; = v;, Incident(v, 
ej), and vu; € V;. MSOL’ permits the logical operators (=, A, V) and quantifiers (3, 
V). Thus, MSOL’ is the same as MSOL without the edge-set variables E; and without 
primitive predicates such as e; € E; that refer to edge-set variables. 
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Subgraph 

Gi = {a} [0,0,1,0,1,0,1,1 
G2 = {b} [0, 1, 0,0, 1,1, 0,1 
Ga =e} [0,0, 1,0, 1,0, 1,1 
Ga = {d} [0,0,0,1,0,1, 1,1 
Gs = {te} [0, 1,0,0, 1,1, 0,1 
Get Fh [0,0,0,1,0,1,1,1 
G7 = G,UG2 [Ole 2 
Gg = G4aUGs [Oy tes Tees 12 
Go = G3 UG7 [0, 1, 2,0, 3,1, 2,3 
Gio = Ge U Ge [0, 1,0, 2,1, 3, 2,3 
Git = (Go)1x2 (0,:152:0,2.1, 9.2 
Gi2 = (Gio)1x3 [Oj 10, 212, 9.8 
Gi3 = Gi, UGi2 [0, 2, 2, 2,3, 3, 4,4 
Gia = (G13)2x3 [0,2 O38 


Figure 10.4.5: Maximum-cardinality independent set in a cliquewidth-3 graph. 


FACT 


F10: Every MSOL’-expressible problem can be solved in linear time on any class of 
cliquewidth-k graphs [CoMaRo00], provided that either there exists a linear time de- 
composition algorithm for the class (as for cographs) or a decomposition tree is provided 
as part of the input. This statement holds for variations of each MSOL’ problem that 
involve existence, optimum cardinality or total weight, counting the number of solu- 
tions, etc. Once a problem is expressed in MSOL’, a linear-time dynamic programming 
algorithm can be created mechanically. 


REMARKS 


R17: Observe that the MSOL expressions given in §10.4.3 for IndependentSet(V;), 
Clique(V;), DominatingSet(V,), and VertexColorable,,(Vi, ..., Vm), are also MSOL’ 
expressions. However, the MSOL expressions given for Matching(£), Connected(£}), 
HamCycle(£)), and HamPath(£,) are not in MSOL’. 


R18: Some problems such as variations of matching and hamiltonicity do not appear 
to be expressible in MSOL’, and it is not known whether these problems can be solved 
in linear time on cliquewidth-k graphs. However, such problems can often be solved 
in polynomial time, given the decomposition tree. Polynomial time is achieved by 
constructing a polynomial-size data structure that corresponds to each node in the tree 
decomposition (cf. [Wa94], [EsGuWa01]}). 


R19: Algorithms on cliquewidth-k graphs can be adapted to solve the same problems 
on related classes such as k-NLC graphs [Wa94]. 
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10.4.6 Algorithms on k-HB Graphs 


DEFINITION 


D18: k-HB (homogeneous balanced) graphs are graphs for which there is a 
particular O(n**?)-time top-down decomposition algorithm that constructs a pseudo- 
cliquewidth-(k+2*) balanced decomposition. (Also see §2.4.1 and [Jo03], [BoJoRaSp04].) 


FACTS 


F11: Every k-HB graph can be composed from single vertices using only the operation 
G = Gi XB,n Gg. Here Gy and G2 denote child subgraphs, each |V;| < 2-|V|/3, B = 
(Vg,Epz) is a bipartite graph with Vg = Z, U Z and Eg C Z X Zo, |Zi| < k, |Zo| < 
2" h: V > Vp is a mapping with each h(V;) C Z;, and (a, y) € E iff (h(x), h(y)) € Ep 
for alla € V; and y € Vo. 


F12: This k-HB decomposition leads to polynomial-time dynamic programming al- 
gorithms for many problems on k-HB graphs, using recursion (top-down) rather than 
dynamic programming (bottom-up). Each algorithm’s running time is polynomial be- 
cause at each node of the decomposition it evaluates O(1) parameters, each of which 
produces O(1) recursive calls on smaller subproblems. Also, the decomposition has 
O(log |V|) height, hence |V|?“™ nodes. [Jo03], [BoJoRaSp04] 


Maximum-Cardinality Independent Set in a k-HB Graph 


NOTATION: Algorithm 10.4.10 below uses the following notation: 
e G[S] = max-cardinality independent set that contains only vertices in S CV. 


e G.indep = max-cardinality independent set. 


Algorithm 10.4.10: Maximum-Cardinality Independent Set in 
a k-HB Graph 


Input: k-HB graph G = (V, E). 
Output: G.indep (size and instance of a max-cardinality independent set in G). 
If |V|=1 
G[S] < |S| 
Else 
If G=Gi XB Ge 
G[S] < max{Gi[T] + Ge[U] : X C 
(X x Y)N Eg =9, /a— 


G.indep = G[V] 
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EXAMPLE 


E7: We demonstrate Algorithm 10.4.10 on the 2-HB graph G shown in Figure 10.4.6. 
Note that G = G1 Xpi.n Go, where G1, Gz, B, and h are as shown. The top-level 
computations are summarized on the right. The maximum independent set has size 4, 
and the explicit solution is {r,t, w, y}. 


d a i M Mg Recursive calls 
SSS Se 
PFE ESAS X=053T=0=>G,[T]=0 
uw Lf Ye X={a} ST ={r,t} > Gill] =2 
X={b} => T = {¢,s,u} > Gi[T] =3 
X = {a,b} => T = {¢,r,s,t,u} > Gi[T] =3 
q r s t U 
Gy Y=0=5S3U=05 G)[U] =0 
Y = {ch} > U = {v,2,z} > Ge[U] =3 
vu Ww f8 Yy 2 Y = {d} > U = {w,y} > G2[U] = 2 
Go ° 7 : e Y = {c,d} > U = {v,u,2,y,z} > Go[U] = 3 
a b Compatible subsets 
A= {a, b} 
B A X=9,Y =0>G,(T]4+ G[U] =0+0=0 
Zo = {c,d} X=0,Y = {c} > Gi[T] + Go[U] =04+3=3 
e od X=0,Y ={d} > Gi[T] + GU] =0+2=2 
xX 0,Y {c, d} Gi[T] + Go[U] =04+3=3 
h(r) = h(t) =a X = {a}, Y =90> Gi[T] + Ge[U] =24+0=2 
h(q) = h(s) = h(u) = 6 X = {a}, Y = {d} > Gi[T] 4+ Ge[U] =242=4 
h  h(v) = h(x) = A(z) =e X ={b}, Y =05 G(T] + GU] =3+0=3 
h(w) = h(y) =d X = {a,b}, ¥ =0> Gi[T] + G2[U] =3+0=3 
Figure 10.4.6: Maximum-cardinality independent set in a 2-HB graph. 
REMARK 


R20: The maximum-cardinality clique and m-vertex colorability problems can also be 
solved in polynomial time on k-HB graphs. However, the chromatic number, dominating 
set, and hamiltonian problems are not known to be solvable in polynomial time on k-HB 
graphs. Maximum matching is of course solvable in polynomial time on k-HB graphs, 
but it is not known whether this can be done more efficiently than for arbitrary graphs. 


A Subset of the MSOL’ Expressions 


Most problems that are known to be solvable in polynomial time for k-HB graphs are 
expressible in a particular predicate language whose expressions form a subset of the 
MSOL/’ expressions. 
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DEFINITION 


D19: MSOL” for a graph G = (V,F) denotes a subset of MSOL’ restricted to 
variables v; with domain V, and variables V; with domain 2”. MSOL” contains primitive 
predicates such as Adjacent(v;,v;) and v; € V;. MSOL” permits the logical operators 
(=, A, V) and quantifiers (4, V). However, these primitives and connectors cannot 
be combined in any arbitrary way; rather every MSOL” expression must possess the 
following format. 


(Avi) or (AVin) ((Vu1) Fo(v1 EVi,...,U1€ Vin) 
A(Vv2)(Vu3) (Adjacent (v2, v3) > A1<i<j<mFij(v2 € Vi,u3 € Vj)) 
A(Vv4)(Vus5)(> Adjacent(va, vs) 4 A1<si<j<mFi; (va € Vi, Us € Vj))) 


REMARK 


R21: In Definition D19, each Fo, each Fj;, and each Fi, is an arbitrary formula that 
combines the indicated primitive predicates using operators =, A, and V. If any of these 
formulas is identically true, it may be omitted. 


EXAMPLE 


E8: The MSOL expressions for IndependentSet, Clique, and VertexColorable,, can 
be rewritten as equivalent MSOL” expressions as shown below. However, other MSOL 
expressions such as DominatingSet do not appear to be expressible in MSOL”. 


IndependentSet = (AV,)(Vv2)(Vv3) (Adjacent (v2, v3) 4 a(vo € Vi A v3 € V1)) 
Clique = (4V,)(Vv4)(Vu5)(4 Adjacent(v4, vs) 4 a(u4 € Vi A us € Vi)) 
VertexColorable,, = (4Vi)...(4Vin)((Vo1) (01 € Vi V... Vu € Vin) 

A(Vv2) (V3) (Adjacent(va, v3) a Ni<i<m7(v2 EV; Av3 € V;))) 


“—~ 


FACT 


F13: Every MSOL”-expressible problem can be solved in polynomial time when 
the input graph is restricted to any class of k-HB graphs [Jo03], [BoJoRaSp04]. This 
includes every cliquewidth-k& graph, even if its decomposition tree is not provided as part 
of the input. Once a problem is expressed in MSOL”, the polynomial-time recursive 
algorithm can be created mechanically. 
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INTRODUCTION 


In 1965, L. A. Zadeh introduced the concept of a fuzzy subset of a set as a way for 
representing uncertainty, [Za65]. Let S denote a set. A fuzzy subset of S is a function 
of S into the closed interval [0,1]. His seminal paper described fuzzy set theory and 
consequently fuzzy logic. The purpose of Zadeh’s paper was to develop a theory which 
could deal with ambiguity and imprecision of certain classes or sets in human thinking, 
particularly in the domains of pattern recognition, communication of information, and 
abstraction. This theory proposed making the grade of membership of an element in a 
subset of a universal a value in the closed interval [0, 1]. 

Zadeh’s ideas have found applications in engineering, computer science, artificial 
intelligence, decision analysis, pattern recognition, management science, operation re- 
search, robotics, and other areas. All areas in mathematics have been touched by fuzzy 
set theory. In this paper, we concentrate on both theoretical and applied issues con- 
cerning fuzzy graphs. 
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Kauffman was the first to define a fuzzy graph, [Ka73]. However, the cornerstone 
papers for the development of graph theory were by Rosenfeld [Ro75] and by Yeh and 
Bang, [YeBa75]. In [Ro75], Rosenfeld presented fuzzy analogs of several basic graph- 
theoretic concepts, including subgraphs, paths and connectedness, cliques, bridges and 
cutnodes, forests and trees. The paper [YeBa75] was partly motivated from the desire 
to extend the existing theoretical techniques at the time to data graphs directly rather 
than to threshold graphs 


10.5.1 Definitions and Basic Properties 
Let S be a set and pu a fuzzy subset of S. 


DEFINITIONS 


D1: The set of all fuzzy subsets of S is called the fuzzy power set of S and is denoted 
by FP(S). 


D2: Let t € [0,1]. Then p’ is defined to be the set {x € S| p(x) > t}. 
D3: The set ju’ is called a level set or the t-level set of pu. 


D4: The set {x € S|u(a) > 0} is called the support of yu and is denoted by Supp(jz). 


TERMINOLOGY: Throughout we use the notation V for supremum and A for infimum. 


D5: Let p,v be fuzzy subsets of S. Then we write w C v if u(x) < v(x) for allae S 
and we write w C v if w C v and there exists « € S' such that p(x) < v(x). 


D6: We define the fuzzy subsets 4 Uv and wv of S by for all  € S,(uUv)(x2) = 
(a) V v(e) and (uA v)(z) = p(x) Av(2). 


D7: The fuzzy subset p° of S is defined by for all x € S,w°(x) = 1 — p(z). 


TERMINOLOGY: We consider uw Uv and wv to be the union and intersection of pu 
and v, respectively, while yw is considered to be the complement of ys in S. 


Ds: A fuzzy relation of a set S into a set T is a fuzzy subset of the set of ordered 
pairs S x T. 


D9: If p is a fuzzy relation of S into T and o is a fuzzy relation of T into a set 
W, then the composition of p with o, written poo, is defined by for all (x,w) € 
Sx W,(poo)(x,w) = V{p(a,y) No(y,w) | ye Th. 


D10: If p is a fuzzy relation of a set S into S, we say that p is a fuzzy relation on S 
and define p? = po p. Let p be a fuzzy relation on S and n any positive integer greater 
than or equal to 2. We let p! = p and define p” recursively by p” = po p” +, where 
p”—' is assumed to have been defined. We define the fuzzy relation on p® on S' by for 


all x,y € S,p™(a,y) = V{p*(a,y) | k= 1,2,...}- 
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D11: A graph is a pair (V, R), where V is a set and R is a relation on V. The elements 
of V are thought of as vertices of the graph and the elements of R are thought of as the 
edges. 


D12: A fuzzy graph G = (V, py, p) is a triple, where V is a nonempty set, pu is a fuzzy 
subset of V, and p is a fuzzy relation on V such that for all x,y € V, p(x, y) < u(x) Ap(y). 
FACT 

F1: If (V,y,:) is a fuzzy graph, then (y‘, p*) is a graph for all t € [0,1] and in fact is 
a subgraph of (V,Supp(p)). 

DEFINITIONS 


D13: Let G = (V,,p) be a fuzzy graph. Then a fuzzy graph H = (V,v,7) is called a 
partial fuzzy subgraph of G ifv C wand Tt Cp. 


D14: The fuzzy graph H = (P,v,7T) is called a fuzzy subgraph of G induced by P if 
PCV,v(x) = p(x) for all « € P and r(z,y) = p(x, y) for all x,y € P. 


TERMINOLOGY: If the set of vertices V is understood, we sometimes write G = (w, p) 
for a fuzzy graph. It can be shown that if (v,7) is a partial fuzzy subgraph of (1, p), 
then T° C p®. 


D15: Let G = (u,p) be a fuzzy graph. Then a partial fuzzy subgraph of (v,r) of G 
is said to span G if u = v. In this case, we call (v,T) a spanning fuzzy subgraph of 
(H, p). 


10.5.2 Paths and Connectedness 
Let G = (V, p, p) be a fuzzy graph. 


DEFINITIONS 


D16: A path P in G is a sequence of distinct vertices 19,21, ...,2n (except possibly xo 
and z,) such that p(#;-1,7;) > 0,i = 1,...,n. Here n is called the length of the path. 
The consecutive pairs are called the edges of the path. 


D17: The diameter of x,y € V, written diam(z, y), is the length of the longest path 
joining zx to y. 


D18: Let P be a path. The strength of P is defined to be A?_, p(ai-1, 2;). In words, 
the strength of a path is defined to be the weight of the weakest edge. We call P a 
cycle if x9 = %, and n > 3. 


D19: Two vertices that are joined by a path are called connected vertices. 


FACT 


F2: The notion of connectedness is an equivalence relation. 
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DEFINITIONS 


D20: The equivalence classes of vertices under the connected relation are called con- 
nected components of the given fuzzy graph. They are just its maximal connected 
partial fuzzy subgraphs. 


D21: A strongest path joining any two vertices x,y has strength p™(x,y). We some- 
times refer to this as the strength of connectedness between vertices. 


D22: Let G = (p, p) be a fuzzy graph, let x, y be two distinct vertices, and let G’ be the 
partial fuzzy subgraph of G obtained by deleting the edge (x,y). That is, G’ = (, 0’), 
where p'(x,y) = 0 and p’ = p for all other pairs. We call (z,y) a bridge in G if 
p'(u,v) < p®(u,v) for some u,v in V. In words, if deleting the edge (x, y) reduces the 
strength of connectedness between some pair of vertices. 


FACTS 


F3: Let x and y be two vertices in G. Then (2, y) is a bridge if and only if there exist 
vertices u,v such that (x,y) is an edge of every strongest path from u to v. 


F4: Let (1, p) be a fuzzy graph. Then the following statements are equivalent. 
(1) (x,y) is a bridge. 

(2) p'(x,y) < p(z,y). 

(3) (a, y) is not the edge of any weakest cycle. 


DEFINITION 

D23: Let w be any vertex and let G’ be the partial fuzzy subgraph of G obtained 
by deleting the vertex w. That is, G’ = (u’, p’) is the partial fuzzy subgraph of G such 
that p/(w) = 0, = pw’ for all other vertices, p’(w,z) = 0 for all vertices z, and p’ = p 
for all other edges. We call w a cutvertex in G if p’°(u,v) < p©(u,v) for some u,v 


in V such that u 4 w & v. In words, if deleting the vertex w reduces the strength of 
connectedness between some other pair of vertices. 


FACT 


F5: A vertex w is a cutvertex if and only if there exist vertices u,v distinct from w 
such that w is on every strongest path from wu to v. 


DEFINITION 


D24: A fuzzy graph G is called nonseparable or a block if it has no cutvertices. 


REMARK 


R1: Although in a fuzzy graph, a block may have bridges, this cannot happen for crisp 
graphs. 
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10.5.3. Forests and Trees 


DEFINITIONS 
D25: A crisp graph that has no cycles is called acyclic or a forest. 
D26: A connected forest is a tree. 


D27: A fuzzy graph is called a forest if the graph consisting of its nonzero edges is a 
forest, and a tree if this graph is also connected. 


D28: If G = (u,p) is a fuzzy graph, we call G a fuzzy forest if it has a partial fuzzy 
spanning subgraph F’ = (u,7) which is a forest, where for all edges (x, y) not in F, i-e., 
such that 7(xz,y) = 0, we have p(xz,y) < 7™(a,y). In words, if (#,y) is in G, but is 
not in F, there is a path in F between x and y whose strength is greater than p(x, y). 
Clearly, a forest is a fuzzy forest. 


FACTS 
F6: A fuzzy graph G is a fuzzy forest if and only if in any cycle of G there is an edge 


(x,y) such that p(z,y) < p’'®(x,y), where G’ = (1, p’) is the partial fuzzy subgraph 
obtained by the deletion of the edge (x,y) from G. 


F7: Let G be a fuzzy graph. If there is at most one strongest path between any two 
vertices of G, then G is a fuzzy forest. 


REMARK 


R2: Note that the converse of Fact F7 does not hold. 


DEFINITION 


D29: Let G = (u,p) be a fuzzy graph. We call G a tree if and only if (Supp(w), 
Supp(p)) is a tree and we call Ga _ cycle if and only if (Supp(j), Supp(p)) is a cycle. 


FACTS 


F8: Let G = (,p) be a cycle. Then G is a fuzzy cycle if and only if G is not a fuzzy 
tree. 


F9: Let G = (,p) be a fuzzy graph. If there exists t € (0,1] such that (Supp(w), p’) 
is a tree, then G is a fuzzy tree. Conversely, if G is a cycle and G is a fuzzy tree, then 
there exists t € (0, 1] such that (Supp(w), p‘) is a tree. 


We next give some properties of fuzzy graphs. 


F10: Let G = (1, p) be a fuzzy graph such that (Supp(), Supp(p)) is a cycle. Then a 
vertex is a cutvertex of G if and only if it is a common vertex of two bridges. 


F11: If w is a common vertex of at least two bridges, then w is a cutvertex. 
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F12: If (u,v) is a bridge, then p~(u, v) = p(u, v). 


F13: If G = (u,p) is a fuzzy tree and (Supp(), Supp(p)) is not a tree, then there 
exists at least one edge (u,v) in Supp(p) for which p(u,v) < p©(u, v). 


DEFINITION 


D30: A complete fuzzy graph is a fuzzy graph G = (,p) such that p(u,v) = 
p(u) A uv) for all u,v € V. 


FACTS 
F14: If G = (,p) is a fuzzy tree, then G is not complete. 
F15: If G is a fuzzy tree, then the internal vertices of F are cutvertices of G. 


F16: Let G = (y,p) be a fuzzy graph. Then G is a fuzzy tree if and only if the 
following conditions are equivalent for all u,v EV: 

(1) (u,v) is a bridge. 

(2) p®(u,v) = plu, 0). 


F17: <A fuzzy graph is a fuzzy tree if and only if it has a unique maximum fuzzy 
spanning tree. 


REMARK 


R3: If G is a fuzzy graph which is not a fuzzy tree and T is the maximum fuzzy 
spanning tree of G, then there is at least one edge in T which is not a bridge and edges 
not in T are not bridges of G. 


FACT 


F18: If G = (u,p) is a fuzzy graph with Supp(w) = V and |V| = p, then G has at 
most p— 1 bridges. 


REMARK 


R4: It follows that if G = (wy, p) is a fuzzy graph with T a maximum fuzzy spanning 
tree of G, then end vertices of T are not cutvertices of G. Also, every fuzzy graph has 
at least two vertices which are not cutvertices. 


10.5.4 Fuzzy Cut Sets 


DEFINITIONS 


D31: Let G = (u,p) be a fuzzy graph. Let x € V and let t € [0,1]. Define the fuzzy 
subset 2, of V by Vy € V,a:(y) = 0 if y A a and 2,(y) =¢ if y = x. Then 2; is called 
a fuzzy singleton in V. If (x,y) € V x V, then (2, y)p(2,y) denotes a fuzzy singleton 
inV x V. 
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D32: Let G = (u,p) be a fuzzy graph and let E be a subset of Supp(p). 

(1) {(@, Wpw,y) |(@, y) € E} is called a cut set of (1, p) if EF is a cut set of (Supp(u),Supp(p)). 
(2) {(2, ¥) p(a,y)|(@ y) © E} is called a fuzzy cut set of (yu, ¢) if du,v € Supp(u) such 
that p’°(u,v) < p~(u,v), where p’ is the fuzzy subset of V x V defined by p’ = p on 
Supp(p) and p'(z,y) = OV(2,y) € E. 


When F is a singleton set, a cut set is called a bridge and a fuzzy cut set a fuzzy 
bridge. 


FACTS 


F19: Let G = (yu, p) bea fuzzy graph. Let V = {v,...,un} and C = {(v1, v2), (v2, U3), 
(Un—1, Un); (Un, V1) 57 = 3. 

(1) Suppose that C C Supp(p) and that V(v;, vx) € Supp(p)\C, p(v;, ve) < V{p(vi, vi41)| 
i =1,...,n}, where v,;41 = v1. Then either p is a constant function on C or G has fuzzy 
bridge. 

(2) Suppose that 0 4 Supp(p) Cc C. Then G has a bridge. 


F20: Let G = (u,p) be a fuzzy graph. Suppose that the dimension of the cycle space 
of (Supp(u), Supp(p)) is 1. Then G does not have a fuzzy bridge if and only if G is a 
cycle and p is a constant function. 


One can find a discussion of fuzzy chords, fuzzy cotrees, and fuzzy twigs in [MoNa00]. 


10.5.5 Fuzzy 1-Chain with Boundary 0, Fuzzy 
Coboundary, and Fuzzy Cocycles 


DEFINITION 


D33: Let (x,y) € V x V. Then (x,y) is called exceptional in G if and only if there 
exists a cycle C C V x V such that (2,y) € C and (z,y) is unique with respect to 
p(xz,y) = A{p(u, v)|(u, v) € C}. Let FE = {(2,y) © V x V|(z,y) is exceptional}. Let pz 
be a fuzzy subset of V x V defined by pg = p on V X V\E and px(a, y) = OV(a,y) € E. 


FACT 


F21: Let S, = {(z,y):|(v,y) € Supp(p),t € (0,1]} U {0,|t € (0,1]}. Let addition of 
elements of Supp(p) be a formal addition modulo 2, i.e., V(x, y), (u,v) € Supp(p), we 
write (x,y) + (u,v) if (x,y) A (u,v) and (a, y) + (u,v) = 0 if (wy) = (u,v). Then 
V(x, Y)t, (u,v)s € Sp, (x, yt oF (u, v)s = (x, y) = (u, v) irs: Also, V(2, yt € Sp, (x, yt oF 
Os = (2, y)tas = Os + (x, y)_ and 0; + Os = Oras. Clearly, (S,,+) is a commutative 
semigroup with identity 01. 


DEFINITION 


D34: Let S be a set of fuzzy singletons of a set W. Then foot(S) = {w € Wlu; € S}. 
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EXAMPLE 
El: Since Zz = {0,1} is a field of integers modulo 2,1 + 1=0. We have that 
S- Cys Ui) ateiad + 3 €,(ai, Ye) eas 


= VE + Aes w) pein (2, Ys) oon) 
(en Cea 


if e; = 1 and (2;, ¥:) p(x;,y;) = Op(a.y,) if €¢ = 0, €:, €; € Za. We have that 


» €; (Xa, Ys) plaasut) = (S° ei(xi, Yi) ms 


where m = A;{p(2i, yi) }- 


DEFINITIONS 


D35: Let G = (u,p) be a fuzzy graph. Then 

(1) So €i(2i, Yi) p(e;,y;) 18 a 1-chain with boundary 0 in G, where (2;,y;) € Supp(p) if 
and only if $> e;(a;, y;) is a 1-chain with boundary 0 in (Supp(),Supp(p)); 

(2) 0 €i(@i, Yi) p(ai,y;,) 18 fuzzy 1-chain with boundary 0 in (yu, p), where (2;,y:) € 
Supp(pz) if and only if $5 e;(#;, y;) is a 1-chain with boundary 0 in (Supp(),Supp(pz)). 


D36: A (fuzzy) 1-chain with boundary 0 in G is called a (fuzzy) cycle vector. 


D37: Let G = (u,p) be a fuzzy graph. Then 

(1) dJei(@i, Yi) p(e;,y;) 8 a coboundary of G, where (x;,y;) € Supp(p) if and only if 
YS) e:(@i, yi) is a coboundary of (Supp(/1),Supp(p)); 

(2) So €i (i, Ys) p(a;,y;) 18 fuzzy coboundary of G, where (xj, y;) € Supp(pz) if and only 
if > e;(@;, ys) is a coboundary of (Supp(s),Supp(pz)). 


D38: S’ CS, is called a (fuzzy) cocycle of G if and only if foot(S”) is a cocycle of 
((Supp(#),Supp(pz))) (Supp(“),Supp()). 


D39: Let G = (u,p) be a fuzzy graph. Then 
(1) the set of all (fuzzy) cycle vectors of G is called the (fuzzy) cycle set of G; 
(2) the set of all (fuzzy) coboundaries of G is called the (fuzzy) cocycle set of G. 


TERMINOLOGY: Let CS(1, 2), FCS(u, p),CoS(u, p), and FCoS(p,p) denote the cycle 
set, the fuzzy cycle set, the cocycle set, and the fuzzy cocycle set of the fuzzy graph 
G = (pu, p), respectively. When the fuzzy graph (1, ¢) is understood, we sometimes write 
CS,FCS,CoS, and FCoS. 


REMARKS 


R5: It follows that CS,FCS,CoS, and FCoS are not necessarily vector spaces over 
Z2. The details can be found in [MoNa00]. 


R6: The concepts of (fuzzy) twigs and (fuzzy) chords have results similar to their 
counterparts in the crisp case. 
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R7: Clearly, CS,FCS,CoS, and FCoS are subsets of S, = {e,|e € Supp(p),t € 
(0, 1]} U {O,|t € (0,1]}. Let S be a subset S,,. We let (S) denote the intersection of 
all subsemigroups of S, which contain S. Then (S) is the smallest subsemigroup of 5S), 
which contains S. Let St = {(e1):, +... + (en)z, (ese; € St = 1,...,n;n © N}, where 
N denotes the set of positive integers. Then S* is a subsemigroup of S). 


FACTS 


F22: (CS) = (CS)t =CSU {eq + 0plea € CS, 0, € (CS)T}. (CS) has Om as its identity 
where m = V{b|0, € (CS)T}. 


F23: (FCS) =(FCS)+ = FCSU {eq + lea € FCS, 0p € (FCS)*}. (FCS) has Om as 
its identity where m = V{b|0, € (FCS)*}. 


F24: (CoS) = (CoS)t = CoS U {ea + 0plea € CoS, 0p € (CoS)*}. (CoS) has 0, as its 
identity where m = V{b|0p € (CoS)T}. 


F25: (FCoS) = (FCoS)*t = FCoS U {ea + Onlea € FCOS, 0p € (FCoS)*}. (FCoS) has 
Om as its identity where m = V{b|0, € (FCoS)T}. 


DEFINITION 


D40: Let G = (u,p) be a fuzzy graph. Then the cycle rank of G, written m(,, p), 
is defined to be m(p, p) = V{STi_, til(es)e, € CS,i = 1,...,n, {e1,..., en} is a basis for 
foot(CS)}. The fuzzy cycle rank of G, written fm/(, p), is defined to be the cycle 
rank of (1, pz). If {e1, ..., en} is a basis for foot(CS) such that m(p, p) = >>;_, ti, where 
(e:)t, €CS,i = 1,...,n, then {e1, ...,e,} is called a cycle basis of (CS). If {e1,...,en} is 
a basis for foot(FCS(p, p)) such that fm(u, p) = >7_, ti, where (e;)1, € FCS(p, p),i = 
1,...,n, then {e1,...,en} is called a fuzzy cycle basis of (CS(, p)). 


FACTS 


F26: Let {e1,...,en} be a cycle basis of (CS). Then for all e, € CS, there is a reordering 
of €1,...,€n such that e, = (€1)t, +... + (€m)¢,,,™m <n, where t; = p(e;),i = 1,...,m. 


F27: Let {e1,...,en} be a fuzzy cycle basis of (CS(u, e)). Then for all e, € FCS(, p), 
there is a reordering of €1,...,€n such that e¢ = (€1)¢, +. + (€m)¢,,,m < n, where 
t; = p(e;),t = 1,...,m. 


DEFINITION 


D41: Let G = (yu, p) be a fuzzy graph. Then the cocycle rank of G, written m,(, p), 
is defined as follows: 


Me(H, P) = vi>- til(e:)4, € CoS,i =1,...,n, {e1,...,en}is a basis for foot(CoS)}. 
i=1 


The fuzzy cocycle rank of G, written fm-.(p,p), is defined to be the cocycle rank 
of (11, pz). If {e1,...,€n} is a basis for foot(CoS) such that me(y,p) = >>7_, ti, where 
(e;)4, € CoS,i = 1,...,n, then {e1, ..., en} is called a cocycle basis of (CoS). If {e1, ...,en} 
is a basis for foot FCoS (1, p) such that fm(p, p) = >o¥_, ti, where (e;)1, € FCxS(p, p),7 
=1,...,n, then {e1,...,e,} is called a fuzzy cocycle basis of (CS(, p)). 
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FACTS 


F28: Let {e1,...,en} be a cocycle basis of (CoS). Then for all e, € CoS, there is a 
reordering of €1,...,€n such that e; = (€1)t; +... + (€m)¢,,,m <n, where t; = p(e:),i= 


Ly .3 Ms 


F29: Let {e1,...,en} be a fuzzy cocycle basis of (CS(, e)). Then for all e, € FCoS, 
there is a reordering of €1,...,€n such that e¢ = (€1)¢, +... + (€m)¢,,,™ <n, where 
t; = p(e;),t=1,...,m. 


10.5.6 Fuzzy Line Graphs 


The line graph, L(G), of a (crisp) graph G is the intersection graph of the set of 
edges of G. Hence the vertices of L(G) are the edges of G with two vertices of L(G) 
adjacent whenever the corresponding edges of G' are. We present the notion of a fuzzy 
line graph. Let G = (V,X) and G’ = (V’,X’") be graphs. If u is a fuzzy subset of V 
and p is a fuzzy subset of V x V such that (1, p) is a fuzzy graph with Supp(p) C X, 
we call (uu, p) a partial fuzzy subgraph of G. 


DEFINITIONS 


D42: Let (u,p) and (y’, p’) be partial fuzzy subgraphs of G and G’, respectively. Let 
f be a one-to-one function of V onto V’. Then 

(1) f is called a (weak) vertex-isomorphism of (1,p) onto (w’,p’) if and only if 
Vu € V, (uv) < w’(f(v)) and Supp(u’) = f(Supp(1))) e(v) = w'(F(e)); 

(2) f is called a (weak) line-isomorphism of (11, p) onto (u’, p’) if and only if V(u, v) € 
X,(p(u,v) < p'(f(u), f(v)) and Supp(o’) = {(f(u), f(v))|(u,v) € Supp(p)}) p(u,v) = 
P'(f(u), f(r). 


D43: If f is a (weak) vertex-isomorphism and a (weak) line-isomorphism of (y, p) 
onto (1, 9’), then f is called a (weak) isomorphism of (41, e) onto (1’, p’). If (4, p) is 
isomorphic to (y’, p’), then we write (1, p) ~ (’, p’). 


D44: Let G = (V,X) be a graph, where V = {v,...,Un}. Let S; = {vj, 2i1, ..., Vig, }, 
where 2; € X and 2;,; has y; as a vertex, j =1,...,q@37=1,...,n. Let S = {$1,..., Sp}. 
Let T = {(5;,5;)|S;5; € S,5;S; £0,i A j}. Then Z(S) = (S,T) is an intersection 
graph and G ~ Z(S). Any partial fuzzy subgraph (v, y) of [(S) with Supp(y) = T is 
called a fuzzy intersection graph. 


D45: Let (u,p) be a partial fuzzy subgraph of G. Let I(S) be the intersection graph 
described above. Define the fuzzy subsets v,y of S and T, respectively, as follows: 

VS; € S,1(S;) = w(vi); 

V(Si,55) € T, V(Si, $5) = p(vi, v4). 


FACT 
F30: Let (u,p) be a partial fuzzy subgraph of G. Then 


(1) (4,7) is a partial fuzzy subgraph of Z(S); 
(2) (up) = (4,9): 
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REMARK 


R8: Let Z(S) be the intersection graph of (V, X). Let (i,y) be the fuzzy intersection 
graph of Z(S') as defined above. We call (v,y) the fuzzy intersection graph of (1, p). Fact 
F30 shows that any fuzzy graph is isomorphic to a fuzzy intersection graph. 


The line graph L(G) of G is by definition the intersection graph Z(X). That is, 
L(G) = (Z,W), where Z = {{x} U {uz,vz)|@ € X,Uz,ve € Vix = (Uz, Vz)} and 
W = {(Sz, Sy)|Sz 1 Sy 4 0,2,y € X,x # y} and where S, = {{x}U {uz, ve}, x € X. 


DEFINITION 


D46: Let (yu, p) be a partial fuzzy subgraph of G. Define the fuzzy subsets A, w of Z, W, 
respectively, as follows: 

VSz € Z, (Sz) = p(x); 

V(Sz, Sy) € W,w(Sz, Sy) = p(x) A ply). 


FACT 


F31: (A,w) is a fuzzy subgraph of L(G) and is called the fuzzy line graph corresponding 
to (4, p). 


Every cutpoint of L(G) is a bridge of G which is not an endline, and conversely, 
[MoNa00, p. 42]. It is shown in [MoNa00] that the relationship between cutpoints in 
L(G) and bridges in G does not carry over to the fuzzy case. 


REMARK 


R9: Let (u,p) and (y’, p’) be partial fuzzy subgraphs of G and G’, respectively. If 
f is a weak isomorphism of (ju, 9) onto (w’,p’), then it can be shown that f is an 
isomorphism of (Supp(z),Supp(p)) onto (Supp(u’),Supp(p’)). If (A,w) is the fuzzy line 
graph of (1, ¢), then it can also be shown that (Supp(A),Supp(w)) is the fuzzy line graph 


of (Supp(),Supp(p)). 


FACTS 


F32: Let (A,w) be the fuzzy line graph corresponding to (4, 2). Suppose that (Supp(yz), 
Supp()) is connected. Then 

(1) there is a weak isomorphism of (4, ) onto (A,w) if and only if (Supp(),Supp(p)) 
is a cycle and yz and p are constant functions on Supp(w) and Supp(p), respectively, 
taking on the same value; 

(2) if f is a weak isomorphism of (jz, 9) onto (A,w), then f is an isomorphism. 


F33: Let (u,p) and (’, p’) be partial fuzzy subgraphs of G and G’, respectively, such 
that (Supp(),Supp(p)) and (Supp(u’),Supp(p’)) are connected. Let (A,w) and ()’,w’) 
be the line graphs corresponding to (,p) and (y’,p’), respectively. Suppose that it 
is not the case that one of (Supp(),Supp()) and (Supp(y’),Supp(p’)) is A3 and the 
other is Ky,3. If (A,w) & (1,0), then (ps, 9) and (p’, p’) are line isomorphic. 


F34: Let (7,v) bea partial fuzzy subgraph of L(G). Then (7, v) is a fuzzy line graph of 
some partial fuzzy subgraph of G if and only if V(S;, Sy) € W,v(Sz, Sy) = T(x) A T(y). 


F35: (1, ¢) is a fuzzy line graph if and only if (Supp(/),Supp(p)) is a line graph and 
V(u,v) € Supp(y), e(u,v) = w(u) A p(v). 


Section 10.5. Fuzzy Graphs 1325 


10.5.7 Fuzzy Interval Graphs 


Intersection graphs and in particular interval graphs are used extensively in math- 
ematical modeling. Applications in archaeology, developmental psychology, ecological 
modeling, mathematical sociology, and organization theory are cited in [Ro76]. These 
disciplines all have components that are ambiguously defined, require subjective evalu- 
ation, or are satisfied to differing degrees. They are active areas of applications of fuzzy 
methods. It is therefore worthwhile to study the extent that intersection graph results 
can be extended using fuzzy set theory. 

The intersection graph of a family (possibly with repeated members) of sets is a 
graph with a vertex representing each member of the family and an edge connecting 
two vertices if and only if the two sets have nonempty intersection. Generally, loops are 
suppressed. If the family is composed of intervals or is the edge set of a hypergraph, 
then the intersection graph is called an interval graph or aline graph, respectively. 

A fuzzy analog of Marczewski’s theorem [Ma45] shows that every fuzzy graph with- 
out loops is the intersection graph of some family of fuzzy subsets. However, the natural 
generalization of the Fulkerson and Gross characterization [FuGr65] of interval graphs 
fails. However, a natural generalization of the Gilmore and Hoffman characterization 
[GiHo64] holds. 

Let G = (V, ws, p) be a fuzzy graph. 


DEFINITION 


D47: A fuzzy digraph is a triple D = (V, u, 6), where p is a fuzzy subset of V and 6 
is a fuzzy subset of V x V such that d(x, y) < u(x) A u(y) for all x, y € V. We note that 
6 need not be symmetric. 


REMARK 


R10: A fuzzy graph (fuzzy digraph) can be represented by an adjacency matrix, where 
the rows and columns are indexed by the vertex set V and the x,y entry is p(x, y) 
(d(x, y)). Vertex strength can be indicated by adding a column indexed by wp and letting 
the x, ps entry be p(x). 


DEFINITIONS 


D48: For a family F of fuzzy subsets, we define the t level family of F to be 
Ft = {atla € F}. 


D49: Let a be a fuzzy subset of V. The height of a is h(a) = V{a(a)|a € V}. 


We construct a sequence of crisp level graphs in order to see how a fuzzy subset’s 
structure changes between various levels. Theorems characterizing a fuzzy property in 
terms of level set properties are significant in that such theorems demonstrate the extent 
to which the crisp theory can be generalized. To formalize this sequence of graphs, we 
define the notion of fundamental sequence. 


D50: The fundamental sequence of a fuzzy graph G = (1, p) is defined to be the 
ordered set 
fs(G@) = {u(x) > Ow € V}U {o(@, y) > Olay € VI, 


where we use the decreasing order inherited from the closed real interval [0, 1]. 
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REMARK 


R11: The first element listed in fs(G) is the maximal vertex strength while the last 
element is the minimal nonzero edge strength. 


DEFINITION 


D51: Let F = {aj,...,a,} be a finite set of fuzzy subsets of a set V. The fuzzy 
intersection graph of F is the fuzzy graph Int(F) = (,), where p : F — [0,1] is 
defined by pu(a;) = h(a;),7 =1,...,n and p: F x F — [0,1] is defined by 


h a, 1a; if 7 ), 
p(a, aj) = ( a #3 for 7,7 = 1,...,n. 
Oifi= 4, 


REMARKS 


R12: ~~ F in the previous definition is considered a crisp set of vertices. The notion 
of a fuzzy intersection graph is different here than in the previous definition and so a 
different notation is used. Recall that every graph G = (V, X) is an intersection graph. 
For all x € V, let S, denote the union of {x} with the set of all edges incident with «x. 
It follows that G is isomorphic to the intersection graph of {S,|2 € V}. 


R13: If F = {a1,...,@n} is a family of fuzzy subsets of a set V and ¢ € [0,1], then 
Int(F‘) = ( Int(F))'. The graph Int(F‘) has a vertex set representing a; € F if and 
only if h(a;) > t. The set {(a;)’, (aj)'} is an edge of Int(F‘) if and only if i A j and 
h(a; a;) > t. These conditions also characterize the graph (Int(F))'. In particular, 
if F is a family of crisp subsets of V, then the fuzzy intersection graph and the crisp 
intersection graph definitions coincide. 


We next state the fuzzy analog of Marczewski’s theorem [Ma45]. 
FACT 


F36: If G = (u,p) is a fuzzy graph (without loops), then for some family of fuzzy 
subsets of F,G = Int(F). 


The families of sets most often considered in connection with intersection graphs are 
families of intervals of a linearly ordered set. This class of interval graphs is central to 
many applications. 

In both the crisp and fuzzy cases, distinct families of sets can have the same intersec- 
tion graph. In particular, the intersection properties of a finite family of real intervals 
(fuzzy numbers) can be characterized by a family of intervals (fuzzy intervals) defined 
on a finite set. Therefore, as is common in interval theory, we restrict our attention to 
intervals (fuzzy intervals) with finite support. 

We next generalize two characterizations of crisp interval graphs. Both make use 
of relationships between the finite number of points which define the intervals and the 
cliques of the corresponding interval graph. 
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DEFINITION 


D52: A clique of a graph is a maximal (with respect to inclusion) complete subgraph. 


REMARK 


R14: Ifa vertex z is not a member of a clique K, then there exists « € K such that 
(x, z) is not an edge of the graph. 


DEFINITION 


D53: Let V be a linearly ordered set. 

(1) A fuzzy interval T on V is a normal, convex fuzzy subset of V. That is, there exists 
an x € V with Z(x) = 1 and the ordering w < y < z implies that Z(y) > Z(w) AZ(z). 
(2) A fuzzy number is a fuzzy interval. 

(3) A fuzzy interval graph is the fuzzy intersection graph of a finite family of fuzzy 
intervals. 


REMARK 


R15: Let G = Int(F) be a fuzzy interval graph. It follows easily that for all t € (0, 1], 
the level graph G* is an interval graph. The converse of this result is not true [MoNa00, 
Example 2.12, p. 47]. 


The Fulkerson and Gross characterization makes use of a correspondence between 
the set of points on which the family of intervals is defined and the set of cliques of the 
corresponding interval graph. We provide natural generalizations of the crisp definitions. 
It follows from fuzzy graphs that the relationship holds only in one direction. 


DEFINITION 


D54: Let G = (, p) be a fuzzy graph. 

(1) We say that a fuzzy subset K defines a fuzzy clique of G if for each ¢ € (0, 1], K* 
induces a clique of G’. 

(2) We associate with G a vertex clique incidence matrix where the rows are indexed 
by the domain of jz, the columns are indexed by the family of all cliques of G, and the 
x,K entry is K(x). 


FACTS 


F37: Suppose that G is a fuzzy graph with fs(G) = {ri,...,rn} and let K be a fuzzy 
clique of G. The level sets of K define a sequence K™! C ... C K™, where each K™ is a 
clique of G™. Conversely, any sequence kK, C... C Ky, where each K; is a clique of G™ 
defines a fuzzy clique K, where K(x) = V{ri|x € Ki}. Therefore, K is a clique of the 
t-level graph G* if and only if K = K* for some fuzzy clique K. 


We now state the fuzzy analog of Fulkerson and Gross. 


F38: Let G = (V,p) be a fuzzy graph. Then the row of any vertex clique incidence 
matrix of G defines a family of fuzzy subsets F for which G = Int(F). Further, if there 
exists an ordering of the fuzzy cliques of G such that each row of the vertex clique 
incidence matrix is convex, then G is a fuzzy interval graph. 
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F39: The converse of Fact F38 is not true [MoNa00, Example 2.13, p. 50). 


TERMINOLOGY: Let G = (V,X) be a graph and D be a directed graph. We use the 
notation (x,y) for an edge in G and (a, y) for a directed edge in D. 


DEFINITIONS 


D55: A graph is called chordal or triangulated if each cycle with n > 4 vertices has a 
chord, i.e., there exist integers 7 4 0 ork An withO <j <k—-—1<nand (a;,a%) € X. 


D56: An orientation of a graph G = (V,X) is a directed graph G4 = (V, A) that 
has G as its underlying graph. That is, (x,y) € X implies that (2, y) € A or (y,x) € A, 
but not both. 


D57: A graph G is transitively orientable if there exists an orientation of G for 
which (u,v) € A and (v,w) € A implies (u,w) € A. 


D58: The complement of a graph G, denoted by G°, is the graph with vertex set V 
and edge set consisting of those edges which are not in X. For a fuzzy graph G = (1, p), 
we let G° = (4,1 — p). 


D59: A cycle of length n ina fuzzy graph is a sequence of distinct vertices 19, 11, .--, Xn 
such that p(2o,v%,) > 0 and if 1 <i<n, then p(a;_1,2;) > 0. A fuzzy graph G = (w, p) 
is chordal if for each cycle with n > 4, 


1. there exist integers 7 4 0 or k An such that 0< j <k-1<nand 
2. p(@;, tn) > Af{p(ei-1, 21) |i = 1,2, ...,n} A p(x, tn). 


REMARK 


R16: It is easily shown that a fuzzy graph G = (w, p) is chordal if and only if for each 
t € (0, 1] the t-level graph of G is chordal. 


FACT 


F40: If G is a fuzzy interval graph, then G is chordal. 


DEFINITION 


D60:_ ~— Let G = (p,p) be a fuzzy graph with fs(G) = {ri,...,r,} and let A be an 
orientation of G™. Then the orientation of G by A is the fuzzy digraph G4 = (1, pa), 
where 


p(x, y) if (x,y) € A, 


Cr nae 
O if (a,y) € A. 


The fuzzy graph G is called transitively orientable if there exists an orientation which 
is transitive, i.c., p4((x,y)) A pallys2)) < pa((a, 2)): 
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REMARK 


R17: Let G bea fuzzy graph. The t level graph of G4 has arc set {(x, y)|p4((x, y)) > th. 
Therefore an orientation of a fuzzy graph induces consistent orientations on each member 
of the fundamental sequence of cut level graphs. Conversely, it is possible to have 
a sequence of transitively oriented subgraphs G; C G2 C Gs, where the transitive 
orientation of Gz does not induce a transitive orientation of G,, and the transitive 
orientation of G2 cannot be extended to a transitive orientation of G3. 


FACTS 


F41: Suppose that G = Int(F) is a fuzzy interval graph. Then there exists an orien- 
tation A that induces a transitive orientation of G°. 


We next state the fuzzy analog of the Gilmore and Hoffman characterization. 


F42: A fuzzy graph G = (,p) is a fuzzy interval graph if and only if the following 
conditions hold: 

(1) for all « € Supp(#) = V, p(x) = 1, 

(2) each fuzzy subgraph of G induced by four vertices is chordal, 

(3) G° is transitively orientable. 


10.5.8 Operations on Fuzzy Graphs 


By a partial fuzzy subgraph of a graph G = (V,X), we mean a partial fuzzy sub- 
graph of (vv, vx), where yy and xx denote the characteristic functions of V and X, 
respectively. We denote the edge between vertices u,v by uv rather than (u,v) in this 
section. Let (4;, 9;) be a partial fuzzy subgraph of the graph G; = (V;, X;),i = 1,2. The 
operations of Cartesian product, composition, union, and join on (f11, 1) and (2, p2) 
are given in [MoNa00]. If the graph G is formed from G,; and G2 by one of the these 
operations, necessary and sufficient conditions are given in [MoNa00] for an arbitrary 
partial fuzzy subgraph of G to also be formed by the same operation from partial fuzzy 
subgraphs of G; and G3. Recall that the Cartesian product G = G, x G2 of graphs 
Gy = (Vi, X1) and Go = (V2, X2) is given by v= Vi x V2 and 


X = {(u, u2)(u, v2)|u € Vi, wave € Xo} U {(u1, w)(viw)|w € Vo, uiv1 € Xi}. 
Let py; be a fuzzy subset of V; and p; be a fuzzy subset of X;,7 = 1, 2. 


DEFINITION 


D61: Define the fuzzy subsets 41 x 2 of V and pip2 of X as follows: 


(1) V(w1, U2) € V, (tr X fa)(U1, U2) = fa(tr) A p2(u2), 
(2) Vu € Vi, Vugve € Xe, pipa((u, u2)(u, v2)) = pa(u) A p2(uzva), 
(3) Vw € Vo, Vurur € X1, pip2((ui, w) (v1, w)) = Hew) A pi(uiri). 
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It follows easily that if G is the Cartesian product of graphs G1 and G2 and (1;, p;) 
is a partial fuzzy subgraph of G;,i = 1,2, then (f41 x f42,p1p2) is a partial subgraph 
of G. 


FACT 


F43: Suppose that G is a Cartesian product of two graphs G, and Gp. Let (,p) be 
a partial fuzzy subgraph of G. Then (y,) is a Cartesian product of a partial fuzzy 
subgraph of G; and a partial fuzzy subgraph of G2 if and only if the following three 
equations have solutions for x;, y;,2;~, and win, where Vi = {v11, Vi2,..., Vin} and V2 = 
{vai, U2 5 2085 vam } 5 
(1) xi NY; = pW(V1;, V2j),t = 1,. 7 = 1,. +5 MN; 

= 


(2) a A Zn = Alay, Vas) xg, Yon), 1,...,239,k such that vojve, € X2; 
(3) yy A Wik = p((v1a, V27) (Vin, V27)),7 =1,...,m;i, kh such that vyjvin € X41. 
REMARK 


R18: Definitions and results concerning the composition of graphs and partial fuzzy 
subgraphs follow along the same lines as for the Cartesian product. The results with 
examples can be found in [MoNa00]. The results for the union and join of graphs and 
partial fuzzy subgraphs follow differently. It can be shown, for example, that if G is a 
union of two subgraphs G; and G2, then every partial fuzzy subgraph (1, p) is a union 
of a partial fuzzy subgraph of G; and a partial fuzzy subgraph of G2, It can also be 
shown that if G is the join of two subgraphs G; and G2, then every strong partial 
fuzzy subgraph (,e) of G is a join of a strong partial fuzzy subgraph of G, and a 
strong partial fuzzy subgraph of G2. Definitions, proofs, and examples can be found in 
[MoNa00]. 


10.5.9 Clusters 


In graph theory, there are several ways of defining “clusters” of vertices. One ap- 
proach is to call a subset C of V a cluster of order k if the following two conditions 
hold: 

(a) for all vertices x,y in C,d(x,y) < k; 

(b) for all vertices z ¢ C,d(z,w) > k for some w € C, 

where d(u, v) is length of a shortest path between two vertices uw, v. 

A 1-cluster is called a clique; it is a maximal complete subgraph. That is, a maximal 
subgraph in which each pair of vertices is joined by an edge. At the other extreme, if we 
let k — co, a k-cluster becomes a connected component, that is, a maximal subgraph 
in which each pair of vertices is joined by a path (of any length). 

These ideas can be generalized to fuzzy graphs as follows: In G = (,), we call 
CCV a fuzzy cluster of order k if 


A{p*(a,y)|a,y € C} > V{A(p*(w, z)lw € Chz ¢ CH. 


Note that Cis an ordinary subset of V, not a fuzzy subset. If G is an ordinary graph, 
we have p*(a,b) = 0 or 1 for all a,b. Hence this definition reduces to 

(1) p*(a,y) = 1 for all x,y in C, 

(2) p*(w, z) =0 for all z ¢ C and some w € C. 
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Property (1) implies that for all x, y in C, there exists a path of length < k between 
x and y and property (2) implies that for all z ¢ C and some w € C, there does not 
exist a path of length < k. This is the same as the definition of a cluster of order k. 

In fact, the k-clusters obtained using this definition are just ordinary cliques in 
graphs obtained by thresholding the k-th power of the given fuzzy graph. Indeed, let C 
be a fuzzy k-cluster, and let A{p*(a, y)|v, y € C} = t. If we threshold p* (and jz) at t, 
we obtain an ordinary graph in which C' is now an ordinary clique. 


10.5.10 Application to Cluster Analysis 


The usual graph theoretical approaches to cluster analysis involve first obtaining 
a threshold graph from a fuzzy graph and then applying various techniques to obtain 
clusters as maximal components under different connectivity considerations. These 
methods have a common weakness, namely, the weight of edges is not treated fairly 
in that any weight greater (less) than the threshold is treated as 1(0). We discuss an 
extension of these techniques to fuzzy graphs. It turns out that the fuzzy graph approach 
can be more powerful. 

The following table provides a summary of various graph theoretical techniques for 
cluster analysis. For cluster procedures (1), (2), and (3), the cluster independence can 
be considered disjoint while that of cluster procedure (4) is limited overlap and that 
of (5) is considerable overlap. The extent of chaining is high, moderate, low, low, and 
none for cluster procedures (1)—(5), respectively. 


Cluster procedure Graph theoretical interpretation of clusters 
1) Single linkage Maximal connected subgraphs 
2) k-linkage Maximal connected subgraphs of minimum degree 


k-edge connectivity | Maximal k-edge connected subgraph 
k-vertex connectivity Maximal k-vertex connected subgraph and cliques on k 


or less vertices 


5) Complete linkage Cliques 


DEFINITION 


D62: Let G = (V,p) be a fuzzy graph. A cluster of type k (k = 1,2,3,4) is defined 
by the following conditions: 

(1) maximal e-connected subgraphs for some 0 < € < 1. 

(2) maximal 7-degree connected subgraphs. 

(3) maximal 7-edge connected subgraphs. 

(4) maximal 7-vertex connected subgraphs. 

Hierarchial cluster analysis is a method of generating a set of classifications of a 
finite set of objects based on some measure of similarity between a pair of objects. It 
follows from the previous definition that clusters of type (1), (2), and (3) are hierarchial 
with different « and 7, whereas clusters of type (4) are not due to the fact 7-vertex 
components need not be disjoint. 
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It is also easily seen that all clusters of type (1) can be determined by the single- 
linkage procedure. The difference between the two procedures lies in the fact that 
e-connected subgraphs can be obtained directly from M,~ by at most n — 1 matrix 
multiplications, where n is the rank of Mg, whereas in the single-linkage procedure, it 
is necessary to obtain as many threshold graphs as the number of distinct values in the 
graph. 

The output of hierarchial clustering is called a dendogram, which is a directed tree 
that describes the process of generating clusters. 

It is shown in [MoNa00] that not all clusters of types 2, 3, and 4 are obtainable by 
procedures of k-linkage, k-edge connectivity, and k -vertex connectivity, respectively. 


FACTS 


F44: The 7r-degree connectivity procedure for the construction of clusters is more 
powerful than the k-linkage procedure. 


F45: The 7-edge connectivity procedure for the construction of clusters is more pow- 
erful than the k-edge connectivity procedure. 


Single Linkage 

An important result in hierarchical clustering is the equivalence between the single 
linkage and connected components of a fuzzy graph. That is, the following four methods 
generate the same partition: 


(1) the single linkage method, 

(2) connected components of an undirected graph, 

(3) transitive closure of a reflexive and symmetric fuzzy relation, 
(4) the maximal spanning tree of a weighted graph. 


The formal statement of the above result can be found in [Mi90, Proposition 6.1, p. 
161] along with its proof and pertinent algorithms. 


10.5.11 Fuzzy Graphs in Database Theory 


In the classical relational database theory, in order to design good databases (no 
data redundancy, no update anomalies) one has to know additional information called 
functional dependencies, which say that some values determine other values. This no- 
tion can be generalized for fuzzy relations. Certain kinds of decompositions of fuzzy 
relational databases can be obtained using level cuts [Ki90, Ki91]. 


DEFINITION 


D63: Let U = {A,..., An} be a set of attributes and V a set of values. Let DOM(A;) 
be a nonempty subset of V,i = 1,...,n. Let R be a subset of the Cartesian cross product 
x?_,DOM(A;). For all t = (ti, ity tn) € Rand A; € U, let t[Aj] = t;,1 = 1,...,n. Let 
X,Y € P(U), the power set of U. Then X is said to functionally determine Y in R if for 
two elements in R, the Y values are equal whenever the X values are equal. Formally, 
Vt,t' eV", VX,Y EU, 


tE Rt eR tx) =e [XJ styj =e [yY], 


where t(X) = (ti,,-+-, ba, ) for X = {Ai,, wy Ai, fe 
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We now fuzzify these ideas. Let p be a fuzzy subset of x?_,DOM(A,;). We replace 
= with the implication —, where Va, b € [0, 1], 


lifa<b, 
a>b= 
1— (a — b) otherwise. 


Then we get that the truth value of the fuzzy relation p satisfies a given functional 
dependency X > Y for X,Y € P(U) : Vti,te EV", 


w(X,¥) = 1—V{ p(t) A plea) | ti[X] = talX], but t[¥] 4 tel¥]}. 


For X,Y € P(U), let XY = X UY. The following properties are easily shown. 


Al IfY CX, then p(X, Y) = 1; 
A2 (X,Y) Apl¥,Z) < W(X, 2); 
AB u(X,Y) < w(XZ,YZ). 


If 7 is a fuzzy relation on P(U), the smallest fuzzy relation T+ on P(U) which 
satisfies Al, A2, A3 is called the closure of Tr. It can be shown tT? = TTT. 

We now associate with 7 a fuzzy graph G, = (w,p) as follows: The vertices are 
ordered pairs (X,Y) with w(X,Y) = 7(X,Y) and edges are ordered pairs of vertices of 
the form ((X,Y), (X, Z)) with p((X,Y), (X, Z)) = 7(Y, Z). An algorithm in [Ki90] and 
[Ki91] can be found that gives r+ by modifying step by step the labels of the graph. 


10.5.12 Strengthening and Weakening Members of a 
Group 


A fuzzy directed graph can be utilized to characterize the role played by an individual 
member in a group that a class of group members having relationship with any given 
member has no sharply defined boundary. The theory of graphs is an important tool 
in the study of the group structure [Ha72]. A strengthening member of a group is 
one whose presence causes the graph corresponding to the group to be more highly 
connected than that obtained when he is absent, while a weakening member is one 
whose presence causes the graph to belong to a weaker category of connectedness. The 
graph can be used to study problems concerning redundancies, liaison persons, cliques, 
structural balance, and so forth. 

In many cases, however, the mere presence or absence of a relation is not adequate 
to represent group structure. There may be different strengths of the relations between 
individuals. There may even be situations in which it is fuzzy rather than well-defined 
whether or not an arbitrary individual has a relationship with a given member, that is, 
a class of group members being in relationship with any given member does not have a 
sharply defined boundary. In such cases, fuzzy graphs become a more relevant model 
[TaNi76]. 


10.5.13 Network Analysis of Fuzzy Graphs 


Posing problems on networks serves as a means for visualizing a problem and for 
developing a better understanding of the problem. It also has certain computational 
advantages. It is easier for a decision maker to draw a picture of what he wants than it 


1334 Chapter 10. Graphs in Computer Science 


is to write down constraints. There are a wide variety of network type problems such 
as location, transportation, flow, reliability, and shortest path. These models often deal 
with deterministic data and a single objective. Fuzzy counterparts can be found in 
[K191, MaMo00]. 

The most basic network problem is the shortest path problem. The fuzzy shortest 
path problem was first analyzed in [DuPr80]. The fuzzy shortest path can be found, 
but it may not correspond to an actual path in the network. Generally fuzziness is 
introduced into the network through arc capacities, arc lengths, or vertex restrictions. 
To circumvent the problem of the fuzzy shortest path distance not corresponding to 
an actual path, a different category of fuzzy path problems is considered. Rather than 
viewing each arc as a fuzzy number, let each possible arc length and path length be a 
fuzzy set. Then each arc has a membership grade in each fuzzy set corresponding to 
a length. Then through a DP recursion, it is possible to find the shortest fuzzy path 
length. This length will be a fuzzy number that may not correspond to an actual path. 
however, each value in the fuzzy set with positive membership grade will correspond to 
a path in the network. Results on decision trees can be found in [Ad80]. 

In [Ko92], a way is shown for modeling various quantitatively featurable functional 
capabilities of computer, communication, and similar networks. It is shown that func- 
tional capability of distributed hierarchical multicomponent systems (networks) can be 
described by the directed rooted tree model according to fuzzy graph ideas. The prob- 
lem of optimizing the overall functional capability with a given component (vertex) set 
is introduced. The fuzzy formulae based on the algebraic connectives which are to be 
optimized are constructed. 

Imprecise observations or possible perturbations mean that capacities and flows in a 
network may well be better represented by intervals or fuzzy numbers than crisp quan- 
tities. In [Di01] analogues of the MFMCT and Karp-Edwards algorithm for networks 
with fuzzy capacities and flows are developed. The principal difference between fuzzified 
and traditional crisp versions is that although the maximum fuzzy flow corresponds to 
a minimum fuzzy capacity, the latter may incorporate a number of network cuts. The 
preliminary results are for interval-valued flows and capacities which, in themselves, 
provide robustness estimates for flows in an uncertain environment. 

Many types of document networks exist such as bibliographic databases containing 
scientific publications, social networking services, as well as databases of datasets used in 
scientific endeavors. However, the prime example of a document network is the World 
Wide Web (WWW). Each of these databases possesses several distinct relationships 
among documents and between documents and semantic tags or indices that classify 
documents appropriately. For instance, documents in the WWW are related via a hy- 
perlink network, while documents in bibliographic databases can be related to semantic 
tags such as keywords used to describe their content. Given these relations, we can 
compute distance functions (typically via co-occurrence measures) among documents 
and/or semantic tags, thus creating associative, weighted networks between these items 
— which denote stronger or weaker co-associations. 


10.5.14 Intuitionistic Fuzzy Graphs and Group 


Decision-Making 
DEFINITION 
D64: Let Fy and Fz be sets and let G C Ey x Ee. Let pg, vg : FE, x E2 > [0,1] be 
such that 0 < ue(x,y) + ve(2,y) < 1 for all (x,y) € Ey x Ey. Then the set 


G* = {((2,y), wa(2, y), Ya(z,y))\(t,y) © Fi x Eo} 


is called an intutionistic fuzzy graph. (The functions ug and vg are interpreted as 
the degree of membership and nonmembership, respectively, of the element (a, y) in the 
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set G.) 


In group decision-making, a set of experts in a given field is involved in a decision 
process concerning the selection of the best alternatives among a set of predefined ones. 
Each expert is asked to evaluate at least a subset of the alternatives in terms of its 
performance with respect to predefined criterion. The expert evaluations are expressed 
as a pair of numeric values, interpreted in the intuitionistic fuzzy framework. These 
numbers express a “positive” and a “negative” evaluation, respectively. Intuitionistic 
fuzzy graphs can be constructed for each expert. A suitable operation is performed 
on these intuitionistic fuzzy graphs to obtain a single intuitionistic fuzzy graph which 
provides an aggregation of the experts’ opinions [At99, AtPaYaAt03]. 
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INTRODUCTION 

Roughly speaking, expander graphs are large, sparse, pseudorandom graphs. They enjoy 
a remarkable range of applications (particularly in computer science), including switch- 
ing networks, derandomization, error-correcting codes, cryptographic hash functions, 
and much, much more. Moreover, dozens of other branches of mathematics connect in 
some way to the theory; the list includes, among others, functional analysis, analytic 
number theory, ergodic theory, combinatorics, random walk theory, operator algebras, 
representation theory, and geometric group theory. To quote Tao [Tal2], “it is quite 
remarkable that a single problem—namely the construction of expander graphs—is so 
deeply connected with such a rich and diverse array of mathematical topics. (Perhaps 
this is because so many of these fields are all grappling with aspects of a single gen- 
eral problem in mathematics, namely when to determine whether a given mathematical 
object or process of interest ‘behaves pseudorandomly’ . . .).” 


10.6.1 Foundational Definitions and Results 


Isoperimetric Constants and Expander Families 


DEFINITIONS 


D1: Let X be a graph with vertex set V, and let S C V. The boundary of S', denoted 
OS, is the set of all edges in X incident to both a vertex in S and a vertex not in S. 


D2: Let X be a finite graph with a vertex set V. The isoperimetric constant of 
X, denoted h(X), is the minimum, over all subsets S of V with |S < n/2, of |05|/|S|, 
where |S denotes the number of vertices in S, and |0S| denotes the number of edges in 
OS. A set S that achieves this minimum is called an isoperimetric set. 
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The invariant h(X) is sometimes also called the expansion constant of X, the edge 
expansion constant of X, the Cheeger constant of X, or (for regular graphs) the 
conductance of X. It is sometimes denoted i(X). 

Equivalently, one can remove the restriction on S and replace |S| with min{|S|, |.S°|}. 


D3: Let (X,,) be a sequence of finite regular graphs, each with the same degree. We 
say (X,) is an expander family if |X,,| > co as n — oo, and there exists « > 0 such 
that h(X,,) > ¢ for all n. (Here |X,,| denotes the order of the graph X,,.) 


While we sometimes speak of “expander graphs,” the more apropos term is “expander 
family,” because generally speaking, no graph can be a straight-out expander in and of 
itself. 


EXAMPLES 
El: We have that h(X) = 0 if and only if X is not connected. 


E2: Regard the cycle graph Cy, as a Cayley graph on the group Z, of integers modulo 
n with generating set {+1}. Let |a| denote the largest integer less than or equal to x. 
Then S = {1,...,|n/2|} is an isoperimetric set, with |OS| = 2. Figure 10.6.1 illustrates 
the situation for Cg. Hence h(C,,) = 4/n if n is even, and h(C,) = 4/(n — 1) if n is 
odd. It follows that (C,,) is not an expander family. So d-regular expander families do 
not exist for d < 3. 


Figure 10.6.1: The cycle graph Cg, with an isoperimetric set as hollow dots and its 
boundary as dashed line segments. 


Relationship to Graph Spectra 


DEFINITIONS 


D4: Let X be a finite graph with vertex set V. Let L?(X) be the set of all complex- 
valued functions on V. The adjacency operator of X is the linear mapping A : 
L?(X) > L?(X) defined by (Af)(v) = > f(w), where the sum is over all vertices w 
adjacent to v. We use the notation A9(X), Ai(X),...,An—1(X) for the eigenvalues of 
A, with the convention that Ap(X) > A(X) > --- > An-1(X). We frequently write A; 
instead of \;(X) when the graph X is understood. 


D5: Let X bea finite d-regular graph. The quantity d— ., is the spectral gap of X. 


FACTS 


F1: Let X be a finite d-regular graph. For j = 0,1,...,n — 1, we have |\,| < d. (See, 
for example, [GoRo01] or [KrSh11] for a proof.) 
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F2: Let X be a finite d-regular graph. Then 

d—, & 

i 

The inequality (d—,)/2 < h(X) follows from Rayleigh—Ritz theorem of linear algebra; 
see [GoRo01] or [KrSh11]. See [Mo89] for a proof of the right-hand inequality. 


h(X) < /(d+ )(d—Aa). 


A weaker inequality, h(X) < ,/2d(d— 4.1), is due to Alon, Milman, Tanner, and 
Dodziuk. Computing or estimating the isoperimetric constant directly from its definition 
can be quite difficult. Fact F2 allows us to obtain information about it indirectly via 
eigenvalues, for which one can employ algebraic tools such as the Rayleigh—Ritz theorem. 
Fact F2 also induces a connection between expander graphs and random walk theory, 
as A, plays a prominent role in the latter. 


F3: As an immediate corollary of the preceding fact, we see that a sequence of d- 
regular graphs forms an expander family if and only if the spectral gaps are uniformly 
bounded away from zero. In other words, let (X,,) be a sequence of d-regular graphs 
with |X,,| > co. Then (X,,) is an expander family if and only if there exists « > 0 such 
that d—Ai(X,,) > € for all n. 


In lieu of the adjacency operator, one sometimes studies the eigenvalues of the Laplacian 
operator A instead. For a d-regular graph, we have A = dI — A, where J denotes the 
identity operator. 


Ramanujan Graphs and the Alon—Boppana Theorem 


DEFINITIONS 
D6: Let X be a finite regular graph of order n. We define 
MX) := max{|\i(X)|,|An-1(X)|} if X is nonbipartite 
~~ | max{|A1(X)|,|An—2(X)|} if X is bipartite. 


For finite nonbipartite d-regular graphs, the largest eigenvalue d can be thought of as a 
trivial eigenvalue. For finite bipartite d-regular graphs, the extremal eigenvalues d and 
—d are trivial. So A(X) more or less captures the second-largest non-trivial eigenvalue 
of X. 


D7: Let X be a finite d-regular graph. We say X is Ramanujan if \(X) < 2Vd—1. 


Some authors define X to be Ramanujan when A1(X) < 2Vd— 1. 
Definition D7 is motivated by the Alon—Boppana theorem (Fact F5). 
EXAMPLE 


E3: For n > 3, the complete graph K,, is nonbipartite with eigenvalues n — 1 (with 
multiplicity one) and —1 and is therefore Ramanujan. 


FACTS 


F4: Suppose (X,,) is a sequence of finite d-regular graphs, where d > 3 and |X,,| > co. 
Using Fact F3, a quick computation shows that if each X,, is Ramanujan, then (X,,) is 
an expander family. 
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F5: Let (X,,) be a sequence of finite d-regular graphs with |X,,| > oo. Then 
lim inf A(X,,) > 2Vd — 1. 


As discussed in [Mu03], this theorem is attributed to Alon and Boppana but is also 
independently due to Serre. 


Group Representations and Kazhdan Constants 


DEFINITIONS 


D8: Let G be a finite group, and let [ C G. For any finite-dimensional unitary 
representation 7: G + GL(V), define «(G,IT,7) to be the minimum, over all unit 
vectors uv € V andally € I, of ||7(y)v—v]|. (Note that a minimum exists by compactness 
of the unit sphere in V.) We define the Kazhdan constant «(G,T) to be the minimum, 
over all nontrivial irreducible unitary representations 7 of G, of «(G,I',7). (Note that 
a minimum exists because there are only finitely many irreducible representations of G, 
and because compatible inner products are unique up to a scalar constant—see [KrSh11] 
for details.) 


Many variations of Definition D8 are possible. The following variant is sometimes 
also called the Kazhdan constant. Recall that for a finite group G, the right regular 
representation R : L?(G) + L?(G) is defined by (R(y)f)(g) = f(gy), where L?(G) 
denotes the set of all complex-valued functions on G, with standard inner product 


(f,9) = eee f(x) g(2). 


D9: Let G be a finite group, and let lr c G. Let L2(G) denote the set of complex-valued 
functions on G orthogonal to a nonzero constant function. Let R be the restriction of 
the right regular representation to L2(G). Define &(G,T) := K(G,T, R). 


D10: Let G be a group, and let ! Cc G. We denote by Cay(G,I) the Cayley graph of 
G with respect to I, that is, the graph with vertex set G so that x and y are adjacent if 
and only if y~'x €T. Recall that we say T is symmetric if y € T implies that y~' € T. 
Recall also that Cay(G,T) is an undirected graph if and only if I is symmetric. 


FACTS 

F6: Let G be a finite group, and let TC G. Then «(G,T) > &(G,P) > “SP. 

F7: Let G be aa finite group, and let [ be a symmetric subset of G. Let 7,..., 7% 
be a complete set of inequivalent irreducible representations of G. For 7 = 1,...,k, let 
Mz; = doyer ™j(7). Let A be the adjacency operator of Cay(G,I). Then A is unitarily 
equivalent to a direct sum of the operators M,,,, where for each j = 1,...,k, the number 


of times M,, appears in this direct sum equals the dimension of 7;. 


Fact F7, first observed in [Lo75], provides a crucial link that allows one to apply the 
theory of group representations to the analysis of Cayley graphs; the next theorem 
follows from this link and is of particular interest in the theory of expander graphs. 


F8: Let G be a finite group, T be a symmetric subset of G, and d = |[|. Let X = 
Cay(G,T). Then 


2,/n(X) > #(G,T) > SS, 
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For proofs of Facts F6 and F8, see [KrSh11] or [Lu94] or [LuWe93]. Putting them 
together with Fact F3, we get the following. 


F9: Let (G,,) be a sequence of finite groups with |G,,| + co. Let d be a positive integer. 
For each n, let T, be a symmetric subset of G,, with |[,| = d. Then (Cay(G,,T,)) 
is an expander family if and only if there exists a positive real number € such that 
K(Gr,T,) > € for all n. 


In other words, for a sequence of Cayley graphs, the isoperimetric constants are bounded 
away from zero if and only if the spectral gaps are bounded away from zero if and only 
if the Kazhdan constants are bounded away from zero. 


10.6.2 Major Results and Open Problems 


Existence and Construction of Expander Families 


DEFINITION 

In the following definition, graphs will be undirected, but multiple edges and loops are 
allowed. If v is a vertex in a graph and e is an edge incident to v, then we denote by 
e(v) the other endpoint of e. 


D111: Let X,Y be finite regular graphs, where the degree of X equals the order of Y. 
Let Vx,Vy be the vertex sets of X and Y, respectively. Let Ex, Ey be the edge sets 
of X and Y, respectively; treat multiple edges as distinct elements. For each v € Vx, 
let E, = {e € Ex | eis incident to v}, and let L, : Vy > E, be a bijective function. 
Define the zig-zag product X@Y to be the graph with vertex set Vx x Vy so that 
the number of edges between (21, yi) and (#2, y2) equals the number of ordered pairs 
(21,22) € Ey x Ey such that y; is an endpoint of 21; yg is an endpoint of z2; and 
Lz, (21(y1)) = Leo (z2(y2)). 


Many variations of this definition are possible; see, for example, [ASW08], [BeTa11], 
and [RVW02]. 


FACTS 
F10: Expander families exist. 


Fact F10 is originally due to Pinsker [Pi73], who used a probabilistic argument. Margulis 
gave the first explicit construction of an expander family in [Ma73]. His construction 
relies on forming quotients of infinite discrete groups with Kazhdan’s Property T. Fact 
F13 implies that for any integer d > 3, a d-regular expander family exists. 

For any d-regular graph T, let p(T) = A(T)/d. (Dividing by d effectively replaces the 
adjacency operator with the Markov chain transition operator.) 


F11: Let X be a finite dx-regular nonbipartite graph, and let Y be a finite dy-regular 
nonbipartite graph such that dx equals the order of Y. Then 


u(X@Y) < w(X) + WY) + u(Y)?. 


Fact F11 is independent of the choice of L. 
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F12: Recall that for a graph T with adjacency operator A, we denote by T? the graph 
with adjacency operator A?. Let W be a nonbipartite d-regular graph with order d+ 
such that u(W) < 1/5. (Such a graph W indeed exists.) Recursively define a sequence 
(W,,) by W, = W? and W,,41 = W2@W. It then follows from Facts F3 and F11 that 
(W,,) is an expander family. 


Facts F1l and F12 are due to [RVW02]. Other constructions and stronger estimates 
can be found in that paper as well. The significance of the zig-zag product approach 
is that the proofs involve nothing more than elementary linear algebra, in contrast to 
previous constructions of expander families, where the proofs relied on deep results in 
analytic number theory. A related graph product, the (balanced) replacement product, 
is used in [ASWO8] to construct an expander family; that proof is purely combinatorial, 
as it bypasses eigenvalues altogether and instead uses the definition of the isoperimetric 
constant directly. 


Ramanujan Graphs and Zeta Functions 


DEFINITIONS 


D12: Let X bea graph. Let C = (vo, eo, U1,---,;€n—1,; Un = Vo) be a closed walk in X. 
Then C backtracks if e; = e;+, for some i with O< 7 <n-—-—1. Otherwise, we say that 
C is backtrackless. (In other words, C' is backtrackless if no edge is traversed twice 
consecutively.) We say C' has a tail if e9 = en_1. Otherwise, we say that C is tailless. 
We say a closed walk P is prime if it is backtrackless, is tailless, and cannot be written 
as a k-fold concatenation of a closed walk C’' with itself. 


D13: Let X bea finite regular graph. The Ihara zeta function ¢x(u) of X is defined 
to be the product, over all equivalence classes of prime walks, of the complex-valued 
functions (1 — ullP))\*, Here ¢(P) denotes the length of the walk P, and two walks are 
said to be equivalent if one is obtained from the other by cyclic permutation. That is, 
the walks (vo, €0, U1,---;@n—1; Un) (U1, €1,-++5@n—1; Un; €0; U1); -++5 (Un—1, En—1; Un; C0; 
UL, +++) Cn—2) Un—1) are all equivalent. 


The Ihara zeta function satisfies many functional relationships like those of other zeta 
functions. The book [Tel1] provides a thorough treatment. 


FACTS 


F13: Let q be a prime power. Then there exists a family of (q+ 1)-regular Ramanujan 
graphs. 


Fact F13 is due to Lubotzky, Phillips, and Sarnak [LPS88] in the case where gq is an 
odd prime. (Similar results were obtained by Margulis [Ma88].) These “LPS graphs” 
are Cayley graphs on the groups PGL(2, Z,) and PSL(2, Z,) for certain primes p. They 
provide an explicit construction of regular graphs with arbitrarily large girth and chro- 
matic numbers; Erdés had previously nonconstructively demonstrated the existence of 
such graphs. Chiu [Ch92] has a construction of Ramanujan graphs for the case gq = 2, 
and Morgenstern [Mo94] has one for arbitrary prime powers q. 

Open problem: For any integer d > 3 not equal to one plus a prime power, determine 
whether a family of d-regular Ramanujan graphs exists. 
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F14: The Ihara zeta function of a finite regular graph X satisfies the Riemann hy- 
pothesis if and only if X is Ramanujan. That is, for a finite (¢ + 1)-regular graph X, 
we have that X is Ramanujan if and only if 0 < Re(s) < 1 and ¢x(q~*) = 0 implies 
that Re(s) = 1/2. 


See, for example, [Te99] for a proof of Fact F14. 


Group Structure and Expansion 


DEFINITIONS 


D14: Let (X,,) be a sequence of finite graphs. For each n, let V,, be the vertex set of 
Xn. We say (X,) has logarithmic diameter if diam(X,,) = O(log |Vn|). 


D15: Let (G,) be a sequence of finite groups. We say (G,,) has logarithmic diame- 
ter if there exists a positive integer d and symmetric subsets T, C G, with |T,,| = d for 
all n such that diam(Cay(G,,T',)) = O(log|Gn|). We say (G,,) yields an expander 
family if there exists a positive integer d and symmetric subsets T, C G, with |T,,| = d 
for all n such that (Cay(G,,T,)) is an expander family. 


FACTS 
F15: Let (X,,) be an expander family. Then (X,,) has logarithmic diameter. 


The proof of Fact F15 follows quickly from the definitions; [KrSh11], for example, works 
out the details. 


F16: Let (G,) be a sequence of finite groups. For each n, let Q, be a homomorphic 
image of G,. Suppose that |Q,| > oo. If (G,) yields an expander family, then (Q,) 
yields an expander family. 


For a subgroup H of a group G, denote by [G : H] the index of H in G. 


F17: Let (G,) be a sequence of finite groups. For each n, let H,, be a subgroup 
of Gyn. Suppose that there exists a positive integer M such that for all n, we have 
IG, : Hn] < M. If (Gy) yields an expander family, then (H,,) yields an expander 
family. 


See [KrSh11] for proofs of Facts F16 and F17 directly from the definition of isoperimetric 
constant and [KrSh11] or [LuWe93] for proofs using Kazhdan constants. 

For a group G, recall that the derived subgroup G‘”) is defined recursively by taking 
G©) = G and defining G°"+) to be the commutator subgroup of G(”). For a solvable 
group G, denote by &(G) the derived length of G. The following fact is an immediate 
consequence of Facts F16 and F17 as well as Example E4. 


F18: Suppose (G,,) is a sequence of finite groups that yields an expander family. Then 


for all positive integers m, the sequence ( ie) is bounded. In particular, if each G,, 
is solvable, then the sequence (¢(G,,)) must be unbounded. 


We remark that there exists a sequence of solvable groups (indeed, p-groups, hence 
nilpotent groups) that yields an expander family [LuWe93]. 
Other group-theoretic obstructions to expansion are discussed in [Lu94] and [LuWe93]. 
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F19: Let (G,,) be a sequence of finite nonabelian simple groups with |G,,| > co. Then 
(G,) yields an expander family. 


Fact F19 represents the combined work of many authors. An overview of all cases except 
that of Suzuki groups can be found in [KLN06]. This final case is dealt with in [BGT11], 
which draws on techniques developed in [BoGa08}. 

Open problem: Find necessary and sufficient conditions for a sequence of groups to 
yield an expander family. 


EXAMPLES 
E4: Let G be an abelian group, I a symmetric subset of G, and d = |[|. Then 
diam(Cay(G,T)) > |G|4/4 - 1. 


It follows that no sequence (G,,) of abelian groups with |G,| — co has logarithmic 
diameter. Hence by Fact F15, no sequence of abelian groups yields an expander family. 
See [KrSh11] for a detailed proof. 


E5: Let D, denote the dihedral group of order 2n. Then D,, contains a cyclic group 
of order n as a subgroup of order 2. So by Fact F17 and Example E4, we see that (D,) 
does not yield an expander family. 


E6: For each positive integer n, let V,, be the set of all ordered pairs of the form (s, k), 
where s is a string of Os and Is of length n, and k is an element of the integers modulo 
n. Define the cube-connected cycle graph CCC, to be the cubic graph with vertex set 
V, so that (s,k) is adjacent to (s,k+1) and to (s’,k), where s’ is the string identical to 
s except in in position k. So CCC), is something like a hypercube, but with each vertex 
replaced by an n-cycle. See Figure 10.6.2 for a picture of CCC3. Then (CCC,,) has 
logarithmic diameter but is not an expander family. Let G, be the wreath product of 
the cyclic group of order 2 with the cyclic group of order n. Then C'CC,, can be realized 
as a Cayley graph on G,,. Hence (G,,) has logarithmic diameter. However, because G,, 
admits the cyclic group of order n as a homomorphic image, by Fact F16 and Example 
E4, we see that (G,,) does not yield an expander family. So the converse of Fact F15 
fails. See [KrSh11] for a more thorough discussion of this example. 


Figure 10.6.2: A cube-connected cycle graph. 


E7: Let y = (1,...,n) and 7 = (1,2) be elements of the symmetric group S,, on n 
letters. The bubblesort graphs are defined to be the graphs Cay(Sn,{y,y~1,7}). The 
bubblesort graphs do not form an expander family (see [KrSh11] or [Lu94]). However, in 
[Ka07], Kassabov proves that the symmetric groups yield an expander family. Therefore 
group structure does not determine expansion. 
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Random Graphs and Expansion 


DEFINITIONS 


D16: Let X be a graph, and let A be a nonempty set of vertices of X. The vertex 
boundary of A, denoted 0’(A), is the set of all vertices of distance 1 from A. 


D17: Let X be a finite graph of order n. Let V be the vertex set of X. We say X is 
an ¢-expander if for every nonempty subset A of V, we have |0’A| > «(1 — |A|/n)| A]. 


FACTS 
The following fact essentially shows that for d > 3, a random sequence of d-regular 
graphs will form an expander family. 


F20: [Bo88] Let d > 3 be an integer, and let 0 < a < 1 bea real number such that 
24/4 < (1—a)!~*(1+a)!*%. Then as the order n goes to infinity, the probability that 
h(X) > (1 — a)d/2 for a randomly chosen d-regular graph X of order n goes to 1. 


F21: [Fr91] Let d be a positive integer, and let € be a positive real number. Then as 
the order n goes to infinity, the probability that \1;(X) > 2Vd—1-+ « for a randomly 
chosen d-regular graph X of order n goes to 1. 


Fact F21 had been conjectured by Alon. Sarnak [Sa04] summarizes its implication by 
noting, “So the random graph is asymptotically Ramanujan.” 


F22: [AlRo94] Let « < 1 be a positive real number. Then there exists 6 > 0 such that 
for any positive integer n and any group G of order n and any randomly chosen set I 
of at least 6 log,(n) elements of G, the probability that Cay(G,I) is an expander goes 
to lasn > o. 


In the facts above, the papers cited make precise the meaning of “randomly chosen.” 


Computer-generated statistical evidence seems to indicate that slightly more than half 
of bipartite (and slightly more than one-fourth of nonbipartite) regular graphs are Ra- 
manujan. Based on their data, Miller and Novikoff [MiNo08] conjecture that the dis- 
tribution of second-largest eigenvalues approaches a Tracy-Widom distribution as the 
number of vertices goes to infinity. 


10.6.3 Other Surveys and General Sources 


Readers looking for other general surveys on expanders will find no shortage of material. 
In only two pages, [Sa04] provides a remarkably thorough overview. The award-winning 
survey [HLW06] goes into considerably more depth, with an especially rich discussion of 
applications in computer science. Another survey, [Lu12], stands out for its discussion 
of connections to number theory, group theory, and geometry. Terrence Tao’s blog post 
[Ta12] on the subject is well worth a read. The book [Lu94] is an essential guide for 
many in the field. The authors’ text [KrSh11] presents some elementary aspects of 
expander graphs in a way that is meant to be accessible to advanced undergraduates or 
beginning graduate students and contains many ideas for student research projects such 
as REUs. Xiao’s undergraduate thesis [Xi03] deserves special mention, particularly for 
its discussion of various applications in computer science. Relationships between graph 
eigenvalues and other graph invariants are explored in detail in [Ch97]. Murty’s survey 
article [Mu03] on Ramanujan graphs goes into more detail on that subject. 
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INTRODUCTION 


Visibility graphs are often studied within the field of graph drawing, itself a sub- 
discipline of computational geometry. Good overviews of these fields can be found 
in [DiEaTaTo99, GoOR04, DeOR11], and the websites www.graphdrawing.org and 
www.cs.smith.edu/~orourke/ are resources for additional material. Visibility graphs 
are studied for their theoretical, algorithmic, and applied interest. Most generally a 
visibility layout is formed by a collection of objects placed in an ambient space with 
prescribed visibility between objects. The vertices of a visibility graph correspond to 
objects and its edges correspond to visibilities between two objects. Here we specialize 
to polygonal shapes in R” and unobstructed line-of-sight visibility, typically requiring 
both object placement and visibility parallel to one or more axes in R”. Often our 
objects are axis-aligned line segments, rectangles, boxes, etc. Other types of visibil- 
ity graphs involve more complex configurations (see §10.7.7 and [DeHaMo03]) and are 
studied, for example, in motion planning problems [DeOR11, Str05], VLSI design and 
two-layer routing [U184]. 


10.7.1 Bar-Visibility Graphs 


The first study of rectilinear objects in R? with vertical and horizontal visibility is 
due to Garey, Johnson, and So [GaJoSo76] in which circuit board elements are laid out 
in a piecewise linear fashion and errors in fabrication are detected using an algorithm 
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that employs graph coloring results. Possible erroneous short circuits are detected 
with at most twelve tests; for an exposition and improvement to at most four tests, 
see [Hu93]. Theoretically the question of representing planar graphs with horizontal 
segments and vertical visibility was posed by Melnikov [Me81] and by de Fraysseix and 
Rosenstiehl [deFRo81]. Initial results were obtained by Duchet et al. [DuHaLaMe83] 
and by Thomassen [Th84], and a comprehensive history can be found in [DiEaTaTo99]. 


DEFINITIONS 


D1: A bar-visibility (or BV) layout of a graph G is a representation of G in the 
plane by disjoint, closed, horizontal line segments (called bars) in which each vertex 
corresponds to a bar and two vertices are adjacent if and only if there is an unobstructed, 
non-degenerate vertical visibility band between the corresponding bars [Wi85]. (Non- 
degenerate visibility means visibility through a positive-width band.) A bar-visibility 
graph (or BVG) is a graph with a BV layout. 


D2: A planar graph has a strong visibility layout if its vertices can be represented 
by disjoint, closed, horizontal line segments and two vertices are adjacent if and only 
if there is unobstructed vertical visibility for edges in which visibility may be along a 
(width-0) line. 


D3: In the settings of both bar-visibility and strong visibility, a planar graph is said 
to have a weak visibility layout if it can be represented by disjoint, closed, horizontal 
line segments for vertices and with vertical visibility between the bars for each pair of 
adjacent vertices. (Thus there may be visibilities that do not correspond to edges.) 


REMARKS 
R1: A BV layout is also known as an €-visibility layout [TaTo86]. 


R2: <A BV layout induces a plane embedding of G by placing each vertex on its 
corresponding bar and drawing edges between pairs of vertices whose bars have vertical 
visibility; see also Fact F3 and Example E2. 


R3: A subgraph of a BVG may not be a BVG; see Example El and Fact F2. 


R4: When disjoint horizontal line segments are laid out, the resulting BVG is the same, 
regardless of whether the segments are closed, open, or neither, but these choices can 
make a difference in the graph of the strong visibility layout. As is customary [TaTo86], 
we choose always to use closed line segments. 


R5: It is an artifact of the evolution of terminology that the definitions of strong and 
weak visibility do not describe complementary properties of visibility layouts. 


EXAMPLE 


E11: Figure 10.7.1 shows a bar-visibility graph G and a layout of G. The layout 
is not a strong layout of G because, for instance, there is a degenerate visibility line 
between bars 2 and 7, but the vertices are not adjacent. If we form a subgraph G’ by 
deleting the edge (1,6), we obtain the smallest planar graph that is not a bar-visibility 
graph, namely, the bipartite graph K2,3 plus three pendant edges. However, G’ is a 
weak bar-visibility graph since it is a subgraph of a bar-visibility graph. 
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Figure 10.7.1: A graph and its BV layout. 


FACTS 


F1: Any set S of horizontal segments in R? determines a BVG G of which it is a BV 
layout. S also determines a graph G’ of which it is a strong BV layout, and G is a 
subgraph of G’. Every subgraph of G and of G’ has a weak visibility layout using the 
segments of S and the appropriate model of bar-visibility or strong visibility. 


F2: Wismath [Wi85] and Tamassia and Tollis [TaTo86] showed that a planar graph is 
a BVG if and only if it can be drawn in the plane with all cut-vertices on a common 
face (and without loss of generality that face is the infinite, exterior face). [TaTo86] 
gives a linear-time algorithm to determine whether a graph is a BVG. An alternative 
layout algorithm follows from giving the graph a canonical ordering [deFHuPaPo90, 
Ka96, Nu99]. 


F3: Every BVG (and every graph with a strong visibility layout) has a planar drawing 
with edges that are polylines with at most two bends (i.e., consisting of at most three 
contiguous straight-line segments) [LuMaWo87, DiTa88, DiEaTaTo99]; see also E2. 


EXAMPLE 


E2: A plane drawing of the graph induced by a BV layout can be obtained by vertically 
thickening each bar to become a rectangle and placing a vertex in its center. A polyline 
edge with at most two bends is produced by choosing a visibility line and drawing 
radial lines from each of its endpoints to the vertex in the center of the thickened bar; 
see Fig. 10.7.2. 


Figure 10.7.2: A BV layout induces a 2-bend polyline embedding. 


FACTS 


F4: Tamassia and Tollis [TaTo86] show that every maximal planar graph has a strong 
visibility layout that can be determined in linear time, and every 4-connected planar 
graph has one that can be determined in polynomial time. They also provide a charac- 
terization of graphs G that have a strong visibility layout in terms of extensions of G 
that have a “strong st-numbering.” 
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F5: Andreae [An92] proved that deciding whether a planar graph has a strong visibility 
layout is an NP-complete problem. 


F6: In the settings of bar-visibility [TaTo86] and strong visibility [DuHaLaMe83], a 
graph has a weak visibility layout if and only if it is planar. Thomassen [Th06] proved 
that a countably infinite graph is a weak visibility graph if and only if it is planar. 


DEFINITIONS 


D4: _ A visibility layout is collinear if there are two bars with endpoints having the 
same x-coordinate; otherwise it is noncollinear. 


D5: A graph G isa unit bar-visibility graph if it has a BV layout in which all bars 
have equal length. 


D6: A digraph D is a directed bar-visibility graph if there is a BV layout of the 
underlying, undirected graph G such that for each arc (u,v) of D, the visibility of u to 
uv is in the positive y direction. 


D7: A planar graph is a near-triangulation if it has a plane embedding with all 
finite, interior faces 3-sided. 


D8: A caterpillar is a tree containing a path with the property that every vertex is 
at distance at most one from the path. 


FACTS 


F7: Every noncollinear visibility layout is both a BV layout and a strong BV layout 
of the same graph, but the bar-visibility graph and the strong visibility graph induced 
by a collinear layout may be different. In [TaTo86] it is shown that every graph with 
a strong visibility layout also has a BV layout. The 4-cycle Cz has a strong visibility 
layout but not a noncollinear visibility layout [LuMaWo87, TaTo86], and the complete 
bipartite graph K2,4 has a BV layout but not a strong visibility layout [TaTo86]. Hence 
the noncollinear BVGs form a strict subclass of the graphs with strong visibility layouts, 
which in turn form a strict subclass of the BVGs. 


F8: Luccio et al. [LuMaWo87] proved that a planar graph has a noncollinear visibility 
layout if and only if after possibly duplicating some edges it is a near-triangulation. 


F9: A tree is a unit BVG if and only if it is a subdivision of a caterpillar with maximum 
degree 3 [DeVe03]. In [DeGeHu05] triangulated polygons with unit-length BV layouts 
are characterized. [ChHuKeSh06] studies BVGs in which all bars have length between 
1/k and 1 given integer k > 1, and they characterize all trees that have such a layout. 


F10: A digraph D is a directed bar-visibility graph if and only if D* is planar and 
acyclic where D* is formed by adding two vertices, a source s and a sink t, an arc (s,v) 
to every vertex vu with indegree 0, an arc (w,t) from every vertex w with outdegree 0, 
and the arc (s,t) [Wi85, TaTo86]. 


REMARKS 


R6: In [KiWi96] bar-visibility layouts of edge-weighted graphs are studied with the 
requirement that the width of a visibility band be proportional to the weight of the edge 
to which it corresponds. 
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R7: From a BV layout of G one obtains a tessellation representation in R? with 
an axis-aligned rectangular tile representing each vertex, edge, and face of the induced 
plane embedding of G. This is obtained essentially by expanding each visibility band 
to a maximal band. Then the vertices and faces are represented by line segments or 
degenerate tiles. Such tessellations have been studied in the context of undirected and 
directed graphs; see [RoTa86, TaTo89, MoRo98, DiEaTaTo99]. 


R8: BV layouts have also been studied and characterized on the surface of a cylinder, 
a Mobius band, and the projective plane [TaTo91, De01, Hu05]. 


RESEARCH PROBLEMS 


RP1: Is there a characterization of planar graphs that have BV layouts with all bars 
of unit length or a characterization with all bars lying within the interval [1/k, 1]? 


10.7.2 Rectangle-Visibility Graphs 


Another visibility model that derives from the rectilinear polygon layouts studied in 
[GaJoSo76] is the class of rectangle-visibility graphs, in which vertices are represented 
as axis-parallel rectangles in the plane and edges are induced by both vertical and hor- 
izontal visibilities. The first studies of these occurred in [Wi89, BoDeHuSh97, DeHu97, 
HuShVi99]. 


DEFINITIONS 


D9: A rectangle-visibility (or RV) layout of a graph G is a representation of G in 
the plane by interior-disjoint axis-parallel rectangles in which each vertex corresponds 
to a rectangle and two vertices are adjacent if and only if there is an unobstructed, non- 
degenerate vertical or horizontal visibility band between the corresponding rectangles. 
A rectangle-visibility graph (or RVG) is a graph with an RV layout. 


D10: A unit rectangle-visibility graph (or unit RVG) is a graph with an RV 
layout using unit squares. 


D11: For integer k > 1, a graph has thickness k if k is the minimum for which the 
graph’s edges can be partitioned into & planar graphs. 


D12: A k-tree is either a k-clique or a graph formed from a smaller k-tree T by adding 
a vertex adjacent precisely to the vertices of a k-clique within T. A partial k-tree is a 
subgraph of a k-tree. 


REMARKS 
R9: In [Wi89] RVGs are called 2-box-representable graphs. 


R10: The analogous definitions of strong, weak, collinear and noncollinear visi- 
bility apply to RV layouts as to BV layouts. 


R11: The horizontal and vertical visibilities of an RV layout induce a partition of edges 
of the corresponding RVG into two (planar) BVG subgraphs; hence the thickness of an 
RVG is at most two [GaJoSo76]. 
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EXAMPLES 


E3: Figure 10.7.3 shows an RV layout of the nonplanar graph K44 and the partition of 
its edges, induced by the horizontal and vertical visibilities, into two planar subgraphs, 
drawn with straight lines. 
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Figure 10.7.3: An RV layout of K44 and the subgraphs induced by its horizontal and 
vertical visibilities. 


E4: Figure 10.7.4 shows a layout of Kg as an RVG, and the two induced planar graph 
drawings with polylines for edges. 


y 


Figure 10.7.4: An RV layout of Kg and the subgraphs induced by its horizontal and 
vertical visibilities. 


FACTS 
F11: Every planar graph is an RVG [Wi89]. 
F12: The complete graph K,, is an RVG if and only if n < 8 [DeHu97, HuShVi99]. 


F13: Dean and Hutchinson [DeHu97] showed that for integers p < q the complete 
bipartite graph K,,, is an RVG if and only if p < 4, although Ks,, has thickness two 
for 5 < q < 12 [BeHaMo64]. From Fig. 10.7.3 it is clear how to lay out K4,, for q > 4. 
They show that K’5,5 is a weak RVG, and K,, is a noncollinear RVG if and only if p < 2 
or p= 3 and q < 4; hence the noncollinear RVGs form a strict subclass of the RVGs. 


F14: Bose et al. [BoDeHuSh97] and Bied] [Bill] proved that for k = 1,2 all partial 
k-trees are noncollinear RVGs. (Note that a 1-tree is a tree, and a 2-tree is also called 
a series-parallel graph.) For 1 < k < 4 every k-tree is an RVG [BoDeHuSh97]. Since 
5-trees can have thickness 3, 5-trees need not be RVGs. 
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F15: Every graph that is a union of two forests of caterpillars is a noncollinear RVG 
and consequently so are graphs with maximum degree 3 [BoDeHuSh97]. 


F16: A tree is a unit RVG if and only if it is the union of two forests of subdivided 
caterpillars, each having maximum degree 3 [DeElHaPa08]. Results on representing 
3-trees as unit RVGs are in [DeHul0]. 


F17: Every graph with maximum degree four is a weak RVG, and every graph whose 


vertices of degree four or more are mutually at distance at least two apart is a weak 
RVG [BoDeHuSh97]. 


F18: An RVG on n > 5 vertices has at most 6n — 20 edges, and for each n > 8 there 
is a (connected) RVG with 6n — 20 edges. There are thickness-2 graphs that are not 
RVGs (since a thickness-2 graph may have as many as 6n — 12 edges) [HuShVi99]. A 
bipartite RVG on n > 4 vertices has at most 4n — 12 edges [DeHu97]. 


F19: A unit RVG on n> 1 vertices has at most 6n — 4[,/n ] + 1 edges, and for each 
n > 64 there is a unit RVG with at least 6n — 12|,/n| +6 edges. A bipartite unit RVG 
with n > 7 vertices has at most 4n — 2[,/n | +5 edges [DeE]HaPa08}. 


F20: Mansfield [Ma83] proved that it is an NP-complete problem to determine whether 
a graph has thickness two; Shermer [Sh96a] proved the same for determining whether a 
graph is an RVG. 


REMARK 


R12: An RV layout has a natural orientation with each visibility directed in the positive 
x or y direction. An RVG inherits an orientation by directing each edge according to 
that of its visibility band, as do the two underlying BVGs of horizontal and vertical 
visibilities. 


DEFINITION 


D13: A planar, acyclic digraph is an st-graph if it has a single source s and a single 
sink f. 


FACTS 


F21: Streinu and Whitesides initiated a more detailed study of topological RVGs 
[StrWh03]. Let R be a set of interior-disjoint, axis-aligned rectangles in R? with visi- 
bilities directed as in Remark R12. To this add a frame of two wide rectangles N and 
S at the top and the bottom of the layout and two tall rectangles W and FE at the left 
and the right that span the layout of R and so that there is no N—E, E—S, S—W, or 
W —N visibility. The topological rectangle visibility graph of R or TRVG(R) isa 
pair (Dy, Dy) of digraphs, where Dy records the upwardly directed, cyclically ordered 
vertical visibilities of RU {S, N} including multiplicities, and similarly Dy records the 
left-to-right directed, cyclically ordered visibilities of RU{W, E}. A pseudo TRVG is 
a TRVG with digraphs (Dy, Dz) that are both directed, acyclic planar st-graphs whose 
underlying graph is 2-connected. In [StrWh03] they characterize TRVGs in terms of a 
pseudo TRVG (Dy, Dz) and conditions on paths within Dy and Dy. More precisely, 
given a pseudo TRVG (Dy, Dy), necessary and sufficient conditions are given for the 
existence of R for which TRVG(R) = (Dy, Dx). There is a quadratic time algorithm 
to recognize and lay out a TRVG specified by a set R and the related pair (Dy, Dy). 
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REMARKS 


R13: Since an RV layout induces two BV layouts of the horizontal and vertical visi- 
bilities, it induces a drawing of its RVG as the union of two planar graphs with iden- 
tical vertex locations and edges that are polylines, each with at most two bends; see 
Fig. 10.7.2. An RV layout and related polyline drawing is used in [Bill] to show that 
series-parallel graphs (or partial 2-trees) can be drawn in the plane in small area. For 
more information on orthogonal polyline graph drawing in the plane, see Section 10.3. 


R14: See also Remark R18 on 2-box or rectangle intersection graphs. 


RESEARCH PROBLEMS 


RP2: A graph G is doubly-linear [HuShVi99] (or equivalently G has geometric 
thickness two [DiEpHi98]) if it can be drawn as the union of two straight-edged planar 
graphs with identical vertex locations. All known examples of RVGs are doubly linear 
(as compared with Remark R13), and doubly-linear graphs have at most 6n — 18 edges 
[HuShVi99], higher than the bound of Fact F18. Is there a doubly-linear graph that is 
not an RVG? 


RP3: Is it an NP-complete problem to determine if a graph is an RVG with all 
rectangles squares (or unit squares)? (See also Fact F26.) 


The next research problems were raised by T. Shermer [Sh96b]. 


RP4: The noncollinear RVGs form a subclass of the strong RVGs—is it a strict 
subclass? (See also Fact F13.) Is there a containment relation between the classes of 
strong RVGs and RVGs, and if so, is it strict? 


RP5: Is there a graph that is the union of three caterpillar forests that is not an RVG? 


RP6: If G has maximum degree 4, is it an RVG? If it has maximum degree 5, is it a 
weak RVG? 


RP7: For n> 0, let c(n) be the maximum such that every graph with n vertices and 
crossing number at most c(n) is an RVG. What can be said about the function c(n)? 


RP8: How much information can be dropped from that given for a TRVG and still 
have a polynomial time recognition algorithm for that class of RVGs [StrWh03]? 


10.7.3 Visibility Representations in R® 


Many early papers from this and the previous subsection were derived from the 
McGill University Bellairs Research Institute’s Workshop on Visibility Representations, 
organized by S.H. Whitesides and J.P. Hutchinson in 1993. There the model of using 
axis-aligned rectangles in R?, placed parallel to the z = 0 plane, and with visibility 
parallel to the z-axis, was first studied. This is one of several models of visibility in R?. 

In this subsection through §10.7.6 visibility bands are always non-degenerate, i.e., 
they must have positive width. 
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DEFINITIONS 


D14: A 3D visibility representation of a graph G in R® is a layout of the vertices 
of G with disjoint, (a, y)-axis-aligned rectangles, lying perpendicular to the (vertical) z- 
axis. Edges are represented with vertical visibility so that two vertices in G are adjacent 
if and only if there is a closed cylinder of non-zero radius within an unobstructed vertical 
visibility corridor between the corresponding rectangles [BoEvFeLu94, BoEvFeHo98]. 
This visibility is also known as z-visibility, and the represented graphs are called VR- 
representable graphs and ZPR graphs [BoEvFeLu94, FeMe99]. 


D15: A graph has a weak 3D visibility representation if it is a subgraph of a graph 
with a 3D visibility representation. 


D16: A 3D visibility representation by unit squares, polygons, or unit discs 
is a 3D representation in which unit squares, polygons, or unit discs, respectively, are 
used instead of rectangles. 


D17: A class of objects is universal if for every graph there is a 3D visibility repre- 
sentation of the graph by objects from that class [AlGoWh98]. 


D18: The multipartite number of graphs with 3D visibility representations is the 
maximum integer & for which every complete k-partite graph has a 3D visibility rep- 
resentation. Equivalently the multipartite number is the maximum k for which every 
k-partite graph (or k-colorable) graph has a weak 3D visibility representation [Sto08]. 


EXAMPLE 


E5: Figure 10.7.5(a) shows a 3D layout of K¢ using unit squares; in the figure darker 
squares are placed behind lighter squares. In Figure 10.7.5(b) a translucent square is 
added on top to achieve a layout of K7. Both layouts are based on figures in [FeHoWh96]. 


Figure 10.7.5: 3D layouts of Kg and K7 using unit squares. 


FACTS 


F22: Every planar graph is 3D representable, and the set of 3D representable graphs 
is not closed under graph minors [BoEvFeLu94]. 


F23: K, is 3D representable for all n < 22 [BoEvFeHo98], and no K,, is 3D repre- 
sentable for n > 51 [FeHoWh96, Sto09]. 
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F24: K,,, is 3D representable for all positive p and g. Bounds are also known for 
tripartite graphs and for complete bipartite graphs minus a perfect matching. Every 
K,, is 3D representable by unit discs [BoEvFeLu94]. 


F25: The multipartite number for 3D representable graphs is 8; see §10.7.4 and [Sto08] 
for related definitions and results. 


F26: Fekete, Houle and Whitesides [FeHoWh96] prove that K,, is 3D representable by 
unit squares if and only if n < 7. They also show that it is an NP-complete problem to 
recognize graphs that have a 3D visibility representation by squares [FeHoWh97]. 


F27: Fekete and Meijer [FeMe99] give bounds on the sizes of possible complete graphs 
when the number of different sizes (congruent by translations) and shapes (congruent 
by translations and rotations) for 3D representation is considered. For example, if one 
shape of rectangle is allowed, Ky, n < 12, is 3D representable and no K,, with n > 14 is 
so representable. The same bounds (12 and 14) exist for 3D representations of kK, when 
two different sizes of rectangles are allowed. All bounds of 55 in Table 1 in [FeMe99] 
have now been reduced to 50 by results of Stola [Sto09]. 


F28: Alt, Godau and Whitesides [AlGoWh98] show that there is no k for which the 
class of polygons with at most k sides is universal. Every graph with n vertices has a 
3D representation by polygons, each having at most 2n sides, and there is a quadratic 
time algorithm for constructing such a representation from a graph. 


REMARKS 


R15: There is extensive literature on 3D visibility representations of complete graphs 
using regular k-gons; see, for example, [Stol0] where the biggest representable K,, is 
bounded by O(k*). 


R16: Cobos et al. [CoDaHuMa96] have initiated a study of graph representations using 
hyper-rectangles in R” with orthogonal visibility. Every graph can be so represented, 
and a fundamental problem is the determination of the minimum n which allows for 
such a representation for different families of graphs. 


RESEARCH PROBLEMS 


RP9: In [FeMe99] there are many instances of distinct upper and lower bounds for 
complete graphs representable by different shapes and sizes of rectangles, as in Fact 
F27. What is the best possible bound in each case? 


RP10: What is the best bound on the size of K,, that has a 3D representation by 
regular k-gons? 
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10.7.4 Box-Visibility Graphs 


Another natural model in R* represents the vertices of a graph by axis-parallel 
3-dimensional boxes (called boxes or 3-boxves) and edges by visibility in directions 
parallel to each of the three axes. 


DEFINITIONS 


D19: A bozx-visibility graph has vertices represented by boxes in R® and edges 
represented by unobstructed visibility through a positive-radius, axis-parallel cylinder 
[BoJoMiOR94]; such a representation is also called a box-visibility representation 
or BR [FeMe99]. 


D20: K,, is said to have a BR by unit cubes if it has a BR with each box a unit 
cube [FeMe99]. 


D21: The multipartite number for box-visibility graphs is the maximum integer k 
for which every complete k-partite graph is a box-visibility graph [Sto08). 


EXAMPLE 


E6: Figure 10.7.6, based on a figure in [FeMe99], shows a layout of Ag using unit cubes 
in R°. The layout is shown from three perspectives, with the second and third views 
obtained by rotating the preceding view 90° about the vertical axis, to aid in seeing the 
visibility cylinders between each pair of cubes. The white cubes at top and bottom see 
all the others vertically. 


_ || | 


Figure 10.7.6: Three views of a layout of Kg in R? using unit cubes. 


FACTS 


F29: Ky, n < 56, is a box-visibility graph; this bound with n < 42 was first shown in 
[BoJoMiOR94] and the better bound in [FeMe99]. There is no such representation of 
Ky, with n > 169. The latter bound with n > 184 was shown in [FeMe99]; the better 
bound follows from the work of [Sto09]. 


F30: K, has a BR by unit cubes for n < 8, and for n > 10 no K,, has such a box 
representation [FeMe99]. 


F31: The multipartite number & for box-visibility graphs is bounded by 22 < k < 42 
[Sto08}]. 
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REMARKS 


R17: Results of [FeMe99] are obtained by studying forbidden configurations in related 
partially ordered sets, and the authors obtain additional results by specifying shapes 
and sizes of representing boxes. Romanik [Rom97] also uses posets to determine that 
directed RVGs and digraphs with 3D visibility representations have infinite dimension. 


R18: Let ad-boz be a d-dimensional rectilinear box in R@ or equivalently the Cartesian 
product of closed line segments, one within each of the d axes [Th86]. The intersection 
graph of a set of d-boxes in R@ has a vertex for each d-box and an edge joining two 
vertices precisely when the corresponding d-boxes intersect. Intersection graphs of 1- 
boxes are interval graphs. Early work on d-box intersection graphs is found in [Rob69, 
Sc84]. For d = 2 these are also called rectangle intersection graphs and graphs with 
boxicity 2. Kratochvil [Kr94] proved that determining if a graph is a 2-box intersection 
graph is NP-complete. For d = 3 these graphs are also called box intersection graphs 
and graphs with boxicity 3. Thomassen [Th86] proved that every planar graph is 
a 3-box intersection graph. When vertices of graphs are represented by orthogonal 
polyhedra in R3, every K,, is the intersection graph of such polyhedra [Wh84]. 


RESEARCH PROBLEMS 


RP11: Obtain better or best possible bounds on the size of K, that is box- 
representable and also best bounds for representation by unit cubes. 


RP12: Determine the multipartite number of box-visibility graphs. 


10.7.5 Bar k-Visibility Graphs 


Since bar-visibility graphs are planar, they are limited in their applicability for 
modeling more complex systems, like multi-layer circuits, that may correspond to non- 
planar graphs. One way to broaden the visibility model is to permit the opacity of 
the bars to vary, in other words, to permit visibility through one or more bars. Bar k- 
visibility graphs, which permit visibility through & bars, were introduced by Dean et 
al. [DeEvGeLa07] and further studied by Hartke et al. [HaVaWe07] and by Felsner and 
Massow [FeMa08]. 


DEFINITION 


D22: Given a layout in the plane of disjoint horizontal line segments (called bars) and 
an integer k > 0, a k-visibility band between two bars is a non-degenerate vertical 
band between the two bars that passes through at most & intermediate bars. If a 
k-visibility band passes through no intermediate bars, then the two bars it joins have 
direct visibility; otherwise they have indirect visibility. A graph G is a bar k-visibility 
graph if G can be represented by such a layout, so that vertices correspond to bars 
and two vertices are adjacent if and only if there is a k-visibility band between the 
corresponding bars. A graph is a weak bar k-visibility graph if it is a subgraph of a 
bar k-visibility graph. 


REMARK 


R19: A bar-visibility graph is the same as a bar 0-visibility graph, and an interval 
graph with n vertices is the same as a bar k-visibility graph with k > n — 2. 
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EXAMPLE 


E7: Figure 10.7.7 shows a bar 1-visibility layout of the (non-planar) complete graph 
Kg. Note that, for example, bar 1 has direct visibility only to bars 2 and 8, while its 
visibilities to bars 3-7 are all indirect visibilities with intermediate bar 2 in each case. 


Figure 10.7.7: A bar 1-visibility layout of Kg. 


FACTS 


F32: A bar k-visibility graph with n > 2k+4+3 vertices has at most (k+1)(3n—4k-—6) 
edges [DeEvGeLa07, HaVaWe07], and this bound is tight. 


F33: Ky4z+4 is the largest complete bar k-visibility graph [DeEvGeLa07, HaVaWe07]; 
in particular, Kg is the largest complete bar 1-visibility graph (see Fig. 10.7.7). 

F34: The chromatic number x(G) of a bar k-visibility graph G is at most 6k + 6 
[DeEvGeLa07]. 

F35: If G is a bar 1-visibility graph, then its thickness O(G) is at most 4. If G is a 
bar k-visibility graph with k > 2, then O(G) < 2k(9k—1). There are thickness-2 graphs 
that are not weak bar 1-visibility graphs [DeEvGeLa07], but there are bar 1-visibility 
graphs with thickness 3 [FeMa08}. 


F36: For every k > 0 there are bar k-visibility graphs that are not bar (k+1)-visibility 
graphs, and there are bar (k + 1)-visibility graphs that are not bar k-visibility graphs 
[HaVaWe07]. 


F37: Ifa graph G contains a triangle-free, non-planar induced subgraph, then G is 
not a bar k-visibility graph for any & [HaVaWe07]. 


F38: If G is a connected, d-regular bar k-visibility graph with d < 2k + 2, then G is 
a complete graph [HaVaWe07]. 


RESEARCH PROBLEMS 


RP13: (See Fact F35.) What is the tight bound on the thickness of bar 1-visibility 
graphs? More generally, what is the tight bound on the thickness of bar k-visibility 
graphs? Is it a quadratic bound in k, like the one given in [DeEvGeLa07], or is it linear 
in k? 


RP14: Dean et al. [DeEvGeLa07] also give bounds on the chromatic number of bar 
k-visibility graphs, but they do not claim that these are tight bounds. What is the tight 
bound on the chromatic number of bar k-visibility graphs? 


RP15: Are there forbidden induced subgraphs for bar k-visibility graphs besides the 
triangle-free, nonplanar graphs [HaVaWe07]? 
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10.7.6 Bar Visibility Number 


Bar k-visibility graphs broaden the visibility model by permitting visibility to pass 
through one or more bars. Another way to extend the model is to permit vertices to 
be represented by more than one bar. This model was first introduced by Chang et 
al. [ChHuJaLe04]; results for directed graphs were introduced by Axenovich et al. in 
[AxBeHuWel1]. 


DEFINITION 


D23: For t > 1a t-bar representation (or t-bar layout) of a graph G is a bar- 
visibility layout in which each vertex of G corresponds to a maximum of t bars, and 
two vertices of G are adjacent if and only if there is an unobstructed, non-degenerate 
vertical visibility band between two bars that correspond to the two vertices. The bar 
visibility number b(G) is the minimum ¢ such that G has a t-bar representation. 


D24: Fort > 1 a (directed) t-bar representation of a digraph D is a t-bar visibility 
layout of the underlying undirected graph such that there is an arc from vertex u to 
vertex v if and only if the visibility band from uw to v is directed in the positive y direction; 
see Definition D6. The (directed) bar visibility number b(D) is the minimum t such 
that D has a directed t-bar representation. 


EXAMPLE 


E8: Figure 10.7.8 (modified from a figure in [DePhRal2]) shows a 2-bar layout of the 
complete graph Ky (which has thickness 3). The layout can be viewed as a directed 
2-bar layout of a tournament on nine vertices. The figure also demonstrates that Ko 
has a 2-bar layout using all bars of unit length. 


1 


Figure 10.7.8: A unit 2-bar visibility layout of Kg. 


REMARKS 
R20: A bar-visibility graph is the same as a 1-bar visibility graph. 


R21: For every pair of graphs G and H, b(GU H) < b(G) + b(A), and the same is 
true for directed bar visibility number. 


R22: If G is the underlying graph of a directed graph D, then b(G) < b(D). 
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FACTS 


F39: The bar visibility number of the complete graph K,, is [n/6] [ChHuJaLe04], 
and the bar visibility number of the complete bipartite graph K,,q is [S| [Ca06, 
ChHuJaLe04]. 


F40: If G has n vertices, then b(G) < [n/6] + 2 [ChHuJaLe04]. If D is a digraph 
with n vertices, then b(D) < (n+10)/3. If the underlying graph G of D has maximum 
degree A, then b(G) < b(D) < [(A(G) + 1)/2] [AxBeHuWel11]. 

F41: If G has n vertices and e edges, then b(G) > [$°], and if G is triangle-free, then 
b(G) > [$*) [ChHuJaLe04]. It follows that these lower bounds also hold for directed 
bar visibility number. 


F42: It follows from a result of Wismath [Wi85, Wi89] that if G is planar, then 
b(G) < 2 [ChHuJaLe04]. Furthermore, every planar graph has a 2-bar representation 
in which every vertex that is not a cut-vertex is assigned only one bar [ChHuJaLe04]. 


F43: If D is a planar digraph, then b(D) < 4. If D is also triangle-free, then 
b(D) < 3, and this bound is sharp. If D is outerplanar or has girth > 6, then b(D) < 2 
[AxBeHuWel1]. 


F44: If D is a transitive tournament on n vertices, then b(D) < 7n/24 + 2V/nlog n; 
for n sufficiently large, b(D) < 3n/14+ 42 [AxBeHuWel1]. 


RESEARCH PROBLEMS 


RP16: What is the bar visibility number of the d-dimensional hypercube Qa? Does 
it equal the lower bound [(n + 1)/4]? 


RP17: Is the directed bar visibility of a graph at most twice the bar visibility number 
of the underlying graph [AxBeHuWel1]? 


RP18: Is there a planar digraph D with b(D) = 4 [AxBeHuWel1]? 


RP19: If D is a transitive tournament on n vertices, what is the best upper bound 
on b(D)? Asymptotically what is the best bound on b(D)? 


10.7.7 Other Visibility Graphs 


We conclude with a brief mention of other types of visibility graphs (especially 
those with names similar to terms in this subsection) but do not include comprehensive 
references on fundamental and latest results. 

Three well-studied visibility graphs are defined below. More information can be 
found in O’Rourke [OR93] and in Chapter 28 of Goodwin and O’Rourke [GoOR04]. 


DEFINITIONS 


D25: The vertex visibility graph or polygon vertex visibility graph has vertices 
represented by the vertices of a simple polygon in the plane and edges by straight-line 
visibility within the polygon between polygon vertices. 
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D26: Given a set of disjoint segments in the plane, the endpoint visibility graph or 
segment endpoint visibility graph has vertices represented by the endpoints of the 
segments and edges by straight-line visibility between endpoints. 


D27: The segment visibility graph or (whole) segment visibility graph has 
vertices represented by disjoint line segments in the plane and edges by (any) straight- 
line visibility between corresponding segments. 


In the third definition, when the segments are constrained to be horizontal and 
visibility to be vertical, then we are in the case of bar- or strong visibility graphs. 

All three concepts are studied for graph characterization, recognition, and algorith- 
mic reconstruction. The chromatic number, clique number, Hamiltonicity, and size 
constraints of these graphs are studied, among other structural and algorithmic prop- 
erties. 


RESEARCH PROBLEMS 


RP20: Find necessary and sufficient conditions to determine when a graph is the 
vertex visibility graph of a polygon [DeOR11]. 


RP21: A collection of open problems from 2003 is available in [BrEpGoKo04]. 
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3D visibility representation of a graph G: a set of disjoint, (x, y)-axis aligned 
rectangles lying perpendicular to the z-axis in R*, each corresponding to a vertex of 
G, with two vertices adjacent if and only if there is unobstructed, vertical visibility 
through a non-degenerate cylinder between the corresponding rectangles. 

adjacency operator: the linear mapping A, from the set of all complex-valued func- 
tions on the vertex set of a graph to itself, so that the value of Af at a vertex v 
equals the sum of the values of f over all neighbors of v. 

ancestor — of a vertex y in a rooted tree: a vertex x that lies on the unique path from 
the root to y. 

, proper — of a vertex y in a rooted tree: an ancestor other than vertex y itself. 


angular resolution p-— in a polyline drawing: the smallest angle formed by two edges, 
or segments of edges, incident on the same vertex or bend. 

a-approximation algorithm — for minimization problems: a polynomial-time algo- 
rithm that is guaranteed to find a solution of size at most a times the minimum. 

area — of a drawing: the area of the convex hull of the drawing. 

articulation point: see cutpoint. 

aspect ratio — of a drawing: the ratio of the longest to the shortest side of the smallest 
rectangle with horizontal and vertical sides covering the drawing. 

back edge, — for a dfs-tree in a directed graph: an edge directed from descendant to 
ancestor. 

back edge, — for a dfs-tree in an undirected graph: any non-tree edge. 

backtracks: goes back and forth along an edge, as in a walk. 

bar k-visibility graph: a graph whose vertices can be represented by horizontal bars in 
the plane, with two vertices adjacent if and only if there is a non-degenerate vertical 
band joining the corresponding bars and passing through at most k intermediate 
bars. 

bar-visibility graph (or BVG): a graph whose vertices can be represented by hori- 
zontal bars in the plane, with two vertices adjacent if and only if there is an unob- 
structed, non-degenerate vertical band joining the corresponding bars. The graph is 
also known as an €-visibility graph. 

bar visibility number 6(G): the minimum t for which G has a t-bar representation. 

bend ~— in a polyline drawing: a point where two segments belonging to the same edge 
meet. 

bfs: abbreviation for breadth-first search. 

biconnected component — of an undirected graph: a maximal set of edges that has 
no cutpoint. 

biconnected graph: an undirected graph with no cutpoint. 

breadth-first search (bfs): a graph search method that finds shortest paths. 
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breadth-first tree: an ordered tree in which the children of a vertex x are the vertices 
discovered from x in a breadth-first search. 
bridge: an edge whose removal reduces the strength of connectedness between some 
pair of vertices. 
bridge, — in an undirected graph: a cutedge, i.e., an edge whose removal disconnects 
the graph. 
bridgez — in a mixed graph: an edge whose removal disconnects the underlying undi- 
rected graph. 
bridges — in a flow graph with root r: an edge (v,w) in a flow graph that belongs to 
every rw-path. 
bridgeless graph: an undirected graph with no bridges. 
bridge component (BC) — of a graph: a connected component of the graph that 
results when all the bridges are deleted. 
bridge tree: the tree formed by contracting every bridge component of a connected 
graph. 
bridgeless spanning subgraph, smallest: a bridgeless spanning subgraph of a con- 
nected bridgeless undirected graph that has the minimum possible number of edges. 
boundary: the set of edges connecting a given set of vertices to its complement. 
box-visibility graph: a graph whose vertices can be represented by axis-aligned, 3- 
dimensional boxes in R°, with two vertices adjacent if and only if there is an unob- 
structed, non-degenerate, axis-parallel cylinder joining the corresponding boxes. 
cardinality — of a cluster: the number of its vertices. 
Cartesian product, join and union: operations on fuzzy graphs. 
caterpillar: a tree containing a path with every vertex at distance at most one from 
the path. 
Cayley graph: a graph with vertex set G for a given group G and symmetric subset 
I of G, where x and y are adjacent iff y~tx €T. 
certificate — for a graph property P and a graph G: a graph G’ such that G has 
property P if and only if G’ has the property. 
chordal or triangulated graph: if each cycle with n > 4 vertices has a chord. 
chromatic index — of a graph: the minimum number of colors needed to color all edges 
such that edges with a common endpoint receive different colors. 
chromatic number — of a graph: the minimum number of colors needed to color all 
vertices such that adjacent vertices receive different colors. 
clique: a maximal set of vertices that are pairwise adjacent; sometimes maximality is 
not required. 
cliquewidth-k graph — defined recursively (with [k] denoting the set {1,2,...,k}): 
e Any graph G with V(G) = {v} and I(v) € [k] is a cliquewidth-k graph. 
e If G, and Gp are cliquewidth-k graphs and i, 7 € [k], then 
(1) the disjoint union G U G2 is a cliquewidth-k graph. 
(2) the graph (G1);x; is a cliquewidth-k graph, where (G1);x; is formed from G; by 
adding all edges (v1, v2) such that 1(v,) = 7% and I(v2) = j. 
(3) the graph (G1);,; is a cliquewidth-k graph, where (G1);-,; is formed from G by 


switching all vertices with label 7 to label 7. 
cluster — in dynamic graph algorithms: a connected subgraph, subject to various ad- 


ditional problem-specific conditions. 

clustering technique — used in the design of dynamic algorithms: a technique based 
on partitioning the graph into a suitable collection of connected subgraphs called 
clusters, such that each update involves only a small number of such clusters. 
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cograph — defined recursively: 

e A graph with a single vertex is a cograph. 

e If G, and Gp are cographs, then the the disjoint union G; U G2 is a cograph. 

e If G, and Gy are cographs, then the cross-product G x G2 is a cograph, which is 

formed by taking the union of G; and G2 and adding all edges (v1, v2) where v1 is in 

G, and v2 is in Go. 

collinear visibility layout: a bar-visibility (resp., rectangle-visibility) graph lay- 
out that contains two bars (resp., rectangles) having endpoints with the same x- 
coordinate (resp., having collinear sides). If a visibility layout contains no such pair 
of bars (resp., rectangles), it is noncollinear. 

complete: a fuzzy graph (ju, p) is complete if p(x, y) = u(x) A p(y). 

composition: if p is a fuzzy relation of S into T and o is a fuzzy relation of T into 
W, the composition poo is defined by po o(x,w) = V{p(z,y) Aoly,w) |y € T}. If 
f= pp Sips pp Sp ep, and pe SH Vie elem 1.28 

connected fuzzy graph: if there is a path between every pair of vertices. 

convex drawing: a planar straight-line drawing such that the boundary of each face 
is a convex polygon. 

cross edge — for a dfs-tree in a directed graph: a nontree edge joining two unrelated 
vertices. 

crossing: a point where two edges intersect. 

cutedge: an edge whose removal disconnects a graph. 

cutpoint: a vertex whose removal disconnects a graph. 

cutvertex: a vertex whose removal reduces the degree of connectedness between some 
other pair of vertices. 

cycle set, fuzzy cycle set, the cocycle set, and the fuzzy cocycle set: sets in a 
fuzzy graph that are not necessarily vector spaces over the integers modulo 2. 

dag: acronym for “directed acyclic graph,” i.e., a directed graph with no (directed) 
cycle. 

Delaunay drawable graph: a planar triangulated graph that admits a drawing that 
is a Delaunay triangulation. 

Delaunay triangulation: a planar straight-line drawing with all internal faces trian- 
gles, and such that three vertices form a face if and only if their convex hull does 
not contain any other vertex of the triangulation. 

depth-first search (dfs): a graph search method that iteratively scans an edge inci- 
dent to the most recently discovered vertex that still has unscanned edges. 

depth-first spanning forest; — in an undirected graph: a collection of depth-first 
trees, one for each connected component of the graph. 

depth-first spanning forest, — in a directed graph: a collection of depth-first trees 
containing every vertex once, with all cross edges joining two different trees directed 
from right to left. 

depth-first tree (dfs-tree) — in a graph: an ordered tree in which the children of a 
vertex x are the vertices discovered from x in a depth-first search. 

descendant — of a vertex y in a rooted tree: a vertex x such that y lies on the unique 
path from the root to x. 

___, proper — of a vertex y in a rooted tree: a descendant other than vertex y itself. 
dfs: abbreviation for depth-first search. 

diameter — of a graph: the maximum distance between two vertices. 

directed bar-visibility graph: a BVG representable such that for each arc (u,v) of 
the digraph, the visibility of u to v is in the positive y direction. 

discovery — of a vertex: when a search reaches that vertex for the first time. 
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discovery order — induced by a graph search algorithm: a numbering of the vertices 
from 1 to |V|, in the order the vertices are discovered in the search, i.e., the preorder 
of the search tree. 

distance — from vertex u to vertex v: length of a shortest uv-path. 

dominance drawing: an upward drawing of an acyclic digraph, such that there exists 
a directed path from vertex u to vertex v if and only if x(u) < x(v) and y(u) < y(v), 
where «(-) and y(-) denote the coordinates of a vertex. 

dominating set — of a graph: a set of vertices such that every vertex is either in this 
set or has a neighbor in this set. 

domination of a vertex in a flow graph: vertex v dominates vertex w 4 v if every 
rw-path contains v. 

dominator tree — for a flow graph: a tree that represents all the dominance relations. 

dynamic graph problem: a problem concerned with efficiently answering queries 
regarding whether a dynamic graph has the specified property. 

___, connectivity: answering a query whether the graph is connected, or whether 
two vertices are in the same connected component. 

___, decremental: a partially dynamic problem in which only deletions are allowed. 

__, fully: a problem in which the update operations include unrestricted insertions 
and deletions of edges or vertices. 

___, incremental: a partially dynamic problem in which only insertions are allowed. 

__, minimum spanning tree: the problem of maintaining a minimum spanning 
forest of a graph during insertions of edges, deletions of edges, and edge-cost 
changes. 

__, partially: a problem in which only one type of update, either insertions or 
deletions, is allowed. 

dynamic graph: a graph that is undergoing a sequence of updates; formally, a graph- 
valued variable. 

dynamic programming: evaluation of a recursive formula in such a way as to avoid 
repeated computations. 

k-ECSS: a k-edge-connected spanning subgraph of a k-edge-connected graph. 

__, smallest: a kK-ECSS that has the minimum possible number of edges. 

endpoint visibility graph: a graph whose vertices can be represented by the ver- 
tices of a simple polygon in the plane and edges by straight-line visibility between 
endpoints. 

ET tree — for a tree T: a dynamic balanced binary tree whose leaves are the sequence 
of vertex occurrences in an Euler tour of T’. 

Euler tour — of a tree T: in dynamic graph algorithms, a closed walk over the digraph 
obtained by replacing each edge of T' by two directed edges with opposite direction, 
such that the walk traverses each edge exactly once. 

expander family: a sequence of finite regular graphs, each with the same degree, 
so that the number of vertices goes to infinity but the isoperimetric constant is 
uniformly bounded away from zero. 

external degree — of a cluster: the number of tree edges. 

face — of a drawing: a region of the plane defined by a planar drawing. 

finish-time — at a vertex: when a search leaves that vertex for the last time. 

finish-time order — induced by a graph search: a numbering of the vertices from 1 to 
|V|, in the order they are finished in a search, i.e., the postorder of the search tree. 

flexibility — of a network: an application of fuzzy graphs is to model information 
networks. A measure of flexibility of a network is (m—n)/n(n — 2), where m is the 
numbers of edges and n is the number of nodes. 
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flow graph: a directed graph with a distinguished root vertex r that can reach every 
vertex. 

forest: a fuzzy graph if the graph consisting of its nonzero edges is a forest. 

forward edge — for a dfs-tree in a directed graph: a nontree edge from ancestor to 
descendant. 

fuzzy clique: a fuzzy subgraph K of a fuzzy graph G such that the level subgraphs K* 
induce a clique of G*. 

fuzzy cluster — of order k: a subset C of the set of vertices in a fuzzy graph (y, p) such 
that A{p*(z,y) | zy € C} > V{A{p*(w, z) | w € C} | z ¢ Ch. k-clusters are just 
ordinary cliques in graphs by thresholding the k-th 

fuzzy forest: a fuzzy graph (u, e) if it has a partial fuzzy spanning subgraph F = (1,7) 
which is a forest, where for all edges (x, y) such that r(x, y) = 0, p(x, y) < T*(a,y), 
ie., if there is a path in F' between x and y whose strength is greater than p(z, y). 

fuzzy graph: let V be a set, yu a fuzzy subset of V, and p a fuzzy relation on V. The 
pair (u, p) is a fuzzy graph if p(x,y) < w(x) A u(y). 

fuzzy intersection graph: Let G = (V,X) be a graph, where V = {v,...,un}. Let 
Si = {Up Aiso.149 Bigsby Where ayy © X. and aj, has v; asa vertex, J = 11+. 036 = 
ih, coe NK. Let S = {Si, cee Opts Let T = {(S,'S5) | SiS; € S, SNS; x 0,7 x jh. Then 
T(S) = (S,T) is an intersection graph and G ~ T(S). Any partial fuzzy subgraph 
(4, y) of Z(S) with Supp(7) = T is called a fuzzy intersection graph. 

fuzzy interval graph: the fuzzy intersection graph of a finite family of fuzzy intervals. 

fuzzy line graph: a partial fuzzy subgraph (A,w) of a fuzzy graph (4, p) such that for 
all S; € Z,A(Sz) = p(x) and for all (S;,Sy) € W,w(S2z, Sy) = p(x) A p(y), where 
Z = {{x}U{ug, vr) | & € X, Us, v2 € Vi = (Uz, vr) } and W = {(Sz, Sy) | SrA Sy F 
O,x,y € X,x A y} and where S, = {{x}U {uz, ve}, « € X. L(G) = (Z,W) is the 
intersection graph Z(X). 

fuzzy relation: a fuzzy subset of S x T, where S and T are sets. 

fuzzy set operations: basic set operations for fuzzy subsets of a set. 

fuzzy tree: a connected fuzzy forest. 

grid drawing: a polyline drawing such that vertices, crossings, and bends have integer 
coordinates. 

hamiltonian cycle — in a graph: a simple cycle that includes every vertex. 

hamiltonian path — in a graph: a simple path that includes every vertex. 

historical shortest path — in a dynamic graph: a path that has been a shortest path 
at some point during a sequence of updates of the graph, and such that none of its 
edges has been updated since then. 

hv-drawing: an upward orthogonal straight-line drawing of a binary tree such that the 
drawings of the subtrees of each node are separated by a horizontal or vertical line. 

Ihara zeta function: the product over all prime walks P modulo cyclic permutation 
of (1 — ylensth of alae where wu is an indeterminate. 

imbedded (di)graph drawing problem: a planar (di)graph with a prespecified 
topological imbedding (i.e., set of faces) that must be preserved in the drawing. 

immediate dominator — of a vertex w in a flow graph: the unique dominating vertex 
v such that every other dominator of w dominates v. 

independent set — in a graph: a set of vertices that are pairwise non-adjacent. 

interlacing segments — of a cycle C: two segments S,T such that either |V(S) M 
V(T) A V(C)| = 3, or there are four distinct vertices u,v,w,2, which occur in that 
cyclic order along C’, such that u,w € S and v,x € T. 

internal vertex — of path: a vertex that is not an endpoint of the path. 

in-tree: a tree representing paths of a graph that lead to a given vertex. 
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irreducible Markov chain: a Markov chain whose transition graph remains strongly 
connected after all null-probable edges are deleted. 

isoperimetric constant: the minimum ratio, over all subsets containing no more than 
half the vertices of a finite graph, of the number of edges in the boundary to the 
number of vertices in the set. 

Kazhdan constant: the best possible lower bound, given a finite group G and a subset 
T of G, for the distance a unit vector in a nontrivial irreducible unitary representation 
space of G is moved by some element of I. 

layer of vertices — in a rooted graph or digraph: a vertex subset comprising all the 
vertices at a given distance from the root. 

layered drawing: a drawing of a rooted graph such that vertices at the same distance 
from the root lie on the same horizontal line. 

left of a vertex y — in an ordered tree: a vertex x such that some common ancestor 
of x and y has children c and d, with c preceding d as siblings. 

length function — on a graph: an assignment of numerical lengths to the edges, usually 
nonnegative numbers. 

level set for a fuzzy subset ys of a set S and t € [0,1]: the set up’ = {x € S | u(x) > th, 
which is also called the t-level set of pu. 

locally historical path z if every proper subpath of 7 is a historical path. 

locally shortest path 7a if every proper subpath of 7 is a shortest path. 

matching, perfect matching — in a graph: a spanning subgraph such that every 
vertex has degree exactly 1. 

matching — in a graph: a set of edges that share no common endpoints. 

mixed graph: a graph in which directed and undirected edges both occur. 

monadic second-order logic (MSOL): a type of logic in which each variable may 
represent an individual element (vertex or edge) or a set of elements (vertex set or 
edge set); see Definition D10 of §10.4. 

__, MSOL/’ — for a graph G = (V, E): the subset of MSOL expressions restricted to 
variables v; with domain V, e; with domain E, and V; with domain 2”. 
multipartite number: the maximum k for which every complete k-partite graph has 
a 3D visibility representation. 

near-triangulation: a graph with a plane embedding having all finite, interior faces 
3-sided. 

nonseparable or a block: a fuzzy graph that has no cutvertices. A block may have 
bridges, but this is not the case in ordinary graphs. 

open ear decomposition — of an undirected graph: a partition of the edges into a 
simple cycle Pp and simple paths P,,..., P, such that for each i > 0, P; is joined to 
Pp and previous paths only at its (2 distinct) ends. 

ordered tree: a rooted tree in which the children of each vertex are linearly ordered. 

orientation — of a graph: assignment of a unique direction to each undirected edge. 

orthogonal drawing: a drawing in which each edge is a chain of horizontal and vertical 
segments. 

orthogonal representation: a representation of orthogonal drawing in terms of bends 
along each edge and angles around each vertex. 

out-tree: a tree representing paths of a graph that originate from a given vertex. 

partial fuzzy subgraph: a fuzzy graph (v,T) is a partial fuzzy subgraph of a fuzzy 
graph (wy, p) if vy C w and tT C p. 

path: a sequence of distinct vertices 20, 21,...,2n (except possibly xq and 2,,) with 
the edges (2;-1,2;) having positive weight. 

uv-path: a path starting at vertex u and ending at vertex v. 
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planar drawing: a drawing in which no two edges cross. 

polyline drawing: a drawing in which each edge is a polygonal chain. 

postorder — of an ordered tree: the finish-time order of a depth first search of the tree 
itself. 

potentially uniform path — of a graph G: a path such that every proper subpath is 
a historical shortest path. 

predicate calculus: a type of logic in which predicates have arguments and expressions 
are built using various operators (=, A, V), and in which the quantifiers (V, 5). 

preorder — of an ordered tree: the dikéovery order of a depth first search of the tree 
itself. 

prime closed walk: a closed walk that is backtrackless, tailless, and not expressable 
as a repeated concatenation of a closed walk with itself. 

proximity drawing: a drawing of a graph based on a geometric proximity relation. 

Ramanujan graph: a d-regular graph such that every non-trivial eigenvalue \ satisfies 
|A| < 2/d—1. 

recursively constructed graph class: defined by a set (usually finite) of primitive 
or base graphs, in addition to one or more operations that compose larger graphs 
from smaller subgraphs; each operation involves either fusing specific vertices from 
each subgraph or adding new edges between specific vertices from each subgraph. 

rectangle-visibility graph (or RVG): a graph whose vertices can be represented 
by axis-parallel rectangles in the plane, with two vertices adjacent if and only if 
there is an unobstructed, non-degenerate, horizontal or vertical band joining the 
corresponding rectangles. 

reducible flow graph: a flow graph that can be transformed into its root vertex r by 
a sequence of reduction operations; that is, if e is the only edge entering a vertex 
w #r, then contract edge e (and its other endpoint) into the vertex v. 

related vertices — in a rooted tree: two vertices such that one is an ancestor of the 
other. 

restricted multi-level partition: a partition that consists of a collection of restricted 
partitions satisfying the following: 


(1) The clusters at level 0 (known as basic clusters) contain one vertex each. 
(2) The clusters at level € > 1 form a restricted partition with respect to the tree 
obtained after shrinking all the clusters at level ¢— 1. 
(3) There is exactly one vertex cluster at the topmost level. 
restricted partition — of a tree T: a partition of its vertex set V into clusters of 
external degree < 3 and cardinality < 2 such that: 
(1) Each cluster of external degree 3 has cardinality 1. 
(2) Each cluster of external degree < 3 has cardinality at most 2. 
(3) No two adjacent clusters can be combined and still satisfy the above. 
scanning an edge: the work done by a graph searching algorithm when it traverses 
the edge. 
searching a graph: a methodical (linear-time) exploration of all the vertices and edges 
of a graph. 
segment — of a cycle C in a biconnected graph G: either (i) a chord of C (i.e., an edge 
not in C that joins two vertices of C’); or (ii) a connected component of the graph 
G —V(C), plus all the edges of G joining that component to C. 
segment visibility graph: a graph whose vertices can be represented by disjoint line 
segments in the plane and edges by straight-line visibility between corresponding 
segments. 
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semidominator: a useful intermediate concept in computing dominators, defined in 
terms of a depth-first search tree. 

separation pair: two vertices in a biconnected graph whose removal disconnects the 
graph. 

series-parallel graph — a recursively defined graph: see Definition D5 of §10.4. 

set-merging problem: the problem of maintaining a partition of a given universe 
subject to a sequence of union and find operations. 

shortest-path tree — in a rooted graph: a tree in which the path from the root r to 
any vertex v is a shortest ru-path. 

sink — in a directed graph: a vertex with outdegree 0. 

source ~— in a directed graph: a vertex with indegree 0. 

spanning fuzzy subgraph: a partial fuzzy subgraph (v,r) of a fuzzy graph (w, p) 
spans (1, p) if w= v. 

sparse graph: a graph with at most O(|V|) edges. 

sparsification technique: a technique for speeding up dynamic graph algorithms, 
which when applicable, transforms a time bound of T(|V], |E|) into O(T(|V], |V])). 

spectral gap: the distance between the degree and second-largest eigenvalue of a finite 
regular graph. 

start vertex: the distinguished vertex of a flow graph. 

straight-line drawing: a drawing in which each edge is a straight-line segment. 

strength — of a path: the weight of the weakest edge in the path. 

strength — of connectedness between vertices x,y: p™(x, y). 

strong component (SC) — of a directed graph: a maximal subgraph in which any 
two vertices are reachable from each other. 

strong component graph (SC graph) — for a directed graph: the result of contract- 
ing every strong component to a vertex; also called the condensation graph. 

strong visibility graph: a graph whose vertices can be represented by horizontal 
bars in the plane, with two vertices adjacent if and only if there is an unobstructed, 
vertical band, possibly with width zero, joining the corresponding bars. 

strongly connected digraph: a digraph in which every vertex can reach every other 
vertex by a directed path. 

support: for yz a fuzzy subset of a set S, the support of yw is the set Supp(w) = {x € 
S| w(x) > Of. 

t-bar representation — of a graph (resp., digraph): a bar-visibility representation in 
which each vertex corresponds to at most t bars, such that two vertices are adjacent 
if and only if there is an unobstructed, non-degenerate vertical band joining two bars 
corresponding to the two vertices (resp., such that there is an arc (u,v) if and only 
if there is such a band with the visibility of u to v in the positive y direction). 

tail: an edge traversed in both the first and last step of a closed walk. 

top tree — in dynamic graph algorithms: a tree that describes a hierarchical partition 
of the edges of another tree, well suited to maintaining path information. 

topological numbering (topological order, topological sort) — of an acyclic di- 
rected graph: assignment of an integer to each vertex so that each edge is directed 
from a lower number to a higher number. 

topology tree: in dynamic graph algorithms, a hierarchical representation of a tree T 
into clusters. 

tournament: a directed simple graph such that each pair of vertices is joined by exactly 
one edge. 

traversable mixed graph: a mixed graph in which every vertex can reach every other 
by a path with all its directed edges pointing in the forward direction. 
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tree: a connected graph with no cycles and sometimes with a designated root used to 
describe recursive constructions. 

__, drawable as a minimum spanning tree: a tree T such that there exists a set 
P of points (especially in R2 or R3) such that the minimum spanning tree of P 
(using Euclidean metric distances) is isomorphic to T. 

__, rooted, recursively defined: the graph whose only vertex is the root r (and 
no edges); or the result of joining the roots of two disjoint trees with a new edge. 

tree edge: edge of a spanning tree in a graph. 

treewidth — of a graph G: the minimum width taken over all tree-decompositions of 
G; measures how closely the graph resembles a tree. 

treewidth-k graph: a graph whose treewidth is no greater than k. 

triconnected graph: an undirected connected graph that remains connected whenever 
any two or fewer vertices are deleted. 

uniform path — of a graph G: a path such that every proper subpath is a shortest 
path. 

unit bar-visibility graph: a BVG representable with all bars having equal length. 

unit rectangle-visibility graph: an RVG represented by unit squares. 

universal class — of objects: a class for which every graph has a 3D visibility repre- 
sentation with the objects as vertices. 

update — on a graph: an operation that inserts or deletes edges or vertices of the graph 
or changes attributes associated with edges or vertices, such as cost or color. 

upward drawing: a drawing of a digraph such that each edge is monotonically non- 
decreasing in the vertical direction. 

upward planar digraph: a digraph that admits an upward planar drawing. 

vertex cover — of a graph: a set of vertices such that every edge is incident on at least 
one vertex in the set. 

vertex visibility graph: a graph whose vertices can be represented by the vertices of 
a simple polygon in the plane and edges by straight-line visibility within the polygon 
between polygon vertices. 

visibility drawing: a drawing of a graph based on a geometric visibility relation; e.g., 
the vertices might be drawn as horizontal segments, and the edges associated with 
vertically visible segments. 

Voronoi diagram: the dual graph of a Delaunay triangulation. 

weak visibility graph: a graph whose vertices can be represented by horizontal bars in 
the plane, so that if two vertices are adjacent then there is an unobstructed vertical 
band, possibly with width zero, joining the corresponding bars. 

well-behaved time bound T(n) if T(n/2) < cT(n) for some c < 1. 

zig-zag product: a certain graph product formed by two regular graphs X and Y 
where the degree of X equals the order of Y and the edges of X are labeled at each 
vertex by vertices of Y. 
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INTRODUCTION 


The Maximum Flow Problem is one of the basic problems in combinatorial opti- 
mization. It models a large variety of problems in a diverse set of application areas 
including data flowing through a communications network, power flowing through an 
electrical network, liquids flowing through pipes and parts flowing through a factory. 
It has also served as a prototypical problem in algorithm design, and many useful and 
powerful ideas were first introduced in the context of maximum flows. This section 
will describe maximum flows and some generalizations. The book by Ahuja, Magnanti 
and Orlin [AhMaOr93] is an excellent reference and includes a large number of appli- 
cations. Other texts and surveys with significant coverage of maximum flows include 
[CoCuPuSc98], [Ev79], [La76], [PaSt82], [Ta83], and [GoTaTa90]. 


11.1.1 The Basic Maximum Flow Problem 


Informally, in a maximum flow problem, we wish to send as much stuff as possible 
from one place in a network to another, while limiting the amount of stuff sent through 
an arc by the capacity of that arc. 


DEFINITIONS 


D1: Ans-t (flow) network G = (V, E,s,t, cap) is a directed graph with vertex-set V 
and arc-set E, two distinguished vertices, a source s and a sink t, and a nonnegative 
capacity function cap: E > N. We adopt the convention, without loss of generality, 
that if arc (v,w) € E then the reverse arc (w,v) ¢ E. (See Subsection 11.1.4.) 
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D2: A (feasible) flow is a function f : E > R which obeys three types of constraints: 


capacity constraints: f(v,w) < cap(v, w), for each arc (v,w) € E. 
conservation constraints: 
Uw )ee f(w,v) = Viwwyer f(v,w) for each vertex vu € V — {s,¢}. 

nonnegativity constraints: f(v,w) > 0, for each arc (v, w) € E. 
D3: The value of a flow f, denoted val(f) (or |f|), is the total flow into the sink, 
Le., 

val(f)= D7 f(v,t) 
(v,t)hEE 

D4: Inthe maximum flow problem, we are given a flow network G = (V, E, s,t, cap) 
and wish to find a flow f of maximum value. 


EXAMPLE 


E1: An example of a flow network appears in Figure 11.1.1. A maximum flow for the 
network appears in Figure 11.1.2. The numbers on the arcs are capacities; the flows 
appear in parentheses. It is straightforward to verify that the three properties of a flow 
are satisfied. 


Figure 11.1.2: A maximum flow of value 10. 


FACTS 


F1: The flow conservation properties imply that the value of a flow is also equal to 
total flow out of the source, i.e., 


val(f) = Se f(v,t) = S- f(s,v) 
(v,t)hEE (s,v)EE 


F2: Although a flow is defined as a real-valued function, an integer-valued maximum 
flow always exists (since the capacity function is integer-valued). 
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11.1.2 Minimum Cuts and Duality 


An important and dual concept related to maximum flows is that of minimum cuts. 


Cuts in a Network 


An s-t cut combines the concepts of partition-cut (86.4) and an (s|t)-disconnecting set 
of edges (§4.1). 


DEFINITIONS 


D5: Let G = (V, E,s,t, cap) be an s-t network, and let S and T form a partition of V 
such that source s € S and sink t € T. Then the set of all arcs that have one endpoint 
in set S' and the other endpoint in set T is called an s-t cut of network G and is denoted 
(S,T). An arc (v,w) is a forward arc of the cut if v € S and w € T, and (v,w) isa 
backward arc ifue TandweS. 


D6: The capacity of an s-t cut (S,T), denoted cap(S,T), is the sum of the capacities 
of the forward arcs of the cut in the forward direction, i.e., 


cap(S,T) = Ss cap(v, w) 


(v,w)E EVES weT 
D7: A minimum s-t cut in a flow network G is a cut of minimum value, that is, 


min{cap(S,T) : (S,T) is an s-t cut}. 


EXAMPLES 


E2: The s-t cut ({s,2,v}, {w,t}) = {(x,w), (v,t)} appears in Figure 11.1.3. The 
capacity of the cut cap(S,T) = cap(x,w) + cap(v,t) = 13. Notice that (w,v) is a 
backward arc and hence its capacity is not included in the capacity of the cut. 


Figure 11.1.3: The s-t cut ({s, 2, v}, {w,t}) = {(x, w), (v,t)} has capacity 13. 


E3: A minimum cut of capacity 10 appears in Figure 11.1.4. 
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Figure 11.1.4: A minimum cut of capacity 10. 


Weak Duality 


DEFINITION 


D8: Given a flow f and an s-t cut (S,T), the flow across cut (S,T), denoted f(S,T), 
is the sum of the flows on the forward arcs minus the sum of the flows on the backward 
arcs, 1.€., 

f(S,T)= S- f(v, w) — S- f(v, w). 


(v,w)EEwES wet (v,w)EEvET wes 


FACTS 


F3: Flow conservation implies that for a given flow f and any cut (S,T), val(f) = 
f(S,T). 


F4: (Weak Duality) Let f be any flow in an s-t network G, and let (S,T) be any s-t 
cut. Then 
val(f) < cap(S,T) 


F5: Let f be a flow in an s-t network G and (S,T) an s-t cut, and suppose that 
val(f) = cap(S,T). Then flow f is a maximum flow in network G, and (S,T) a minimum 
s-t cut. (This is an immediate consequence of weak duality [Fact F4].) 


EXAMPLE 


E4: The flow f (in parentheses) and s-t cut (S,T) shown in Figure 11.1.5 illustrate 
Fact F3. In particular, val(f) = 6, and the flow across the cut, f(S,T), equals f(x, w)+ 
f(v,t) — f(w,v) =2+5—-1=6. 


Figure 11.1.5: val(f) = f(S,T) = 


1382 Chapter 11. Networks and Flows 


11.1.3. Max-Flow Min-Cut Theorem 


The relationship between the maximum-flow problem and its dual, the minimum- 
cut problem, is an example of strong maz-min duality that occurs between certain opti- 
mization problems and their dual problems. Two other instances are Konig’s theorem 
(§11.3), which states that the size of a minimum vertex cover in a bipartite graph equals 
the size of a maximum matching, and Menger’s theorem (§4.1), which relates the local 
connectivity between two vertices of any graph and the number of internally disjoint 
paths between them. 


The Residual Network and Flow-Augmenting Paths 


Algorithms that iteratively increase the flow in a network via flow-augmenting paths 
often use an associated digraph, called a residual network, for finding these paths more 
easily. 


DEFINITIONS 


D9: Let f be the current flow in a network G = (V, E,s,t,cap). An arc (v,w) € EF is 
increasable if f(v,w) < cap(v, w) and is reducible if f(v,w) > 0. 


NOTATION: Let J denote the set of all increasable arcs, and let R be the set of all 
reducible arcs. (Of course, in general, 19 R 4 0.) 


D10: Given a flow f in a network G = (V,E,s,t,cap), the residual network G; = 
(V, Ey, s,t, rp) has vertex-set V, and the arc-set Ey is constructed from network G as 
follows: for each arc (v,w) € E, if arc (v,w) € I, then create an arc (v,w) in Gy, and 
label it with a residual capacity rs(v,w) = cap(v, w) — f(v, w); if arc (v,w) € R, then 
create an arc (w,v) in Gy, and label it re(w,v) = f(v,w). 


D11: Given a flow f in a network G = (V,E,s,t,cap), a flow-augmenting path 
P for network G is a directed s-t path in the residual network Gy. The capacity of 
flow-augmenting path P, denoted Ap, is given by Ap = a rp(v,w). 

v,w)cEy¢ 


REMARK 


R1: It follows from the definitions that the capacity Ap is always positive. 


FACTS 


F6: [Flow Augmentation] Let f be a flow in a network G = (V, E,s,t, cap), and let P 
be a flow-augmenting path with capacity Ap. Let fp be defined as follows: 
f(v,w)+Ap_ if (v,w)e E(P)NI 
fre(v,w) = 4 f(v,w)— Ap if (w,v) € E(P) and (v,w) € R 
f(v,w) otherwise 
Then fp is a feasible flow in network G and val(fp) = val(f) + Ap. 


F7: [Characterization of Maximum Flow] Let f be flow in a network G. Then f is a 
maximum flow if and only if there does not exist an f-augmenting path in G. 
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F8: Max-Flow Min-Cut [FuDa55, ElFeSh56, FoFu56] For a given network, the value 
of a maximum flow is equal to the capacity of a minimum cut. 


EXAMPLES 


E5: The current flow f in the network G shown at the top in Figure 11.1.6 has value 
9. The corresponding residual network G'y is shown at the bottom. 


Figure 11.1.6: A network G with flow f and its residual network G',. 


There are exactly two directed paths from s to t in the residual network, each corre- 
sponding to a different flow-augmenting path for increasing the flow. One of them is the 
directed path s + v > w — t in the network G;. The arc (v, w) in path P corresponds 
to the reducible arc (w,v) in network G. Notice that Ap = 1. 


E6: If the flow f given in Figure 11.1.6 above is augmented by the flow-augmenting 
path P in Example E5, then the resulting flow is as shown in Figure 11.1.7. Observe 
that the one reducible arc, (w,v), of the flow-augmenting path P reduces the flow on 
that arc by 1. 


Figure 11.1.7: Increasing the flow in Example E5 by 1. 


A simple application of Fact F5 shows that fp is a maximum flow. Also straightforward 
to show is the non-existence of a flow-augmenting path corresponding to fp. 
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11.1.4 Algorithms for Maximum Flow 


There are three popular approaches to compute a maximum flow in polynomial time. 
The first uses flow-augmenting paths, the second uses a push-relabel method, and the 
third uses linear programming. We discuss the first two methods briefly. 


Ford—Fulkerson Algorithm 


The first published maximum-flow algorithm, due to Ford and Fulkerson [FoFu62], is 
essentially a greedy method: we iteratively push flow along flow-augmenting paths from 
source to sink. 


Algorithm 11.1.1: Ford—Fulkerson Algorithm 


Input: a flow network G = (V, E, s,t, cap) 
Output: a maximum flow f 
Initialize f(v,w) = 0 for all (v,w) € E. 
Calculate residual network Gp. 
While an augmenting path in Gf exists 
Let P be an augmenting path in Gy with capacity Ap. 
Obtain increased flow fp using flow-augmenting path P (Fact F6). 
fate 
Update residual network G's 


COMPUTATIONAL NOTE: The efficiency of the Ford—Fulkerson algorithm depends on 
which augmenting path is chosen, and on the data structures used to facilitate the 
computation. If an arbitrary augmenting path is chosen, the algorithm may not run in 
polynomial time [Za72]. However, many natural choices of a path lead to a polynomial 
time algorithm. The first such algorithm, due to Edmonds and Karp [EdKa72], and 
Dinic [Di70], always chooses the shortest augmenting path, where the length of a path is 
defined to be the number of arcs on the path. This algorithm runs in O(|E||V]) iterations 
of the main loop. Each iteration requires a breadth-first search and some updates of 
flow variables, which can be done in O(|E|) time, and hence the algorithm runs in 
O(|E|?|V]|) time. Many further improvements are possible with data structures such as 
dynamic trees [S]Ta83] or by augmenting flow on several shortest paths simultaneously 
[Ka74]. (These are known as blocking flows.) The current fastest running time using 
this approach is due to Goldberg and Rao [GoRa98]. 


Preflow-Push Algorithms 


An alternative approach to computing a maximum flow, called a preflow-push algorithm, 
was introduced by Goldberg [Go87] and Goldberg and Tarjan [GoTa88]. It uses a push- 
relabel strategy, which pushes flow over individual arcs, rather than paths. To do so, 
it allows flow to “accumulate” at some vertices, creating an excess at those vertices. 
The push operation selects one of these active vertices and tries to remove its excess by 
pushing flow to neighbors that are“closer” to the sink. The relabel operation maintains 
distance labels that help keep track of these neighbors. 
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DEFINITIONS 


D12: A preftow is a relaxed version of a flow, a function f : E + R* which obeys 
three types of constraints: 
capacity constraints: f(v,w) < cap(v, w), for each arc (v,w) € E. 
relaxed conservation constraints: 
VMwwert(W,v) — Vw wen f(v,w) 2 0 for each vertex v € V — {s, t}. 


nonnegativity constraints: f(v,w) > 0, for each arc (v,w) € E. 


D13: Let f be a preflow in an s-t network G = (V, FE, s,t, cap). The excess at vertex 
v, denoted e(v), is given by e(v) = Viweyen f(W,¥) — Lawyer f(v,w). A vertex v 
with e(v) > 0 is called an active vertex. 


TERMINOLOGY: Given a preflow f, the sets of increasable arcs, I, and reducible arcs, 
R, are defined as in the case when f is a feasible flow (Definition D9). Similarly, the 
residual network G;, = (V, Ey, s,t,ry) is defined in the same way as before (Definition 
D10). 


D14: Let f bea preflow in an s-t network G = (V, E,s,t,cap). A distance function 
h:V — N satisfies: 

h(s) = |V|, h(t) =0, and h(v) < h(w) +1 
for each arc (v, w) € Ey in the residual graph Gf. 


D15: For a given preflow f and distance function h, an arc (v,w) in the residual 
network Gy is admissible if h(v) = h(w) +1. 


Algorithm 11.1.2: Preflow-Push Algorithm 


Input: a flow network G = (V, E, s,t, cap) 
Output: a maximum flow f 
{Initialization} 
f(v,w) := 0 for all (v,w) € E. 
h(v) := 0 for allu EV 
h(s) := |V] 
f(s,w) := cap(s,w) for all arcs (s,w) € E. 
Compute residual network G'y and excesses e. 
While there are active vertices 
Select an active vertex v. 
If Gy contains an admissible arc (v, w) 
{Push flow on arc (v, w).} 
A := min{e(v), r¢(v, w)} 
e(v) := e(v) —A 
e(w) := e(w) +A 
If arc (v,w) ET 
f(v,w) := fvu,w) +A 
Else 
f(v,w) := f(v,w)-A 
Else {Relabel v.} 
h(v) := min{h(w) : (v,w) € Ey and r¢(v, w) > OF 
Update residual network G'y. 
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COMPUTATIONAL NOTE: The preflow-push algorithm terminates in O(|V|?|E|) time 
using simple data structures. More careful selection of operations and careful use of data 
structures leads to algorithms with running times of O(|V|°) or O(|V||E| log(|V|?/|E])) 
[GoTa88]. The fastest preflow-push algorithm is slightly faster than these and is due to 
King, Rao and Tarjan [KiRaTa94]. 


COMPUTATIONAL NOTE: In practice a good implementation of a preflow-push algo- 
rithm seems to be faster than a good implementation of an augmenting path algorithm. 
Two heuristics are essential ingredients in implementing a push relabel algorithm well. 
First, a backwards breadth-first search of the residual graph is performed periodically, 
in order to update distance labels. Second, the gap heuristic, which quickly identifies 
vertices that must be on the sink side of the minimum s-t cut, is employed [ChGo97]. 


11.1.5 Variants and Extensions of Maximum Flow 


We briefly mention some variations and extensions of the basic maximum-flow prob- 
lem. For more extensive coverage, see, e.g., [AhMaOr93, EvMi92). 


FACTS 


F9: The convention that if arc (v,w) € E then the reverse arc (w,v) ¢ E is without 
loss of generality. Any maximum flow algorithm can easily be extended to handle this 
case. Alternatively, arc (v,w) can be converted to two arcs (v,x) and (x, w) each with 
capacity cap(v,w), and arc (w,v) can be converted to two arcs (w,y) and (y,v) each 
with capacity cap(w, v). 


F10: [multiple-source multiple-sink] Suppose that we have a flow network G with mul- 
tiple sources {s1,..., 6} and multiple sinks, {t),...,t¢}. We can still find a maximum 
flow in this network by the following transformation. We create a new supersource s 
and supersink ¢, and add an arc (s,s;) with cap(s,s;) = co for each source s;, and an 
are (t;,t) with cap(t;,t) = co for each sink t;. A maximum flow in this network is easily 
interpreted as a maximum flow for the multiple-source multiple-sink problem. 


F11: Maximum flow can be used to find a maximum matching in a bipartite graph. 
(Matchings are discussed in §11.3.) 


F12: If each arc has a cost, then we obtain the minimum-cost flow problem (see §11.2). 


F13: [flow on undirected edges] The network G can contain undirected as well as 
directed edges. An undirected edge (v,w) with capacity cap(v, w) is understood to be 
an edge that can carry up to cap(v, w) units of flow in either direction. In any flow, it 
is only necessary that flow be carried in one direction or the other, as flows in opposing 
directions cancel each other out. For example, an undirected edge with flow f(v, w) = 3 
and f(w,v) = 2 is equivalent to an edge with f(v,w) = 1 and no flow in the (w,v) 
direction. We can therefore convert an undirected edge to two oppositely directed arcs, 
each with capacity cap(v, w). 


F14: [lower bounds on flow] The network G can also contain lower bounds on the flow 
over an arc. It is still possible to find a maximum flow in such a graph, providing that 
one exists (see, e.g., [AhMaOr93, EvMi92}). 
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Multicommodity Flow 


Perhaps the most important extension of maximum flow is the extension to the case of 
multiple commodities. 


DEFINITIONS 


D16: A commodity i is a triple (s;,t;,d;) where s is a source, t is a sink, and d is a 
demand, or amount of flow to be routed. 


D17: A multicommodity flow network G = (V,E, K, cap) is a directed graph with 
vertex set V and arc-set FE’, commodity set K, and a nonnegative capacity function 
cap: FE + N. We adopt the convention that if arc (v,w) € E then the reverse arc 
(w,v) ¢ E. The commodities are indexed by the integers 1,2,...,k. 


D18: A multicommodity flow in a multicommodity flow network G = (V, E, K, cap) 
is a set of k =|K| functions f; : E > R* satisfying the following conditions: 


joint capacity constraints: ae filv, w) < cap(v,w), for each arc (v, w) € E. 

conservation constraints: 

VUww)ee filw,v) = Vw wyer fi(v, w) for each v € V— {s;,t)} andi=1,...,k, and 
V(w,ter Jil, ts) — Vee wen Siti, w) = di for i =1,...,k. 


nonnegativity constraints: f;(v,w) > 0, for each arc (v,w) € FE andi=1,...,k. 


Variants of Multicommodity Flow Problems 


DEFINITIONS 


D19: In the feasible multicommodity flow problem, we are given a multicommod- 
ity flow network G, and wish to know if a multicommodity flow exists. We call such a 
flow a feasible multicommodity flow. 


D20: In the concurrent flow problem, we are given a multicommodity flow net- 
work G, and we wish to compute the maximum value z for which there is a feasible 
multicommodity flow in the network with all demands multiplied by z. 


D21: Inthe mazimum multicommodity flow problem, we are given a multicom- 
modity flow network, except that for each commodity, we are not given a demand. 
We wish to find, for each commodity i, a flow f; of value (demand) val(f;) such that 
yy val(f;) is maximized. 


COMPUTATIONAL NOTE: If we do not require that the flows be integral (even though 
capacities and demands are integral), then all the above multicommodity flow problems 
can be solved in polynomial time via linear programming. More efficient combinatorial 
algorithms that compute approximately optimal solutions also exist [LeMaP1StTaTr95, 
Yo95, and Gako98]. 


COMPUTATIONAL NOTE: If we require that all flows be integral, then all the above 
problems are NP-hard. The degree to which we can approximate them varies from 
problem to problem. (See [Va01] for a survey.) Note that integral multicommodity flow 
generalizes the disjoint-paths problem. 
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D22: In an unsplittable flow problem, we have the additional restriction that each 
commodity must be routed on one path. All variants of this problem are NP-hard, but 


constant factor approximation algorithms exist for single-source multiple-sink variants 
[K196, KoSt97, DiGaGo98]. 
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INTRODUCTION 

Minimum cost flows are a powerful and useful network flow model that is distin- 
guished by supply nodes, demand nodes, and linear flow costs on the edges of a di- 
rected network. They are used to model complex problems occurring in transportation, 
transshipment, manufacturing, telecommunications, graph drawing, human resources, 
statistics, numerical algebra, physics, and many other engineering disciplines. Minimum 
cost flows generalize many other network problems. They lie on the tractable side of 
a boundary between computable and intractable problems: while there exist efficient 
algorithms to find integer minimum cost flows, most generalizations of integer minimum 
cost flow problems are NP-hard. 

The book Network Flows by Ahuja, Magnanti, and Orlin provides thorough coverage 
of minimum cost flows, applications, and related topics [AhMaOr93]. Other recent texts 
and surveys with significant coverage of minimum cost flows include [CoCuPuSc98, 
GoTaTa90, IwMcSh00, Sc03}. 


11.2.1 The Basic Model and Definitions 


DEFINITIONS 


D1: A (standard) flow network G = (V, A, cap,c, b) is a directed graph with vertex- 
set V, arc-set A, a nonnegative capacity function cap: A + N, a linear cost function 
c: A Z, and an integral supply vector b: V > Z that satisfies 7 ,<y b(w) = 0. 
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D2: Ans-t flow network (or single-source single-sink network) is a flow network 
G = (V, A, cap,c,b) that contains two distinguished vertices s and t such that b(v) = 0 
for all v € V — {s,t} and b(s) = —b(t) > 0. 


D3: An extended flow network G’ = (V’,A’,cap’) of G = (V,A, cap, c, b) is an s-t 
network with vertex-set V’ = VU{s,t}, arce-set A’ = AU{(s, v)|b(v) > O}U{(w, t)|b(w) < 
0} and capacity function cap’ defined by 


cap(v,w), if (v,w) eA 


cap'(v,w) = § b(v), ifv=s 
—b(v), ifw=t 


D4: A transshipment network is a flow network in which all arcs have infinite 
capacity. 


D5: A (standard) flow (also called a (standard) feasible flow) is a function 
f:A-— Z that satisfies 


capacity constraints: f(v,w) < cap(v,w) for all (v,w) € A, 
nonnegativity constraints: f(v,w) > 0 for all (v,w) € A, 


flow conservation constraints: )~|f(v,w) — f(w,v)] = b(v) for each v € V. 


D6: A flow f in an s-t flow network G = (V, A, cap,c,b) is called an s-t flow and is 
said to have volume 0b(s). 


D7: A minimum cost flow is a flow f with minimum cf value among all flows. 


D8: A circulation is a flow for the supply vector b= 0. A minimum cost circu- 
lation is a circulation f with minimum c7? f value among all circulations. 


NOTATION: Sometimes we use the subscript notation fuy, Cuv, C@Puy or by instead of 
f(u,v), c(u,v), cap(u,v), or b(v), respectively. 


REMARK 


R1: In the definitions above, we can extend the definition of the capacity function cap 
to all ordered pairs in V x V by defining cap(v, w) = 0 for (v, w) ¢ A. This has the effect 
of extending any flow function f to all such ordered pairs as well. This extended view of 
the capacity and flow functions is notationally convenient for interpreting expressions 
like the flow conservation constraints given above and for various other expressions 
appearing later in this section. 


ASSUMPTIONS 
Al: G has no parallel arcs and no oppositely directed pairs of arcs. 


A2: No arc in G has negative cost. 
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EXAMPLE 


E1: An example of a flow network appears on the left in Figure 11.2.1. The supply at 
each node is indicated in brackets. In parentheses at each arc is the arc cost followed 
by the arc capacity. A feasible flow in this network is indicated by the italic numerals 
in the figure on the right. 


(2.3) 
[516 © 3} pie © (-3] 


Figure 11.2.1: An example of a flow network and a feasible flow. 


NOTATION: The following notation is used throughout this section: 
(a) n =|V|andm=|A| (number of vertices and arcs, respectively, in the network). 


(b) C= max \c(e)| (largest arc cost). 

(c) U= max{cap(e)|cap(e) <oo} (largest finite capacity). 
e€ 

(d) B= max |b(v)| (largest magnitude of a supply [or demand]). 
ve 


(e) For an arc subset F C A, c(F’) = SS} c(e). 
ecF 


(f) M = Xele)cap(e). 


(g) S(n,m,C) denotes the time complexity of computing single source shortest paths 
from a fixed vertex in a directed network with n vertices, m arcs, and arc lengths 
(costs) bounded by C. 


FACTS 
F1: The quantity M is an upper bound on the cost of a flow. 


F2: Assumption 1 is without loss of generality and is made for notational convenience 
only. Alternatively, we can remove a parallel or opposite arc (v,w) by introducing a 
new node z and replacing (v,w) with (v,z) and (z,w), where (v, z) has the same cost 
and capacity as (v,w) and (z,w) has 0 cost and infinite capacity. 


F3: Assumption 2 is without loss of generality. If arc (v, w) € A has negative cost, it can 
be removed by saturating the arc: modify functions cap, c and b by setting cap(w, v) = 
cap(v,w), c(w,v) = —c(v,w), b(v) = b(v) — cap(v, w), b(w) = b(w) + cap(v, w). If the 
resulting problem yields a flow with a value of f’(w,v) on arc (w,v), then the value of 
f(v, w) = cap(v, w) — f’(w, v). 
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F4: A minimum cost flow may also be defined in an undirected graph. By replacing 
each undirected arc by two oppositely directed arcs of the same cost and capacity, the 
undirected problem may be solved in the directed graph. If f’ is the flow in the directed 
graph, the flow f in the undirected graph is obtained as follows: for all ordered pairs 
(v,w), 


f(v,w) = max{0, f’(v, w) — f’(w,v)}. 


F5: A (feasible) flow in flow network G can be found by finding a maximum s-t flow 
in the extended flow network G’. (See the multiple-source multiple-sink extension in 
Subsection 11.1.5.) 


F6: The minimum cost flow problem in G is equivalent to the minimum cost circulation 
problem in the network obtained by adding an infinite capacity arc (t, s) to G’ with cost 
—mC. 


F7: The following graph optimization problems are all special cases of minimum cost 
flows. The directed versions are described, but the undirected versions can be treated 
similarly. 


Mazimum flows: for b, = 8 = —b; and b, = 0 for v € V\{s,t}, find a flow that 
maximizes 6. Using 


b=0, c=0, A+ AU {(t,s)}, Gs =-—1, and can, =~, 


a minimum cost circulation in G = (V,A,cap,c,b) is a maximum flow in the original 
network. (See §11.1.) 


Single-source shortest paths: for a given vertex s, find the shortest path using arc 
lengths | from s to every other node. Using 


b,=|V|/—1, by =—1, cap=oo, and c=l, 


a minimum cost flow in this graph describes a shortest path tree by taking all arcs with 
positive flow. (See §10.1.) 


k arc-disjoint s-t paths: given vertices s and t, arc cost function c, and an integer k, 
find k paths from s to t such that no two share an arc. Using 


b, =k=—-h, by =0, ve V\{s,t}, cap=1, and cas given, 


the minimum cost flow yields a solution to this problem by taking all arcs with positive 
flow. (See §4.1, Menger’s theorems.) 


Mazimum-weight bipartite matching: given a bipartite graph G = (V; U Va, A) 
with arc weights w, find a maximum weight subset of arcs such that no two share an 
end point. Using 

by =1, Vue Vy, b =-1, Vue Vo, c=-—w, and cap=1, 


a minimum cost flow yields a maximum weight matching by taking all arcs with positive 
flow. (See §11.3.) 


Minimum cost transshipment: a minimum cost flow in a transshipment network. 
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Residual Networks 


DEFINITIONS 
D9: Given a flow f in a network G, let 
F={eceAlf. < cape} and B= {ee Alf. > O}. 


Note that F and B may intersect. The residual network of G (with respect to) f, 
denoted Gf, is the network that contains for each arc e = (v,w) € F a forward arc 
(v,w) with residual capacity cap. — fe and cost c., and for each arc e = (v,w) € Ba 
backward arc (w,v) with residual capacity f, and cost —c.. Denote the arc-set of G'y 


D10: A set S C Ay, of arcs is augmenting if each arc in S has positive residual 
capacity. An augmenting path is a path whose set of arcs is augmenting. 


D11: A flow f in a network G is augmented by u on arc set S C Af as follows: for each 
forward arc (v,w) € S, set f(v,w) = f(v,w) +u, and for each backward arc (v, w) € S, 


set f(w,v) = f(w,v) — u. 


D12: An arc set S C A; is saturated if f is augmented on S by a quantity equal to 
the minimum capacity of an arc in S. 


FACT 
F8: Let f be a flow in G and f’ be a circulation in Gy. Then f + f’ defined as 
(f+ f')(v, w) = fv,w) + fv, w) — fi(w,r) 


is a flow in G. 


EXAMPLE 


E2: Figure 11.2.2 shows the residual network of the flow appearing in Figure 11.2.1. 


(2,1) 
(2,1) (-3,2) 


(4,3) 


Figure 11.2.2: The residual network of the flow in Figure 11.2.1. 
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11.2.2 Optimality Conditions 


DEFINITION 


D13: Given a labeling 7: V — Z, the reduced cost vector c” is defined as cj, := 
Cow + Ty — Ty 


FACTS 
F9: If f is a minimum cost flow, then Gy has no negative cost cycles. 


F10: If there exists a minimum cost flow, then there exists one such that the set of 
arcs in F') B forms a forest. 


F11: If Gy, has no negative cost cycles, then any 7: V + Z obtained as shortest path 
distance labels from a selected vertex r by applying Dijkstra’s algorithm (see §10.1) 
satisfies cf, > 0 for all (v,w) € Gy. 


F12: For any flow f and labeling 7, we have 
(i) c™(C) = c(C) for any cycle C in G. 
(ii) c"(P) =c(P) +7, — 7 for any s tot path P in G. 
(iii) oow Cowlow = Vow Cowsvw + iy Tubv. 
Thus f minimizes 7, ,, Cowfvw if and only if f minimizes >”, ,, Chu fow- 


F13: (Reduced-cost optimality conditions) If there is a labeling 7 : V > Z such that 
Cow = 9 for all (v,w) € Gy, then f is a minimum cost flow. 


F14: (Complementary slackness) If there is a labeling 7: V > Z such that 
Cow <9 => fow = COPow 
Cow 70 => fow =0 
Chip =0 +0< fow S Capyw 


then f is a minimum cost flow. In this case, f and 7 are complementary. 


REMARKS 


R2: There is an economic interpretation of the reduced-cost optimality conditions: if 
c(v, w) is the cost of shipping one unit from v to w and 7, is the sales price of one unit 
at v, then the reduced cost for (v,w) is the cost associated with buying an item at v, 
shipping it to w, and selling it at w. If this is negative, it is worth doing, and thus the 
arc should be saturated. If it is positive, it is not worth doing, and thus the arc should 
not carry flow. 


TERMINOLOGY: Consistent with the economic interpretation of Fact F13, the vertex 
labels 7 are often called dual prices. 


R3: Fact F14 is a restatement of Fact F13. 
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EXAMPLE 


E3: Figure 11.2.3 highlights a negative cost cycle in the residual network appearing in 
Figure 11.2.2. On the left of Figure 11.2.4 is the new flow obtained by saturating this 
cycle, and on the right is the new residual graph with node labels in italics. The node 
labels and the residual network together show that the flow on the left is a minimum 
cost flow in the network in Figure 11.2.1. 


[5](b 


Figure 11.2.4: A minimum cost flow, residual network, and node labels. 


A Basic Cycle-Canceling Algorithm 


The correctness of the following simple algorithm for finding minimum cost flows rests 
on Facts F9, F11, and F13. 


Algorithm 11.2.1: Basic Cycle Canceling 


Input: a flow network G. 
Output: a minimum cost flow f 


Find a feasible flow f and compute the residual network Gy. 
While Gy contains a negative cost cycle 
Find a negative cost cycle C. 
Augment flow f by saturating the arc-set of cycle C. 
Compute G'. 
Return flow f. 
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FACTS 


F15: If cap is integral, then the initial feasible flow is integral, and each augmenta- 
tion modifies the existing flow by integral amounts. If in addition c is integral, each 
augmentation decreases the cost of the initial flow by at least —1. 


F16: Negative cycles in Gy can be found using the algorithm of Bellman and Ford 
[Be58, Fo56]. (See also the references suggested in the introduction to this section.) 


F17: There is a minimum cost flow in G if and only if G has no negative cost cycles with 
infinite capacity. In this case, the cycle-canceling algorithm finds an integer minimum 
cost flow after at most 1 augmentations. 


COMPUTATIONAL NOTE: 


(a) Zadeh [Za73] demonstrated by a family of examples that the basic cycle-canceling 
algorithm may require O(U) augmentations. 


(b) ) A modified cycle-canceling algorithm that simultaneously augments on a set of 
node-disjoint cycles that decrease the cost of the flow the most finds a minimum 
cost flow after at most O(mlog(nCU)) augmentations [BaTa89]. 


(c) A modified algorithm that augments on a minimum-mean cycle (a cycle C that 
minimizes c(C)/|C|) can be implemented to find a minimum cost flow after at most 
O(mnlog(nC)) augmentations [GoTa8g9]. 


REMARK 


R4: Further discussion of useful ways to select cycles and comparisons with dual 
algorithmic approaches that cancel cuts can be found in [IwMcSh00]. 


11.2.3. The Dual Problem 


Frequently, problems arise that can be solved using minimum cost flow methods, 
but their equivalence to a minimum cost flow problem may not be immediately obvious. 
Certainly, recognizing such problems is important. The linear programming dual of the 
minimum cost flow problem is one such type of problem. 


DEFINITIONS 


D14: The dual of the minimum cost flow problem (DualMCF) defined on flow network 
G is 


Minimize S- CAPywZuw + S- Tydy 
vw v 
Subject to 
Cow + Ty — Tw + 2Z0w 20 for all (v,w)E A 
Zw 20 ~ forall (v,w)E A 


D15: The pair of vectors z and m are dual feasible if they satisfy the constraints of 
DualMCF. 
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FACTS 


F18: Given an arbitrary vector 7 € R”, the pair (7, z™) is dual feasible if z™ € R™ is 
defined by z7,,, = max{0,—c7,, }. 


F19: So CAPywZuw + 24M st + min 2 cole} =0. 


2,0 dasi Measible ee f a flow 


F20: The vectors z and 7 are optimal for DualMCF if and only if there exists a flow 
f with YS capywZvw + 2 Cru fuw = 0. 


U,W U,W 


F21: If flow 3 satisfying the conditions in Fact F20 exists, it can be found by fixing 
f on all arcs (v,w) with c™,, #0 to fow = er a by = by — 2 fow + 2 fuw: 


and setting f on all remaining arcs to the value of a feasible flow for bi in the pene. 
G = (V,A) with A = {(v, w)|c™,, = 0}. 


11.2.4 Algorithms for Minimum Cost Flow 


There are a wide variety of algorithms designed to solve minimum cost flow prob- 
lems. Some algorithms, like network simplex, are based on linear programming. The 
cycle-canceling algorithm of Subsection 11.2.2 can be interpreted as a primal algorithm: 
it starts with a feasible flow, and improves this solution until the complementary dual 
solution (in the sense of Fact F14) is feasible. In this subsection, we discuss two al- 
gorithms: a (primal-dual) augmenting-path algorithm that maintains primal and dual 
solutions that satisfy complementary slackness and works to make both solutions fea- 
sible; and a push-relabel algorithm based on the preflow-push maximum flow algorithm 
in $11.1. In the process we describe two powerful ideas that help algorithms achieve 
polynomial run-time: capacity scaling and cost scaling. 

The first algorithm is simpler to describe and discuss if we assume capacities are 
infinite and G is a complete digraph (each pair of nodes has two oppositely directed arcs 
between them). Thus, we begin by discussing the relation between the transshipment 
problem and the minimum cost flow problem. 


A Transshipment Problem Associated with a Minimum Cost Flow 
Problem 


DEFINITIONS 


D16: Let G = (V,A,cap,c,b) be a flow network with nonnegative costs. The associ- 
ated transshipment network is the network obtained by replacing each arc e = (v, w) 
by three arcs (v, 2), (Ye, %e), and (ye, w) having infinite capacity and with costs c., 0, 
and 0, respectively. Also, the supplies at the new nodes x, and ye are defined to be 
b(ae) = —cape and b(ye) = cape. 


D17: The completion of a transshipment network G = (V, A, cap, c, b) is the complete 
transshipment network obtained by adding all missing arcs and giving each of them 
infinite capacity and arc cost M +1. 
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FACTS 


F22: Let G be a flow network and let G be its associated transshipment network. 
Then G has a feasible flow f if and only if G has a feasible flow f such that for each arc 
e=(v,w), foo. = fe, ia = cape — fe, and fuew = fe. Moreover, the cost of f equals 
the cost of f. 


F23: Let G be a flow network and let G* be the completion of its associated trans- 
shipment network. If the minimum cost transshipment uses an arc with cost M + 1, 
then the original minimum cost flow problem is not feasible. 


F24: Let G be a flow network and let G* be the completion of its associated transship- 
ment network. Any algorithm that finds a minimum cost transshipment in G* yields a 
solution to the minimum cost flow problem for G. 


A Primal-Dual Algorithm 


Edmonds and Karp gave the first polynomial time algorithm to find a minimum cost flow 
[EdKa72]. Their algorithm introduces the idea of capacity scaling. As mentioned above, 
the algorithm presented here is a simplified version designed for the transshipment 
problem on a complete transshipment network. 


DEFINITION 


D18: Given a flow f and a vertex v, the excess at v, denoted excess(v), is the initial 
supply at v minus the net flow out of v, i.e., 


excess(v) = b(v) — S>[f(v, w) — f(w,v)]- 


wEeV 


NOTATION: For a length (or cost) function c, a flow f, and vertex labels 7, dist(s, t) 
denotes the shortest (or least cost) augmenting path from v to w using c™. By Fact 
F12, this is the same as the shortest (or least cost) augmenting path using c. 


Algorithm 11.2.2: Capacity-Scaling Algorithm 


Input: a complete transshipment network G = (V, A, b,c) 

Output: a minimum cost transshipment f and optimal dual prices 7. 
Initialize 7(v) = 0 for all v € V; f(v,w) = 0 for all (v,w) € A; A = allege FI, 
While A > 1 

While max{excess(s)} > A and min{excess(t) } <-A 
Compute dist(s,v) for allu EV. 
Augment flow by A on a least cost s-t augmenting path. 
Update dual price 7(v) = 7(v) + dist(s, v) for all v € V. 
A := A/2. 
Return transshipment f and dual prices 7. 


TERMINOLOGY: A A’-phase is the set of algorithmic steps while A = A’. 
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FACTS 


F25: Throughout Algorithm 11.2.2, f is nonnegative, and after each augmentation 
and label update, f and 7 are complementary. 


F26: When Algorithm 11.2.2 terminates, f is feasible. Thus, since f and 7 are 
complementary, f is minimum cost. 


F27: The number of augmentations in a A-phase is at most n— 1. 


COMPUTATIONAL NOTE: ‘The capacity-scaling algorithm finds a minimum cost trans- 
shipment in O(n(log B)S(n,m, C)) time. Moreover, if G is a standard flow network with 
n vertices and m edges, then the number of vertices in the completion of its associated 
transshipment network (described in Definitions D16 and D17) equals n + m. Thus, 
the number of augmentations necessary to find a minimum cost flow in that complete 
transshipment network (which, by Fact F24, finds one in the original network G) equals 
m+n-—1, which is O(m). Since the complexity of a shortest-path computation is unaf- 
fected by vertices of degree 2 (added in the transformation in Definition D16), it follows 
that the run-time complexity of the capacity-scaling algorithm for solving a minimum 
cost flow problem is O(m(log B)S(n,m, C)). 


COMPUTATIONAL NOTE: Algorithm 11.2.2 can be easily modified to work directly with 
a network with finite capacities. To ensure that the final f and a are complementary 
(so that the final f is optimal), the following condition must be maintained throughout 
the algorithm: c™(v,w) > 0 if residual capacity of (v, w) is at least A. In the modified 
algorithm, the search for augmenting paths in a A-phase is restricted to arcs with resid- 
ual capacity at least A. To maintain the relaxed complementary slackness, immediately 
after A is decreased, each arc with negative reduced cost and residual capacity at least 
A/2 is saturated. 


A Push-Relabel Algorithm 


The push-relabel method was initially introduced for maximum flows (see the preflow- 
push algorithm in §11.1). We describe here a modification for minimum cost flows that 
runs in polynomial time. It relies on cost scaling and is due to Goldberg and Tarjan 
[GoTa8s9]. 


DEFINITIONS 


D19: The pair (f,7) defined on A x V is €-optimal if f is nonnegative and c™(v, w) > 
—e for all arcs with positive residual capacity. 


D20: Arc (v,w) is admissible if it has positive residual capacity and c™(v, w) < 0. 


D21: Given a current flow f in a network G, the operation push(v, w) assigns a flow 
of min{cap(v, w), f(v, w) + excess(v)} to arc(u, v). 


D22: The operation push(v, w) is saturating if the resulting flow on arc (v, w) equals 
cap(v, w); otherwise it is non-saturating. 
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Algorithm 11.2.3: Push-Relabel Algorithm 


Input: a flow network G = (V, A, u, b,c) 
Output: a minimum cost flow f, and optimal dual prices 7. 
Initialize 7(v) = 0 for all ue V; f(v,w) =0 for all (v,w) € As €=C. 
While « > + 
Saturate each arc (v, w) satisfying c™(v, w) < 0. 
While max{excess(v)} > 0, 


Select v with excess(v) > 0. 
If there is w with (v,w) admissible {then push(v, w)} 
f(v,w) := min{cap(v,w), f(v, w) + excess(v)} 
Else  {relabel(v)} 
m(v) := m(v) —€ 
e:=€/2 
Return flow f and dual prices 7. 


TERMINOLOGY: An ¢’-phase is the set of algorithmic steps while € = ¢’. 


FACTS 


F28: If f is nonnegative and z(v) = 0 for all v € V, then (f,7) is C-optimal. If f is a 
flow and (f,7) is eoptimal for « < +, then f is a minimum cost flow. 


F29: After saturation at the start of the e-phase, the pair (f,7) is e-optimal, and it 
remains ¢€-optimal throughout the phase. At the end of each e-phase, f is a flow. Thus 
at the end of the last phase, f is a minimum cost flow. 


F30: The network of admissible arcs is acyclic throughout the algorithm. 
F31: 7a is monotone non-increasing throughout the algorithm. 
F32: There are at most 3n relabels per vertex in an ¢-phase. 


F33: There are O(n) saturating pushes per arc in an e-phase. 


COMPUTATIONAL NOTE: ‘The run-time complexity of the algorithm depends on how v 
is selected and, for a given v, how w is selected. Goldberg and Tarjan [GoTa87], and 
independently Bertsekas and Eckstein [BeEc88], suggest selecting v to be the first vertex 
in a topological order implied by the graph of admissible arcs, and then selecting w by 
cycling through the adjacency list of v. They show that using such protocols, the number 
of non-saturating pushes is O(n*) per e-phase. Using more sophisticated data structures, 
Goldberg and Tarjan [GoTa87] show that an implementation of this algorithm runs 
in O(nmlog(n?/m) log(nC)) time. Coupled with heuristic improvements, such as set 
relabels, price buckets, flow fixing, and push look-ahead, this algorithm has been shown 
to be experimentally competitive [Go97]. 
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Strongly Polynomial Algorithms 


There are numerous algorithms to solve the minimum cost flow problem, and detailed 
descriptions and comparisons already exist in [AhMaOr93, IwMcSh00, Sc03]. Both 
algorithms presented here have run-time complexities that depend on the size of numbers 
in the input data. The first strongly polynomial time algorithm (an algorithm with a 
run-time complexity that does not depend on the size of the numbers in the input data) 
to solve minimum cost flows is due to Tardos [Ta85] and introduces the idea of fixing 
flows on arcs. Most subsequent strongly polynomial time algorithms fix either arc flows 
or vertex labels. The minimum-mean cycle-canceling algorithm of Goldberg and Tarjan 
mentioned in Subsection 11.2.2 can also be made to run in strongly polynomial time 
[GoTa89]. The fastest strongly polynomial algorithm is due to Orlin [Or88], is based on 
capacity scaling and arc-flow fixing, and has run-time complexity O(m log nS(n,m,C)). 


11.2.5 Extensions to Minimum Cost Flow 


Convex Cost Flows 


Separable convex flow costs model some natural phenomena not captured by linear 
costs, for example traffic flows and matrix balancing. 


DEFINITIONS 


D23: Given separable convex functions c, : R > R for all e € A, and a flow network 
G, the minimum convex cost flow is a flow that minimizes }°.. 4 Ce(fe) over all 
feasible flows. 


D24: Convex costs in the discrete model are defined by piecewise linear curves, 
where each cost curve Ce is defined by at most p linear segments. The linear segments 
meet at breakpoints. Convex costs in the continuous model are defined in functional 
form. 


FACT 


F34: In the discrete model, breakpoints of the cost curve can be assumed to be integers. 
This problem can be solved in polynomial time by transforming the convex cost flow 
problem into a minimum cost flow problem. 


COMPUTATIONAL NOTE: In the continuous model, there is an issue of error introduced 
due to approximation of the values of the continuous function. This can be handled ei- 
ther by restricting the search for optimal flows to integral optimal flows, or by obtaining 
a close to optimal solution, within a tolerance dictated by the accuracy of the oracle for 
the continuous function. In this sense, the continuous problem can be solved using tech- 
niques from nonlinear programming, or by extending various algorithms for minimum 
linear cost flows (e.g., capacity-scaling augmenting-path algorithm [Mi86]; minimum- 
mean cycle-canceling algorithm [KaMc97]; cost-scaling algorithm [BePoTs97]). 
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Flows Over Time 


Flows over time model problems where time plays a crucial role, such as transportation 
and telecommunications. They are also called dynamic flows. 


DEFINITIONS 


D25: A flow-over-time network is a flow network G, = (V, A, cap,7, b) where each 
arc (v,w) € A has an associated transit time 7,,. The transit time 7T,,, represents 
the amount of time that elapses between when flow enters arc (v, w) at v and when the 
same flow arrives at w. 


D26: A flow over time x on G, with time horizon T is a collection of Lebesgue- 
measurable functions x, : [0,T) > R where x.(0) is the rate of flow (per time unit) 
entering arc e at time 0. For notational convenience, define x-(@) = 0 for all @ outside 
the interval [0, 7). A flow over time satisfies the following conditions: 


nonnegativity constraints: yw() > 0 for all (v,w) € A and 6 € (0, T]. 
flow conservation: The flow entering arc (w,v) at time @ arrives at v at time 04+ Tw», 


Le., 
g 
| os Lyw(9) — S- ren 0 ru) = 0 


wEeV weVv 
for all € € [0,T), and v € V with b, =0. 


time horizon: There is no flow after time T: x.(0) = 0 for 0 € [T —7.,T); and at 
time 7, no flow should remain in the network, i.e., 


i 
| (Se - lO ree) a 


wEeV weV 


I 


b,, for allue V 


(flow rate) capacity constraints: x-(0) < cape, for all 0 € [0,T) ande€ A. 


D27: In the setting with costs, the cost of a flow over time x is defined as 


rT 
C(x) = aH CeXe(O) dO 


ecA 


D28: A maximum s-t flow over time with time horizon T is a flow over time for 
which b, = 0 for all v € V — {s,t} and 6, is maximum among all such flows over time 
with time horizon T. 


D29: A path flow (or cycle flow) f in a flow network G is a flow on a set of arcs 
S that forms a path (or cycle) such that f(e) =v one € S and f(e) =Oone¢ S. In 
this case, the volume of the path flow (or cycle flow) is v. 


D30: Let f be a standard flow in a network G = (V,A,cap,c,b), and let T be a set 
of path flows and cycle flows in G. The set [ is a flow decomposition of the flow f 
if for each arc (v,w) € A, and for each 7 € T, y(v,w) > 0 only if f(v,w) > 0, and the 
flow f(v,w) = Dyer 1%: w). 
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NOTATION: For a given flow-over-time network G',, the standard flow network obtained 
by ignoring transit times 7 is denoted G. 


NOTATION: For a path or cycle flow 7 € I, denote by v(y) the volume of 7, and denote 
by P(7) the path of arcs corresponding to +. If T is a flow decomposition of flow in the 
standard flow network obtained from G,, let T(y) denote the sum of the transit times 
of arcs in P(¥). 


FACTS 


F35: For any standard flow f in G = (V, A, cap, c, b), there exists a flow decomposition 
I’ of flows on simple paths and cycles, where |I'| < m. 


F36: [FoFu58] Let G be an s-t flow network and let G be the augmented network 
obtained by adding an infinite capacity arc (t,s) with cost c(t,s) = —T. Suppose f is 
a minimum cost circulation in G and T is a flow decomposition of f restricted to G. 
Then r(y) < T for all path flows y € T. Define a flow over time as follows: for each 
path flow y € T, send flow along P(+) by inserting flow into the first arc in P(y) at 
rate v(y) from time 0 until time T — r(y). This flow will arrive at the end of P(y) in 
the interval [r(y),T). Ford and Fulkerson [FoFu58] showed that this flow over time is a 
maximum s-t flow over time with time horizon T. 


F37: Unlike the case with a standard multiple-source multiple-sink, maximum flow 
problem (Subsection 11.1.5), there is no simple transformation of a multiple-source, 
multiple-sink flow-over-time problem into an s-t flow-over-time problem. However, there 
is a combinatorial algorithm that solves this problem in polynomial time [HoTa00]. 


F38: Finding a minimum cost flow-over-time with time horizon T is NP-hard [K]Wo95], 
but, for any fixed € > 0, an integral solution with time horizon T(1 + €) and cost at 
most the cost of the minimum cost flow-over-time with time horizon T can be found in 
polynomial time (where the run time complexity depends linearly on €~?) [FISk03]. 


Flows with Losses and Gains 


Flows with losses and gains model flow problems where leakage or loss may occur. 
Some examples of this are financial transactions, shipping, conversion of raw materials 
into products, and machine loading. These flows with losses and gains are also called 
generalized flows. 


DEFINITIONS 


D31: <A gain network G = (V,A,cap,y,c,s,t) is a network G = (V,<A,cap,c, b) 
with positive-valued gain function y : EF — R* and supply function by = 0 for all 
v €V —{s,t}. The gain factor y(e) > 0 for arc e enforces that for each unit of flow 
that enters the arc, y(e) units exit. If y(e) < 1, the arc e is lossy; if y(e) > 1, the arc e 
is gainy. In general, the term gains is used to denote the gain functions of both lossy 
and gainy arcs. For standard network flows, the gain factor of every arc is one. 
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D32: A flow with gains is a function f : A > R that satisfies: 
capacity constraints: f(v,w) < cap(v,w) for all (v,w) € A, 
nonnegativity constraints: f(v,w) > 0 for all (v,w) € A, 


flow conservation constraints: 


S- [f(v, w) — f(w,v)y(w,v)] =0, for each v € V — {s, t}. 
weVv 


D33: A maximum flow with gains is a flow with gains that maximizes the amount 
of flow reaching ¢ given an unlimited supply at s. 


D34: A minimum cost maximum flow with gains is a maximum flow with gains 


that minimizes S> c(e)g(e). 
ecA 


FACT 


F39: There are combinatorial optimality conditions for flows with losses and gains 
that generalize the optimality conditions for standard flows described in §11.2.2 [On67, 
Tr77]. 


COMPUTATIONAL NOTE: Maximum and minimum cost flows with gains can be solved 
by linear programming. The versions in which f is restricted to be integral are NP-hard. 
Combinatorial, polynomial algorithms for maximum flow with gains are based on high 
capacity paths (paths that send the most to the sink) [GoP1Ta91], or on high gain paths 
[GoJiOr97]. A combinatorial, polynomial time algorithm for minimum cost flow with 
gains is based on a generalization of the minimum-mean cycle-canceling algorithm for 
minimum cost flow [Wa02]. It is an interesting open question if there exists a strongly 
polynomial algorithm for this problem. 
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INTRODUCTION 


In an undirected graph, the maximum matching problem requires finding a set of 
nonadjacent edges having the largest total size or largest total weight. This graph 
optimization problem arises in a diverse number of applications, often involving the 
optimal pairing of a set of objects. 


11.3.1 Matchings 


Matchings are defined on undirected graphs, in which the edges can be weighted. 
Matchings are useful in a wide variety of applications, such as vehicle and crew schedul- 
ing, sensor location, snowplowing streets, scheduling on parallel machines, among others. 


Basic Terminology 


DEFINITIONS 


D1: Let G = (V,£) be an undirected graph with vertex set V and edge set E. Each 
edge e € EF has an associated weight we. 


D2: A matching in G = (V,E) isaset M C E of pairwise nonadjacent edges. 


D3: A vertex cover in G is a set C of vertices such that every edge in G is incident 
on at least one vertex in C. 
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D4: A perfect matching in G = (V,E) is a matching M in which each vertex of V 
is incident on exactly one edge of M. 


TERMINOLOGY: A perfect matching of G is also called a 1-factor of G; see §5.3. 


D5: The size (cardinality) of a matching M is the number of edges in M, written 
|M|. The weight of a matching M is wt(M) = oO .ej4 We- 


D6: A maximum-size matching of G is a matching M having the largest size |M]. 


D7: A mazimum-weight matching of G is a matching M having the largest weight 
wt(M). 


D8: Relative to a matching M in G = (V, E), edges e € M are matched edges, while 
edges e € EF — M are free edges. Vertex v is matched if it is incident on a matched 
edge; otherwise vertex v is free (or unmatched). 


D9: Every matched vertex v has a mate, the other endpoint of the matched edge 
incident on v. 


D10: With respect to a matching M, the weight wt(P) of path P is the sum of the 
weights of the free edges in P minus the sum of the weights of the matched edges in P. 


D11: An alternating path has edges that are alternately free and matched. An 
augmenting path is an alternating path that starts at a free vertex and ends at another 
free vertex. 


NOTATION: Throughout this section, edges are represented as ordered pairs of vertices, 
and when discussing matchings, paths are represented as edge sets. 


EXAMPLES 


El: Figure 11.3.1 shows a graph G together with the matching M; = {(2,3), (4,5)} of 
size 2; the matched edges are highlighted. The mate of vertex 2 is vertex 3, and the mate 
of vertex 5 is vertex 4. Relative to matching M,, vertices 1 and 6 are free vertices, and 
an augmenting path P from 1 to 6 is given by P = {(1, 2), (2,3), (3, 4), (4, 5), (5, 6)}. The 
matching M, is not a maximum-size matching; the matching Mz = {(1, 2), (3, 4), (5, 6)} 
of size 3 is a perfect matching, and so is also a maximum-size matching. 


Figure 11.3.1: A matching in a graph. 
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E2: Figure 11.3.2 shows a graph G with 6 vertices. The matching M = {(2, 4), (3,5)} 
displayed is a maximum-size matching, of size 2. This graph G does not have a perfect 
matching since vertices 1 and 5 are only adjacent to vertex 3. 


1 2 
3 4 
5 6 


Figure 11.3.2: A maximum-size matching that is not perfect. 


E3: In the weighted graph G of Figure 11.3.3 below, the weight we is shown next 
to each edge e. The weight of matching M = {(1, 2), (3,5)} is wt(M) = 7. Relative 
to this matching, the path P = {(1, 2), (2,5), (3,5), (3,6)} is an alternating path with 
wt(P) =7+1—2-—5=1. The path {(1, 4), (1, 2), (2,3), (3,5), (5, 6)} is an augmenting 
path, joining the free vertices 4 and 6. 


1 2 2 4 3 
Eza 
4 ° 5 3 6 


Figure 11.3.3: A matching in a weighted graph. 


Some Fundamental Results 


FACTS 


F1: If M is a matching of G = (V,E), then the number of matched vertices is 2|M| 
and the number of free vertices is |V| — 2|M]. 


F2: If M is any matching in G, then |M| < |S]. 


F3: (Weak Duality) The size of any vertex cover of G is an upper bound on the size 
of any matching in G. Consequently the minimum size of a vertex cover is at least as 
large as the maximum size of a matching. 


F4: Every augmenting path has an odd number of edges. 


F5: If M is a matching and P is an augmenting path with respect to M, then the 
symmetric difference MAP is a matching of size |M| +1. 

NOTE: The symmetric difference MAP is taken with respect to the edge sets defining 
M and P. 


F6: (Augmenting Path Theorem) M is a maximum-size matching if and only if there 
is no augmenting path with respect to M. (See [Pel891, Be57, NoRa59].) 
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F7: If M isa matching and P is an augmenting path with respect to M, then wt( MAP) 
= wt(M) + wt(P). 


F8: Suppose M is a matching having maximum weight among all matchings of a fixed 
size k. If P is an augmenting path of maximum weight with respect to MW, then MAP 
is a maximum-weight matching among all matchings of size k + 1. 


F9: Let M; be a maximum-weight matching among all matchings of a fixed size 7, 
i= 1,2...,k, and let P; be a maximum-weight augmenting path with respect to M;. 
Then wt(P,) > wt(P2) >--- > wt(P,). 


F10: An immediate consequence of Facts F7 and F8 is a weighted-matching analogue 
of the Augmenting Path Theorem: A matching M is of maximum weight if and only if 
the weight of every augmenting path relative to M is nonpositive. 


EXAMPLES 


E4: In Figure 11.3.1, the path P = {(1, 2), (2,3), (3, 4), (4, 5), (5, 6)} is augmenting with 
respect to the matching M, = {(2,3), (4,5)}. As guaranteed by Fact F4, path P has an 
odd number of edges. The new matching Mz = M,AP = {(1, 2), (3,4), (5,6)} has size 
one greater than M1, and is a maximum-size matching. There are other maximum-size 
matchings, such as {(1, 2), (3,6), (4,5)} and {(1, 5), (2,6), (3, 4)}. 


E5: In Figure 11.3.2, the set S = {3,4} is a vertex cover of G. Thus, by Fact F3, the 
size of any matching M satisfies |M| < 2 = |S|. On the other hand, S = {2,3,4,5} isa 
(minimum cardinality) vertex cover of the graph in Figure 11.3.1, yet a maximum-size 
matching M for this graph satisfies |M| = 3 < |S]. 


E6: Figure 11.3.4(a) shows a matching M, of size 1, with wt(M,) = 7. Since edge 
(2,5) has maximum weight among all edges, M, is a maximum-weight matching of 
size 1. Relative to M,, the augmenting path P, = {(1,5), (2,5), (2,3)} has weight 
wt(P,) =6+4-—7 = 3, whereas the augmenting path P) = {(3,6)} has weight 1. It can 
be verified that P, is a maximum-weight augmenting path relative to M,. Ilustrating 
Fact F8, Mz = M, AP, = {(1,5), (2,3)} is a maximum-weight matching of size 2, with 
wt(M2) = wt(M,) + wt(Pi) = 10 (see Figure 11.3.4(b)). Relative to M2 there are 
several augmenting paths between the free vertices 4 and 6: 


Qi = {(1,4), (1,5), (5, 6)}, wt(Q1) =1+3-6=-2, 

Qe = {(1,4), (1,5), (2,5), (2,3), (3, 6)}, wt(Q2) =1+7+1-6-4=-1, 

Q3 = {(4,5), (1, 5), (1, 2), (2,3), (3,6)}, wt(Qs) =5+2+1-6-4=-2. 
Path Q2 is a maximum-weight augmenting path and so (by Fact F8) M3 = MzAQ2 = 
{(1, 4), (2, 5), (3,6)} is a maximum-weight matching of size 3, with wt(M3) = 9. All 
augmenting paths relative to M2 have negative weight, and so (by Fact F9) M2 is a 
maximum-weight matching in G. 


1 2 2 4 3 1 2 2 4 3 
4 5 5 3 6 4 5 5 3 6 


Figure 11.3.4: Maximum-weight matchings of sizes 1 and 2. 
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REMARKS 


R1: Fact F6 was obtained independently by C. Berge [Be57] and also by R. Z. Norman 
and M. O. Rabin [NoRa59]. This result can also be found in an 1891 paper of J. Petersen 
[Pe1891]. 


R2: An historical perspective on the theory of matchings is provided in [P192]. 


R3: Plummer [P193] describes a number of variations on the standard matching prob- 
lem, together with their computational complexity. 


11.3.2 Matchings in Bipartite Graphs 


Bipartite graphs arise in a number of applications (such as in assigning personnel to 
jobs or tracking objects over time). See the surveys [AhMaOr93, AhMaOrRe95, Ge95] 
as well as the text [LoP186] for additional applications. This section describes properties 
and algorithms for maximum-size and maximum-weight matchings in bipartite graphs. 


DEFINITIONS 
D12: Let G=(X UY,E£) bea bipartite graph with edge weights w-. 


D13: If S CX then I(S) = {y € Y | (a, y) € E for some x € S} is the set of vertices 
in Y adjacent to some vertex of S. 


D14: A complete (or X-saturating) matching of G = (X UY, E) is a matching M 
in which each vertex of X is incident on an edge of M. Such a matching is also called 
an assignment from X to Y. 


APPLICATIONS 


Al: A drug company is testing n antibiotics on n volunteer patients in a hospital. Some 
patients have known allergic reactions to certain of these antibiotics. To determine 
if there is a feasible assignment of the n different antibiotics to n different patients, 
construct the bipartite graph G = (X UY, E), where X is the set of antibiotics and Y is 
the set of patients. An edge (7,7) € E exists when patient j is not allergic to antibiotic 
i. A complete matching of G is then sought. 


A2: An important preprocessing step in solving large sparse systems of linear equa- 
tions involves rearranging the given n x n coefficient matrix A using row and column 
permutations. The objective is to place the maximum number of nonzero coefficients 
on the diagonal of the permuted matrix [Du81]. This can be viewed as a maximum-size 
matching problem on a bipartite graph G = (X UY, E). The set X contains the n row 
indices of A and the set Y contains the n column indices of A. An edge (i,j) € E exists 
when a;; 4 0. 
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A3: There are n applicants to be assigned to n jobs, with each job being filled with 
exactly one applicant. The weight w,;; measures the suitability (or productivity) of 
applicant i for job 7. Finding a valid assignment (matching) achieving the best overall 
suitability is a weighted matching problem on the bipartite graph G = (X UY, E), where 
X is the set of applicants and Y is the set of jobs. 


A4: The movements of n objects (such as submarines or missiles) are to be followed over 
time. The locations of the group of objects are known at two distinct times, though 
without identification of the individual objects. Suppose X = {21,22,...,2,} and 
Y = {yi,y2,---,;Yn} represent the spatial coordinates of the objects detected at times 
t and t+ At. If At is sufficiently small, then the Euclidean distance between a given 
object’s position at these two times should be relatively small. To aid in identifying 
the objects (as well as their velocities and directions of travel), a pairing between set 
X and set Y is desired that minimizes the overall sum of Euclidean distances. This 
can be formulated as a maximum-weight matching problem on the complete bipartite 
graph G = (X UY, E), where edge (2,7) indicates pairing position x; with position yj. 
The weight of this edge is the negative of the Euclidean distance between x; and y;. A 
maximum-weight matching of size n in G then provides an optimal (minimum distance) 
pairing of observations at the two times t and t + At. 


FACTS 


F11: (Konig’s Theorem) For a bipartite graph G, the maximum size of a matching 
in G is the minimum cardinality of a vertex cover in G. Thus for bipartite graphs the 
general inequality stated in Fact F3 can always be satisfied as an equality. (See [Bo90].) 


F12: (Hall’s Theorem) G = (X UY,E) has a complete matching if and only if 
[[(S)| > |S| holds for every S C X. In words, a complete matching exists precisely 
when every set of vertices in X is adjacent to at least an equal number of vertices in Y. 
(See [Bo90, Gr04].) 


F13: Suppose there exists some & such that deg(x) > k > deg(y) holds in G = 
(X UY, E) for all a € X and y © Y. Then G has a complete matching. (See [Gr04].) 


EXAMPLES 


E7: In the bipartite graph of Figure 11.3.5, S = {1,3,0,d} is a vertex cover of mini- 
mum cardinality, and M = {(1,a), (3,c), (4,5), (5,d)} is a maximum-size matching. As 
guaranteed by K6énig’s Theorem, |M]| = |S|. Also, by choosing A = {2,4,5} we have 
T(A) = {b, d}. Since |[(A)| < |A| holds, Hall’s Theorem shows that there is no complete 
matching with respect to the set X = {1,2,3,4,5}. In fact, the maximum matching 
above has size 4 < 5. 


1 a 
2 b 
3 c 
4 d 
5 e 


Figure 11.3.5: Covers and matchings in a bipartite graph. 
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E8: In the chessboard of Figure 11.3.6 we are to place non-taking rooks at certain 
allowable positions, those marked with an X. For example, we can place rooks at the 
independent positions (1,3), (2,4), (4,1): no two selected positions are in the same row 
or column. It turns out that three is the maximum number of rooks that can be so placed 
with regard to the allowable positions X. Also, notice that row 2, row 4, and column 3 
are three lines in the chessboard containing all X entries; in fact, no fewer number of 
lines suffice. Here the maximum number of non-taking rooks among the X entries equals 
the minimum number of lines containing all the X entries. This is a manifestation of 
K6nig’s Theorem, obtained by constructing the bipartite graph G = (X UY, E) where 
X contains the rows {1,2,3,4} and Y contains the columns {1, 2,3, 4,5}; edge (7,7) € E 
indicates an X in row z and column j. In this context, independent positions correspond 
to a matching and covering lines correspond to a vertex cover in G. 


1 2 3 4 5 


as 


- WO PD 


Figure 11.3.6: A chessboard with allowable X entries. 


REMARKS 


RA: Konig’s Theorem and Hall’s Theorem can be derived from the Max-Flow Min-Cut 
Theorem of §11.1. 


R5: Maximum-size matching problems in bipartite graphs can be formulated as maxi- 
mum flow problems in unit capacity networks and solved using maximum flow algorithms 


(§11.1). 


R6: Maximum-weight matching problems in bipartite graphs can be formulated as 
minimum cost flow problems in two-terminal flow networks and solved using minimum 
cost flow algorithms (§11.2). 


Bipartite Maximum-Size Matching Algorithm 


Algorithm 11.3.1, based on Fact F6, produces a maximum-size matching of the bipartite 
graph G = (X UY,£). Each iteration involves a modified breadth-first search of G, 
starting with all free vertices in set X. The vertices of G are structured into levels that 
alternate between free and matched edges. Algorithm 11.3.1 can be implemented to run 
in O(nm) time on a graph with n vertices and m edges; see [PaSt82]. 
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Algorithm 11.3.1: Bipartite Maximum-Size Matching 


Input: Bipartite graph G = (X UY, E). 
Output: Maximum-size matching M. 
M:=90 
DONE := FALSE 
While NOT DONE 
Let FREE consist of all the free vertices of G. 
Sx := XN FREE 
SEEN := 0 
STILL_LOOKING := TRUE 
While STILL_LOOKING (for an augmenting path} 
Sy := {y|y¢ SEEN and (2,y) € FE, x € Sx} 
If Sy 0D FREE 4 @ {an augmenting path exists} 
Construct an augmenting path P to y*. [x] 
M := MAP 
STILL_LOOKING := FALSE 
Else {continue looking for an augmenting path} 
SEEN := SEEN U Sy 
Sy := {x | (y,7) © M, ye Sy} 
If Sx =0 
STILL_LOOKING := FALSE 
DONE := TRUE 


REMARK 


R7: The augmenting path at Step [x] is constructed in reverse, starting at the free 
Y-vertex y*. Choose a vertex x € Sx (adjacent to y*) by which y* was defined to be 
an element of Sy. Then choose the vertex y € Sy that is matched to x in M. Vertices 
from X and Y are alternately chosen in this way until an x is chosen from the initial 
Sx, which means that it is a free vertex. 


EXAMPLE 


E9: Algorithm 11.3.1 can be used to find a maximum-size matching in the bipartite 
graph of Figure 11.3.7 below. We begin with the matching M = {(1,a), (2,b)} of size 
2, shown in Figure 11.3.7(a). At the next iteration, Sx = {3,4} and Sy = {a,b}. 
Since both vertices of Sy are matched, the algorithm continues with Sx = {1,2} and 
Sy = {c}. Since c € Sy is free, with augmenting path P = {(3,a), (a, 1), (1,c)}, the new 
matching produced is M = {(1,c), (2, 6), (3, a)}; see Figure 11.3.7(b). The next iteration 
produces Sx = {4}, Sy = {b}; Sx = {2}, Sy = {a,c}; finally Sx = {1,3}, Sy = 9, 
Sx =. No further augmenting paths are found, and Algorithm 11.3.1 terminates with 
the maximum-size matching M = {(1,c), (2,b), (3, a)}. 


a 


(a) 


1 
2 

: (b) 3 
4 


Cc 


BR WwW NO 


Cc 


Figure 11.3.7: Maximum-size matching in a bipartite graph. 
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Bipartite Maximum-Weight Matching Algorithm 


Algorithm 11.3.2, based on Facts F8-F10, produces a maximum-weight matching of 
G = (X UY,E£). Each iteration finds a maximum-weight augmenting path relative to 
the current matching M. The algorithm terminates when the path has nonpositive 
weight. A straightforward implementation of Algorithm 11.3.2 runs in O(n?m) time on 


a graph with n vertices and m edges. 


NOTATION: The tentative largest weight of an alternating path from a free vertex in X 


to vertex 7 is maintained using the label d(). 


Algorithm 11.3.2: Bipartite Maximum-Weight Matching 


Input: Bipartite graph G = (X UY, E) with edge weights we. 
Output: Maximum-weight matching M. 
M:=90 
DONE := FALSE 
While NOT DONE 
Let Sx consist of all the free vertices of X. 
Let d(j) := 0 for 7 € Sx and d(j) := —oo otherwise. 
While Sx #40 
Sy := 0 
For each edge (a, y) € E— M with x € Sx 
If d(x) + Way > dy) 
d(y) := d(x) + wey 
Sy := Sy U {y} 
Sx = 0 
For each edge (y,x) € M with y € Sy 
If d(y) — Wyx > d(x) 
d(x) := d(y) — Wyz 
Sx = Sx U {x} 
Let y be a free vertex with maximum label d(y) 
and let P be the associated path. 


If d(y) > 0 
M := MAP 
Else 


DONE := TRUE 
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EXAMPLE 


E10: Algorithm 11.3.2 can be used to find a maximum-weight matching in the bipartite 
graph of Figure 11.3.8. If we begin with the empty matching, then the first iteration 
yields the augmenting path P, = {(3,a)}, with wt(P,) = 6, and the maximum-weight 
matching (of size 1) M = {(3,a)}, with wt(M) = 6; see Figure 11.3.8(a). The next 
iteration starts with Sx = {1,2}. The labels on vertices a,b,c are then updated to 
d(a) = 4, d(b) = 4, d(c) = 5, so Sy = {a,b,c}. Using the matched edge (a, 3), vertex 
3 has its label updated to d(3) = —2 and Sx = {3}. No further updates occur, and 
the free vertex c with maximum label d(c) = 5 is selected. This label corresponds 
to the augmenting path P, = {(2,c)}, with wt(P2) = 5. The new matching is M = 
{(2,c), (3, a)}, with wt(M) = 11; see Figure 11.3.8(b). At the next iteration, Sx = {1} 
and vertices a,b receive updated labels d(a) = 4, d(b) = 1. Subsequent updates produce 
d(3) = —2, d(c) = 3, d(2) = —2, d(b) = 2. Finally, the free vertex b is selected with 
d(b) = 2, corresponding to the augmenting path P3 = {(1,a), (a, 3), (3, c), (¢, 2), (2, b)}, 
with wt(P;) = 2. This gives the maximum-weight matching M = {(1,a), (2, b), (3,c)}, 
with wt(M) = 13, shown in Figure 11.3.8(c). As predicted by Fact F9, the weights of 
the augmenting paths are nonincreasing: wt(P,) > wt(P2) > wt(Ps). 


4 4 4 
1 a1 a1 a 
20 fe» 2 at fed 2 oie 

5 5 5 


Figure 11.3.8: Maximum-weight matchings of sizes 1, 2, and 3. 


11.3.3. Matchings in Nonbipartite Graphs 


This section discusses matchings in more general (nonbipartite) graphs. Algorithms 
for constructing maximum-size and maximum-weight matchings are considerably more 
intricate than for bipartite graphs. The important new concept is that of a blossom. 


DEFINITIONS 


D15: Suppose P is an alternating path from a free vertex s in graph G = (V, F). Then 
a vertex v on P is even if the subpath P,,, of P joining s to v has even length; it is odd 
if P,,, has odd length. 


D16: Suppose P is an alternating path from a free vertex s to an even vertex v and 
edge (v,w) € E joins v to another even vertex w on P. Then PU {(v,w)} contains a 
unique cycle, called a blossom. 


D17: A shrunken blossom results when a blossom B is collapsed into a single vertex 
b, whereby any edge (x,y) with « ¢ B and y € B is transformed into the edge (z, b). 
The reverse of this process gives an expanded blossom. 
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FACTS 


F14: A blossom B has odd length 24 + 1 and contains & matched edges, for some 
k>1. 


F15: A bipartite graph contains no blossoms. 


F16: (Edmonds’s Theorem) [Ed65a] Suppose graph G? is formed from G by collapsing 
blossom B. Then G? contains an augmenting path if and only if G does. 


F17: (General Maximum-Size Matching) Algorithm 11.3.3, based on Fact F6, produces 
a maximum-size matching of G. At each iteration, a forest of trees is grown, rooted 
at the free vertices of G, in order to identify an augmenting path. As encountered, 
blossoms B are shrunk, with the search continued in the resulting graph G?. 


Algorithm 11.3.3: General Maximum-Size Matching 


Input: Graph G = (V, E). 
Output: Maximum-size matching M. 
M:=90 
DONE := FALSE 
While NOT DONE 
Mark all free vertices as even. 
Mark all matched vertices as unreached. 
Mark all free edges as unexamined. 
While there are unexamined edges and no augmenting path is found 
Let (v,w) be an unexamined edge. 
Mark (v, w) as examined. 
{Case 1} 
If v is even and w is unreached 
Mark w as odd and its mate z as even. 
Extend the forest by adding (v, w) and matched edge (w, z). 
{Case 2} 
If v and w are even and they belong to different subtrees 
An augmenting path has been found. 
{Case 3} 
If v and w are even and they belong to the same subtree 
A blossom B is found. 
Shrink B to an even vertex b. 
If an augmenting path P has been found 
M := MAP 
Else 
DONE := TRUE 
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NOTATION: Throughout we let n = |V| and m = |E|. 


F18: Algorithm 11.3.3 was initially proposed by Edmonds [Ed65a] with a time bound 
of O(n*). An improved implementation of Algorithm 11.3.3 runs in O(nm) time; see 
[Ta83, Ge95]. 


F19: Maximum-size matchings in nonbipartite graphs can also be found using the 
algorithm of Gabow [Ga76], which runs in O(n) time, and the algorithm of Micali and 
Vazirani [MiVa80], which runs in O(m,/n) time. 


F20: More complicated algorithms are required for solving weighted-matching prob- 
lems in general graphs. The first such algorithm, also involving blossoms, was developed 
by Edmonds [Ed65b] and has a time bound of O(n‘). 


F21: Improved algorithms exist for the weighted-matching problem, with running times 
O(n3) and O(nmlogn), respectively; see [AhMaOr93, Ge95].. An O(nm + n? log n) 
algorithm is given by Gabow [Ga90]. An efficient implementation of the Edmonds 
blossom algorithm is given in [Ko09]. 


EXAMPLES 


Ell: In Figure 11.3.9(a), P = {(1,2), (2,3), (3,4), (4,5)} is an alternating but not 
augmenting path, with respect to the matching M = {(2,3),(4,5)}. Relative to path 
P, vertices 1,3,5 are even while vertices 2,4 are odd. Since (5,3) is an edge joining 
two even vertices on P, the blossom B = {(3, 4), (4,5), (5,3)} is formed. On the other 
hand, Q = {(1, 2), (2,3), (3,5), (5, 4), (4,6)} is an augmenting path relative to M so 
that MAP = {(1, 2), (3,5), (4,6)} is a matching of larger size—in fact a matching of 
maximum size. Notice that relative to path Q, vertices 1,3,4 are even while vertices 
2,5,6 are odd. 


E12: Shrinking the blossom B relative to path P in Figure 11.3.9(a) produces the graph 
G® shown in Figure 11.3.9(b). The path P? = {(1,2), (2,b), (b,6)} is now augmenting 
in G®. By expanding P? so that (2,3) remains matched and (4,6) remains free, the 
augmenting path Q = {(1, 2), (2,3), (3, 5), (5, 4), (4,6)} in G is obtained. 


Figure 11.3.9: Shrinking blossom B. 
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E13: Algorithm 11.3.3 can be applied to the nonbipartite graph shown in Figure 
11.3.10(a). Suppose the matching M = {(3, 4), (6,8)} of size 2 is already available. 


Iteration 1: The free vertices 1,2,5,7 are marked as even, and the matched vertices 
3,4,6,8 are marked as unreached. The initial forest consists of the isolated vertices 
1, 2,5, 7. 

e If the free edge (2,3) is examined then Case 1 applies, so vertex 3 is marked odd and 
vertex 4 even; the free edge (2,3) and the matched edge (3,4) are added to the forest. 
e If the free edge (7,4) is next examined then Case 2 applies, and the augmenting path 
P = {(2,3), (3,4), (4, 7)} is found. Using P the new matching M = {(2,3), (4,7), (6,8)} 
of size 3 is obtained; see Figure 11.3.10(b). 

Iteration 2: The forest is initialized with the free (even) vertices 1, 5. 

e If the free edge (1,2) is examined then Case 1 applies, so vertex 2 is marked odd and 
vertex 3 even; edges (1,2) and (2,3) are added to the forest. 

e Examining in turn the free edges (3, 4) and (7,6) makes 4,6 odd vertices and 7, 8 even. 
Edges (3, 4), (4,7), (7,6), (6,8) are then added to the subtree rooted at 1. 

e If edge (8,7) is examined, then Case 3 applies, and the blossom B = {(7,6), (6,8), 
(8,7)} is detected and shrunk; Figure 11.3.10(c) shows the resulting G?. The current 
subtree rooted at 1 now becomes {(1, 2), (2,3), (3, 4), (4, b)}. 

e If the free edge 


(0, 
{(1, 2); (2,3), (3,4), (4 
in G is P = {(1, 2), (2 
new matching {(1, 2) 
11.3.10(d). 


5) is examined, then Case 2 applies and the augmenting path 
,b),(b,5)} is found in G?. The corresponding augmenting path 
, 3), (3, 4), (4, 7), (7, 8), (8, 6), (6,5)}. Forming MAP produces the 

, (3, 4), (5, 6), (7, 8)}, which is a maximum-size matching; see Figure 


1 2 3 4 1 2. 3 4 

(a) (b) 
5 fe 5 7 

8 8 
1 2 3 4 
1 2 3 4 

c) nie a (d) 

5 7 
Figure 11.3.10: Illustrating Algorithm 11.3.3. 


APPLICATIONS 


A5: Pairs of pilots are to be assigned to aircraft serving international routes. Pilots 
i and 7 are considered compatible if they are fluent in a common language and have 
comparable flight training. Form the graph G whose vertices represent pilots and whose 
edges represent compatible pairs of pilots. The problem of flying the largest number of 
aircraft with compatible pilots is then a maximum-size matching problem on G. 
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A6: Bus drivers are hired to work two four-hour shifts each day. Union rules require 
a certain minimum amount of time between the shifts that a driver can work. There 
are also costs associated with transporting the driver between the ending location of 
the first shift and the starting location of the second shift. The problem of optimally 
combining pairs of shifts that satisfy union regulations and incur minimum total cost 
can be formulated as a maximum-weight matching problem. Namely, define the graph 
G with vertices representing each shift that must be covered and edges between pairs of 
compatible shifts (satisfying union regulations). The weight of edge (i, 7) is the negative 
of the cost of assigning a single driver to shifts 7 and 7. It is convenient also to add 
edges (i,i) to G to represent the possibility of needing a part-time driver to cover a 
single shift; edge (2,2) is given a sufficiently large negative weight to discourage single- 
shift assignments unless absolutely necessary. A maximum-weight perfect matching in 
G then provides a minimum-cost pairing of shifts for the bus drivers. 


11.3.4 Stable Matchings 


The stable matching problem, first discussed by D. Gale and L. S. Shapley [GaSh62], 
is a fundamental model for assignment problems with preferences (e.g., the assignment 
of medical residents to hospitals in the United States). 


DEFINITIONS 


D18: Given two disjoint sets X and Y with |X| = |Y], let G be the complete bipartite 
graph on the vertex set X UY. For each x € X, let >, be an ordering of Y and for 
each y € Y, let >, be an ordering of X. 


D19: Let M be a perfect matching in G. For any v € XUY, M(v) = wif (v,w) € M. 


D20: Relative to a perfect matching M of G, a pair (x,y) € X x Y is a blocking pair 
ify >: M(x) and «>, M(y). 


D21: A perfect matching M of G is stable if there are no blocking pairs relative to 
M. 


EXAMPLES 


E14: Figure 11.3.11 shows an instance of the stable matching problem with X = 
{1,2,3,4} and Y = {a,b,c,d}, along with the corresponding ordering for each member 
of X and Y. The matching {(1, a), (2,5), (3, c), (4, d)} is not stable: (2,a) is a blocking 
pair for this matching because a >2 6 and 2 >, 1. 


BwWNY Fe 
acrodc 
eos gk 
a ag Qs 
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BP wr 
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Figure 11.3.11: An instance of the stable matching problem. 
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E15: The matching {(1,d), (2,c), (3,0), (4,a)} is stable. To verify this, observe that 2 
and 3 are matched with their top choice, so they cannot be part of a blocking pair. The 
only potential blocking pair involving 1 is (1,6), but b prefers 3 to 1; finally, the only 
potential blocking pair involving 4 is (4,d), but d prefers 1 to 4. 


E16: The matching {(1, d), (2, a), (3, b), (4, c)} is also stable. That this is the only other 
stable matching can be verified by an exhaustive search. 


Gale-Shapley Algorithm 


Algorithm 11.3.4, discovered by Gale and Shapley, produces a stable matching for any 
complete bipartite graph on X UY with |X| = |Y|. Each iteration involves a free 
vertex x “proposing” to its most-preferred choice y € Y that has not yet rejected it. 
The algorithm is such that each vertex in X UY is either tentatively matched or free; 
vertices in X—the active side—may go from one of these states to the other, but once 
a vertex in Y—the passive side—becomes tentatively matched, it will stay so until the 
end (although its partner may change during the course of the algorithm). From the 
description of the algorithm, it is not obvious that the algorithm terminates (and it 
does so with a stable matching); furthermore, as stated, the algorithm needs to make a 
choice whenever there are multiple free vertices in X. However, it can be shown that 
the algorithm terminates with a stable matching, and that the outcome is the same 
regardless of the order in which the free vertices in X are examined. 


Algorithm 11.3.4: Stable Matching 


Input: Complete bipartite graph G = (X UY, X x Y), with |X| = |Y]. 
Output: Stable matching M. 
M:=90 
FREE :=XUY 
for each x € X, REJECT(x) =0 
While XN FREE #0 
pick any x € FREENX 
let y be the greatest (according to >,) in Y \ REJECT(z). 
If y € FREE 
Add (x,y) to M 
FREE := FREE \ {x,y} 
Else 
If c >, M(y) then 
FREE := (FREE \ x)U M(y) 
REJECT(M(y)) := REJECT(M(y)) U {fy} 
M :=[MU(z,y)]\ (M(y).¥) 
Else 
REJECT (a) := REJECT(a) U {y} 
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FACTS 
Proofs of the following facts about the GS algorithm may be found in the first chapter 
of [Gulr89]. 


F22: Given any instance of the stable matching problem, the Gale-Shapley (GS) 
algorithm terminates with a stable matching. In particular, there is always at least one 
stable matching. 


F23: The outcome of the GS algorithm does not depend on the order in which the free 
vertices in X are examined. 


F24: Say that x and y are stable partners if (x,y) € M for some stable matching M. 
In the stable matching computed by the GS algorithm, every x € X is matched with 
their most-preferred stable partner. Consequently, the GS algorithm finds an optimal 
stable matching for X. 


F25: In the stable matching computed by the GS algorithm, every y € Y is matched 
with their least-preferred stable partner. Consequently, the GS algorithm finds the worst 
stable matching for Y. 


F26: By reversing the roles of X and Y, we can obtain a symmetric algorithm that 
computes an optimal stable matching for Y. 


F27: Ifthe same stable matching is produced by applying the GS algorithm as pre- 
scribed in Facts F24 and F26, then the given instance has a unique stable matching. 


F28: Let M and M’ be two distinct stable matchings, and suppose that x and y are 
matched to each other in M. Then exactly one of the following three possibilities holds: 
(i) and y are matched to each other in M’; (ii) x prefers M’(x) to y and y prefers x 
to M'(y); (iii) x prefers y to M'(x) and y prefers M’(y) to x. 


F29: Suppose M and M’ are two distinct stable matchings. Define the assignment 
p(x) = max,,{M (x), M’(x)}. That is, assign to x the better of the two stable partners 
that x has (in M and M’). Then yp is a stable matching. (A priori it is not even clear 
that ps is a matching.) Similarly, define v(y) = min, , {M(y), M'(y)}. That is, assign to 
y the worse of the two partners that y has (in M and M’). It can be shown that v is a 
stable matching and that v and yp are identical. 


F30: Given any stable matchings M1, Mo,...,M; and any k& € {1,2,...,t}, let pz (z) 
be the assignment that gives each x its kth best partner (among the t stable partners, 
with multiplicity). Similarly, let vz, assign to each y the kth worst partner among its ¢ 
stable partners in M,, Mo,..., M:. Then uz and vz, are both stable matchings; moreover 
Uk = Vk. 


EXAMPLES 


E17: Applying the GS algorithm to the instance of Figure 11.3.11 results in the 
following. First, 1 proposes to b and is tentatively accepted; 2 proposes to c and is 
tentatively accepted; 3 proposes to 6, who accepts 3 and rejects 1; 1 then proposes to d 
and is tentatively accepted; 4 proposes to d and is rejected; 4 then proposes to a and is 
tentatively accepted. At this point none of the members of X is free and the algorithm 
terminates with the matching {(1,d), (2,c), (3,), (4,a)}. The symmetric execution of 
the GS algorithm with X and Y reversed results in the matching mentioned in Example 
E16. 
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E18: Figure 11.3.12 shows an instance of the stable matching problem with X = 
{1,2,3,4} and Y = {a,b,c,d}, along with the corresponding ordering for each member 
of X and Y. Consider the stable matchings M = {(1,b), (2,d), (3, a), (4,c)} and M’ = 
{(1,c), (2, a), (3, d), (4,0)}. With 2 = 2 and y = d, we see that x prefers M’ to M, 
whereas d prefers M to M’, consistent with case (ii) of Fact F28. The matching of 
Fact F29 is {(1, 6), (2, a), (3, d), (4,c)}, which is yet another stable matching. 
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Figure 11.3.12: An instance with 10 stable matchings. 


APPLICATIONS AND EXTENSIONS 


A7: The stable matching problem as formulated requires |X| = |Y| and a complete 
bipartite graph. These assumptions can be relaxed—one could formulate the stable 
matching problem on an arbitrary bipartite graph. In this case, each x € X has an 
ordering of I(x), the set of vertices it is adjacent to; similarly, each y € Y has an 
ordering of I'(y), the set of vertices it is adjacent to. One can think of I'(v) as the set 
of all vertices in the other set that are acceptable partners for v. A stable matching 
may not be perfect any longer. The only requirement for M to be stable is that no 
pair of vertices that are not matched with each other (in M) prefer each other to their 
respective partners (in M7). It can be shown that the vertices that are unmatched in 
any stable matching remain unmatched in all stable matchings. 


A8: Most applications of the stable matching model involve entities that are allowed 
to have multiple partners. For example, in the assignment of schools to students, each 
student is typically assigned to a single school, but schools are allowed to admit many 
students. This generalization can easily be accommodated in the stable matching model 
by allowing each y € Y to have an integer quota q, > 1. A pair (x, y) blocks the matching 
M if x prefers y to M(x) and y prefers x to at least one student in the set of students 
M(y). As Gale and Shapley observed [GaSh62], one can reduce this to the classical 
one-to-one matching problem by making q, copies of y, and by letting each copy have 
the same preference ordering over X as the original y. Each « € X now has a preference 
ordering over the copies in a consistent way: if y >. y’ then any copy of y is preferred 
by «x to any copy of 7’; also the copies of a given y are always ranked 1,2,...,qy. It is 
easy to verify that any stable matching of the expanded instance is stable in the original 
instance and vice versa. 


A9: The stable matching problem can be posed on a general graph with an even 
number of vertices. In this model, each vertex has an ordering over the other vertices, 
and the objective is to find a matching with no blocking pair. Unlike the bipartite case, 
a stable matching may not exist. However, it is possible to decide in polynomial time 
whether or not a given instance admits a stable matching, and much is known about 
the structure of such instances. See Gusfield and Irving [Gulr89] for additional details. 
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A10: The most celebrated application of the stable matching model is the National 
Resident Matching Program (NRMP) in the United States which assigns medical interns 
(or residents) to hospitals. Roth [Ro84] discusses the early history of this program and 
observes that the assignment algorithm used by this centralized matching program was 
in fact equivalent to the hospital-optimal version of the GS algorithm. It is astonishing 
that this algorithm had been in use from 1951, predating the seminal paper of Gale 
and Shapley by more than a decade. Roth and Peranson [RoPe99] led the effort to 
redesign the NRMP matching algorithm, which is now a variant of the resident-optimal 
GS algorithm. 


All: A relatively recent application of the stable matching algorithm occurs in the 
assignment of school students to public schools in various US cities that have active 
school choice programs. In New York City, a variant of the student-optimal GS al- 
gorithm is used to assign students to specialized high schools. Students submit their 
preferences to a central authority, and school preferences are determined by various 
factors such as grades, test-scores, etc. An important new element in some districts is 
that school preferences have indifferences (or ties) and the algorithm has to be enhanced 
to accommodate such preferences. See Abdulkadiroglu et al. [AbPaRo09] for additional 
details. 


REMARKS 


R8: The stable matching model was introduced by Gale and Shapley [GaSh62]. The 
books by Gusfield and Irving [Gulr89] and by Knuth and Goldstein [KnGo97] deal 
with algorithmic and structural aspects of the stable matching problem and their gen- 
eralizations. For an overview of applications, especially in economics, see Roth and 
Sotomayor [RoSo91]. Subramanian [Su94] and Feder [Fe95] apply a fixed-point ap- 
proach to the study of stable matchings and discuss applications to circuit complexity. 
Fleiner [F103] discusses substantive generalizations of the basic model that are amenable 
to a fixed-point approach. 


R9: The stable matching problem can be formulated as an integer programming prob- 
lem. Vande Vate [Va89] and Rothblum [Ro92] discuss linear programming formulations 
of the stable matching problem that are integral; these results have been generalized by 
Teo and Sethuraman [TeSe98], who give further insight into the structure of fractional 
stable matchings. 
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INTRODUCTION 


Graph Pebbling is a network optimization model for the transportation of resources 
that are consumed in transit. Electricity, heat, or other energy may dissipate as it moves 
from one location to another, oil tankers may use up some of the oil they transport, or 
information may be lost as it travels through its medium. The central problem in this 
model asks whether discrete pebbles from one set of vertices can be moved to another 
while pebbles are lost in the process. A typical question asks how many pebbles are 
necessary to guarantee that, from any configuration of that many pebbles, one can move 
a pebble to any particular vertex. This section will describe this question and other 
variations of it, and will present the main results and applications in the theory. Good 
surveys of the subject can be found in [Hu05, Hul2, HurlGPP]. 

All graphs considered are simple and connected. 


11.4.1 Solvability 


Here we develop the notion of moving from one configuration of pebbles to another 
via pebbling steps. 


NOTATION 


The set of nonnegative integers is denoted by NV’. We use n = n(G) to denote the 
number of vertices of a graph G. When H is a subgraph of G,, we write G— H to denote 
the graph having vertices V(G — H) = V(G) and edges E(G — H) = E(G) — E(#). 
The eccentricity at vertex r, diameter, girth, connectivity, and domination number of 
a graph G are written eccg(r), diam(G), gir(G), «(G), and dom(G), respectively, while 
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distg(u,v) denotes the distance between vertices u and v in G (we may write ecc(r) 
and dist(u,v) when G is understood). Also, the minimum degree of G is denoted 6(G) 
and we write lg for the base 2 logarithm. 


DEFINITIONS 
D1: A configuration C on a graph G is a function C : V(G)>N. The value C(v) 
signifies the number of pebbles at vertex v. We also write C(S) = >), gC(v) for a 


subset S C V(G) of vertices. 


D2: For an edge {u,v} € E(G), if u has at least two pebbles on it, then a pebbling 
step from u to v removes two pebbles from u and places one pebble on v. That is, if C 
is the original configuration, then the resulting configuration C’ has C’(u) = C(u) — 2, 
C’(v) = C(v) +1, and C’(x) = C(z) for all x € V(G) — {u,v}. 


D3: A pebbling step from u to v is r-greedy if dist(v,r) < dist(u,r). It is r- 
semigreedy if dist(v,r) < dist(u,r). 


D4: We say that a configuration C' on G is r-solvable if it is possible from C to place 
a pebble on r via pebbling steps. It is r-wnsolvable otherwise. 


D5: More generally, for a configuration D, we say that C' is D-solvable if it is 
possible to perform pebbling steps from C to arrive at another configuration C’ for 
which C’(v) > D(v) for all v € V(G). It is D-unsolvable otherwise. We denote by 
G(S) the directed subgraph of G induced by a set S of pebbling steps. 


D6: We say that a configuration C' on G is k-fold r-solvable if it is possible from C 
to place k pebbles on r via pebbling steps. 


NOTE: The k-fold r-solvability of C is the specific instance of D-solvability for which D 
has k pebbles on r and none elsewhere. 


D7: The size |C| of a configuration C on a graph G is the total number of pebbles on 
G; ie, |C] = Vreviay Cl). 


D8: For a graph G and a particular root vertex r, the rooted pebbling number 
m(G,r) is defined to be the minimum number ¢ so that every configuration C on G of 
size t is r-solvable. 


D9: A sequence of paths P = (P[1],..., P[h]) isa maximum r-path partition of a 
rooted tree (T,r) if P forms a partition of E(T), r is a leaf of P[1], T; = U5_, Ply] is a 
tree for all 1 <i <h, and P[é] is a maximum length path in T — T;_1, among all such 
paths with one endpoint in T;_;, for all 1 <i<h. 


D10: We define the function f(T,r) = Sn ai —h+1, where (l,...,l,) is the 
sequence of lengths J; = diam(P{i]) in a maximum r-path partition P of a rooted tree 


(T,r). Also, set f,(T,r) = f(T,r) + (k-1)2". 


D11: A thread in a graph G is a subpath of G whose vertices have degree two in G. 
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EXAMPLES 


El: Figure 11.4.1 shows two r-unsolvable configurations (of maximum size, right) on 
the path with 7 vertices. 


11002 0 7 30000 0 15 
o—-o—__o_ o_o —_0—_0 6—»—_1__0__4—_0—__6 
T r 


Figure 11.4.1: Two r-unsolvable configurations on the path P7. 


E2: Figure 11.4.2 shows a maximum sized r-unsolvable configuration on a tree. 


Figure 11.4.2: An r-unsolvable configuration on a tree. 


E3: Figure 11.4.3 shows an r-solvable configuration on the 4-cycle with pendant edge. 


Figure 11.4.3: An r-solvable configuration. 


FACTS 


F1: If A is a connected, spanning subgraph of a graph G then m(H,r) > a(G,r) for 
every root vertex r. 


F2: Every graph G on n vertices has rooted pebbling number 7(G,r) > n for every 
root vertex r. 


F3: The complete graph K,, on n vertices has rooted pebbling number 7(Ky,,1r) = n 
for every root vertex r. 


F4: Every graph G on n vertices has rooted pebbling number 1(G,r) > 2°¢“) for 
every root vertex r. 


Section 11.4. Graph Pebbling 1431 


F5: The path P,, on n vertices has rooted pebbling number 7(P,,,7) = 2-1 when r is 
one of its leaves. 


F6: Every graph G on n vertices has rooted pebbling number 1(G,r) < (n—1)(2°¢e™ — 
1) +1 for every root vertex r. 


F7: [Ch89] If (7,r) is a tree with root r then 7(T,r) > f(T,r). 


F8: [No-Cycle Lemma] [BuChCrMiWe08] If a configuration C is D-solvable then 
there exists a D-solution S for which G(S) is acyclic. 


F9: [Squishing Lemma] [BuChCrMiWe08] For every root vertex r of a graph G there 
is a maximum-sized r-unsolvable configuration such that, on each thread not containing 
r, all pebbles sit on one vertex or two adjacent vertices. 


Weight Functions 


Weight functions can be used to provide upper bounds on rooted pebbling numbers of 
graphs. 


DEFINITIONS 


D12: For a tree T rooted at a vertex r we define the parent of vertex v € V(T) — {r} 
to be the unique neighbor v* of v for which dist(vt,r) = dist(v,r) — 1. We say also 
that v is a child of vt. 


D13: We say that a rooted subtree (T,1) of (G,r) is an r-strategy if associated with 
it is a weight function w : V(G)>N having the properties that w(v) = 0 for all 
v ¢ V(T) and w(ut) > 2w(v) for every vertex v # r. The r-strategy T is basic if 
equality holds for all such v € V(T). 


D14: For a rooted graph (G,r) with r-strategy (T,w), we say that the weight of a 
vertex v is w(v) when v € T and 0 otherwise, and define the weight of a configura- 
tion C on G to be 

w(C)= S> C(v)w(r). 


vEV(G) 


NOTATION: We denote by J, the configuration on any rooted graph (G,r) having no 
pebbles on r and one pebble on every other vertex. Furthermore, let wt denote the 
weight function for any breadth-first search spanning tree (G,r), where wt(r) = 1; that 
is, wt(v) = 2—4st(-") for all v € V(G). 


D15: For a rooted graph (G,r) on n vertices, let C be the set of all r-unsolvable 
configurations on G, viewed as points in N"~!: each C € C is identified with the 
coordinates (C(v2),...,C(vn)), where V(G) = {r,v2,...,Un}. The convex hull of C 
is called the r-unsolvability polytope of G, denoted U(G,r). Define the r-strategy 
polytope T(G,1r) by the set of linear inequalities given by the Weight Function Lemma 
over all r-strategies. 
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FACTS 


F10: If C is a configuration on the rooted graph (G,r) and C” is the configuration 
obtained from C' after a pebbling step from u to v then, for any r-strategy (T,w) of 
(G,r) containing the edge {u,v}, we have w(C’) < w(C), with equality if and only if 
w(v) = 2w(u) (when w = wt this means that the step is greedy). 


F11: If C is an r-solvable configuration on G then wt(C) > 1. 


F12: A configuration C on a path rooted at a leaf r is r-solvable if and only if 
wt(C) > 1. 


F13: Every r-strategy is a conic combination of basic r-strategies; that is, for every r- 
strategy (T, w) of a rooted graph (G, 1), there are basic r-strategies (T1,w1),..-, (Zn, Wn) 
of (G,r) and nonnegative coefficients a1,...,a, so that, for all v € v(G), we have 


w(v) = oj, wi(v). 


F14: [Weight Function Lemma] [Hu10] Let (T,w) be an r-strategy of the rooted 
graph (G,r) and suppose that C is an r-unsolvable configuration on G. Then w(C) < 
w(J;-). 


EXAMPLES 


E4: Figure 11.4.4 displays the upper bound for the rooted tree (T,7r) of Figure 11.4.2 
given by a basic r-strategy: 7(T,1r) = 320. 


Figure 11.4.4: A rooted tree with a basic r-strategy. 


E5: Figure 11.4.5 displays the lower and upper bounds for the rooted cycle (C7,r) 
given by an r-unsolvable configuration and basic r-strategy, respectively: 7(C7,r) = 15. 


Figure 11.4.5: A rooted cycle with (left) its maximum r-unsolvable configuration and 
(right) two basic r-strategies. 
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E6: Figure 11.4.6 displays the upper bound for a rooted Petersen graph (P,r) given 
by three basic r-strategies: 1(P,r) = 10. 


Figure 11.4.6: Three basic r-strategies of a rooted Petersen graph. 


FACTS 


F15: [Ch89] If (T,r) is a tree with root r then 7(T,r) > f(T,r) (and hence a(T,r) = 
f(L,r)). 


F16: Every rooted graph (G,r) has rooted pebbling number 7(G,r) < f(T,r) for any 
breadth-first search spanning tree T of G rooted at r. 


F17: [CuHiHuMo09] Let G be a graph in which each of its blocks is a clique, and 
suppose that T is a breadth-first search spanning tree of G rooted at r. Then 1(G,r) = 
m(T,r). 


F18: [PaSnVo95] For every root vertex r in the cycle C,, we have 7(C2..-) = 2* for all 
k > 2 and m(Cox41,7) = [(2**? — 1)/3] for all k > 1. 


F19: {Hul0] For a polytope P of configurations on a rooted graph (G,r) define 
zp(G,r) = maxcep |C| and mp(G,r) = |zp(G,r)| +1. The Weight Function Lemma 
implies that U(G,r) C T(G,r), and hence 2(G,r) = tu (G,r) < mr(G,r). 


F20: [Uniform Covering Lemma] [Hul0] Let G be a graph on n vertices. If some 
collection of r-strategies {(T;,w;)}*_, has the property that there is a constant ¢ such 
that, for every v € V(G) — {r}, we have ee w;(v) =c, then 7(G,r) =n. 


F21: [Ch89] The d-dimensional cube Q¢ has 7(Q?¢,r) = n(Q?) = 2% for every root 
vertex 1. 

RESEARCH PROBLEMS 

RP1: Is there a characterization for r-solvable configurations on trees rooted at r? 
RP2: Is mp(G,r) < 27(G,r) for every rooted graph (G,r)? 


RP3: Find larger classes of strategies than those arising from trees. 
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Complexity 


Here we discuss questions such as how long it takes to decide if a particular configuration 
C on a graph G is D-solvable, or to calculate 7(G,r) for a rooted graph (G,r). 


DEFINITIONS 


D16: A graph G is a split graph if its vertices can be partitioned into a clique K and 
an independent set J. 


D17: Let H be a hypergraph with vertices V(H) and edges E(H) = {e1,...,ex}. For 
a given t define the pebbling graph G = G;(H) as follows. The vertices of G are 
given by V(G) = V(H)UE(H) Uf{u,..., ug} U {r, wi,..., we}. The edges of G include 
ve; € E(G) for every v € e;, as well as the paths w,u,;e; for every i < k and the path 
TW Wz. 


D18: We define SOLVABLE to be the problem of deciding, for configurations C and D 
on a graph G, if C' is D-solvable. 


D19: We define UPPERBOUND to be the problem of deciding, for given k and config- 
uration D on a graph G, if 7(G, D) <k. 


FACTS 


F22: [MiCl06] The configuration C is D-solvable on G if and only if there is a 
nonnegative integral solution to the system {C(u) + 0 ,ey (vu — 28u,v) => D(u) for all 
u € V}. Hence SOLVABLE € NP. 


F23: [Huki05] Let 1 be a 4-uniform hypergraph on 2°*? vertices with pebbling graph 
G = G.(H). Define the configuration C on G by C(v) = 2 for all v € V(H) and 
C(v) = 0 otherwise. Then C is r-solvable if and only if H has a perfect matching. 
Hence SOLVABLE is NP-complete. 


F24: [CuLeSiTal2] When restricted to the class of diameter two graphs, SOLVABLE 
remains NP-complete. 


F25: [CuDiLel2] When restricted to the class of planar graphs, SOLVABLE remains 
NP-complete. 


F26: [CuDilLel2] When restricted to the class of diameter two planar graphs, SOLVABLE 
EP. 


F27: [MiCl06] UPPERBOUND is complete for the class of decision problems computable 
in polynomial time by a co-NP machine equipped with an NP-complete oracle (II5- 
complete). 


F28: [BuChCrMiWe08] If (7,1) is a rooted tree then 7(T, 7) can be calculated in linear 
time. Moreover, for any configuration C’, in linear time, we can find an r-solution or 
determine that none exists. 


RESEARCH PROBLEMS 
RP4: Is r-SOLVABLE € P when restricted to the class of cubes? 


RP5: Is r-SOLVABLE € P when restricted to the class of split graphs? 
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11.4.2 Pebbling Numbers 


We turn our attention now to configurations that solve every possible root. 
DEFINITIONS 


D20: We say that a configuration C on G is (k-fold) solvable if it is (k-fold) r-solvable 
for every vertex r. 


D21: The pebbling number 7(G) is defined to be the minimum number ¢ so that 
every configuration C' on G of size t is solvable. 


D22: For two graphs G, and G2, define the cartesian product GnG2 to be the 
graph with vertex set V(GyoG2) = {(v1,v2)|u1 € V(Gi),v2 € V(G2)} and edge set 
E(GioG2) = {{(v1, v2), (wi, w2)}|(v1 = wi and (v2,we) € E(G2)) or (vg = we and 
(v1,wi) € E(G1))}. We write IT*_,G; to mean Gjo...0G, and set G* = II#_,G. 


D23: The support s(C) of a configuration C on G is the set of vertices that have a 
pebble of C; i.e., s(C) = {v € V(G) | C(v) > 0}. The size of the support is denoted 
o(C) =|s(C)I.- 


D24: A graph G has the 2-pebbling property if every configuration C of size at least 
2n(G) — o(C) +1 is 2-fold solvable. A Lemke graph is any graph that does not have 
the 2-pebbling property; the smallest of these is called the Lemke graph. 


D25: For n = 2k(+1), the sun S,, is the split graph with perfect matching joining 
I =I, to K = kK; (and one extra leaf when n is odd). 


D26: For m > 2t+1 the Kneser graph K(m,t) has as vertices all t-subsets of 


{1,2,...,m} and edges between every pair of disjoint sets. For example K,, = K(n, 1) 
and P = K(5,2). 


EXAMPLES 
E7: The complete graph K,, on n vertices has 7(K,) =n. 


E8: The complete graph K,, has the 2-pebbling property because the maximum number 
of pebbles that can be placed on o vertices without having two vertices with at least 
two pebbles or one vertex with at least four pebbles is o + 2, which is stricly less than 
2n-—o+1. 


E9: The smallest graph without the 2-pebbling property is the Lemke graph L, shown 


in Figure 11.4.7. We have 7(L) = 8 and |C| = 12 = 2(8) —5+1, but C cannot place 
two pebbles on r. 


1 


Figure 11.4.7: The Lemke graph with a 2-fold r-unsolvable configuration. 
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FACTS 


F29: [Ch89] The path P, on n vertices has 7(P,) = 2”~'. More generally, let r* be a 
leaf of a longest path in a tree T. Then 7(T’) = x(T,1*). 


F30: [Hu99] The Petersen graph P has 1(P) = 10. 


F31: [PaSnVo95] The cycle C,, on n > 3 vertices has 7(Co,,) = 2" for all k > 2 and 
m(Coe41) = [(2*t? — 1)/3] for all k > 1. 


F32: [AlGuHul2] If G is a diameter 3 split graph then 7(G) is given as follows. Let x 
be the number of cut vertices of G and, for a vertex r, define 6*(G,r) to be the minimum 
degree of a vertex at maximum distance from r. 


1. Ifx > 2 then 
mG) =n+x+4+2. 
2. Ifx = 1 then 
“(@)={ n+5—06* ifr isa leaf with ecc(r) =3 and 6* = 06*(G,r) < 4; 


n+1 otherwise. 
3. If x = 0 then 
n+4-—0* if there is a cone vertex r with deg(r) = 2, 
(G) = ecc(r) = 3, and 6* = 0*(G,r) < 3; 
= ~ ) n+l if no such r exists and G is Pereyra; 
n otherwise. 


F33: [Ch89] The d-dimensional cube Q¢ has 7(Q¢) = 2¢. More generally, let G = 
T*_,Pi,41 be the cartesian product of k paths of lengths 1; = diam(Pi+i+1), with 
t= 8k. Then x(G) = 2!. 


F34: [FoSn00, He03] If G; and G2 are both cycles then 7(Gi5G2) < 1(G1)a(G2). 
F35: [Ch89, FoSn00] If G; and G2 are both trees then 7(Gi0G2) < m(G1)m(G2). 


F36: [Ch89, Mo92, He08] If G is a tree, cycle, complete graph, or complete bipartite 
graph and H has the 2-pebbling property then 7(GaoH) < 7(G)n(#). 


F37: [Wa01] None of the graphs in Figure 11.4.8 has the 2-pebbling property. 


Figure 11.4.8: An infinite family of Lemke graphs. 
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F38: [GaYil2] If G is a bipartite graph with largest part size s > 15 and minimum 


degree at least [+] then G is Class 0 and has the 2-pebbling property. 


F39: [CzHukKiTr02] If G; and G2 are connected graphs on n vertices that satisfy 
6(G;) > k and k > 24"/F+15 | then 1(GinG2) < (G1)(Go). 


F40: If G is a graph on n vertices with diam(G) = d then e,(G) > d/Ign. 


RESEARCH PROBLEMS 
RP6: Does every bipartite graph have the 2-pebbling property? 


RP7: [Graham’s Conjecture] Every pair of graphs G; and G2 satisfy 7(G oG2) < 
m(G1)m(G2). 


RP8: Is 7(L?) = 64? 


Diameter, Connectivity, and Class 0 


In this subsection we study graphs having smallest possible pebbling number. 
DEFINITIONS 
D27: A graph G is of Class 0 if 7(G) =n. 


D28: The k*" graph power G“) of a graph G is formed from G by adding edges 
between every pair of vertices of distance at most k in G. 


D29: The pyramid is any graph on 6 vertices isomorphic to the union of the 6-cycle 
(r,a,p,c,q,b) and the (inner) triangle (a,b,c). A near-pyramid is a pyramid minus 
one of the edges of its inner triangle (a, b,c). 


D30: A graph G is pyramidal if it contains an induced (near-) pyramid, having 
6-cycle C and inner (near-) triangle kK, and can be drawn in the plane so that 


1. the edges of kK are drawn in the interior of the region bounded by C’' and 


2. every other edge of G can be drawn inside the convex hull of exactly one of the 
sets {r,a,b}, {p,a,c}, {¢,b,c}, or {a,b,c}. 


D31: Define the pebbling exponent e,(G) of a graph G to be the minimum k such 
that G) is Class 0. 
EXAMPLES 


E10: Complete graphs, balanced complete bipartite graphs, cubes, and the Petersen 
graph are all Class 0. 


E11: If wis acut vertex of G then, for vertices r and v in different components of G—u, 
the configuration C' with C(r, u,v) = (0,0,3) and C(w) = 1 otherwise is r-unsolvable 
of size n. 
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E12: Let G be pyramidal with 6-cycle (r, a, p,c,q, 0) and inner (near-) triangle (a, b,c). 
Then the configuration C with C(r, a, p,c, q, 6) = (0,0,3,0,3,0) and C(v) = 1 otherwise 
is r-unsolvable of size n. 
FACTS 
F41: [ChGo03] Set d = diam(G). Then 

1. r(G) < (n—d)(24-1) +1, 


2. m(G) < (n+ [354] —1)241-n+2, and 
3. m(G) < 24 1(n + 2dom(G)) — dom(G) + 1. 


The inequalities in parts 1 and 2 are sharp, and the coefficient of 2 in part 3 can be 
reduced to 1 in the case of perfect domination. 


F42: If G is Class 0 then «(G) > 2. 
F43: [BlCzFuHeHuScl2] If G is Class 0 with n vertices and e edges then e > |3n/2]. 


F44: [PaSnVo95] If G is a graph with n vertices and e edges and e > ee) + 2 then 
G is Class 0. Because the complete graph K,_; plus a pendant edge has a cut vertex, 
this result is tight. 


F45: [PaSnVo95] If diam(G) = 2 then 7(G) < n(G) +1. 


F46: [ClHoHu97] If diam(G) = 2 and «(G) > 2 then 7(G) = n + 1 if and only if G is 
pyramidal. 


F47: [ClHoHu97] If diam(G) = 2 and «(G) > 3 then G is Class 0. 


F48: [CzHukiTr02] There is a function k(d) < 2?4+3 such that if G is a graph with 
diam(G) = d and k(G) > k(d) then G is of Class 0. Moreover, k(d) > 24/d. 


F49: [CzHukiTr02] For any constant c > 0 there is an integer to such that, for t > to, 
s > c(t/lgt)!/? and m = 2t+s, we have «(K(m,t)) > 2?4+3, where d = diam(K(m, t)); 
hence K(m,t) is Class 0. 


F50: [CzHukiTr02] Let G € G(n,p) be a random graph on n vertices with edge 
probability p and let d = diam(G). If p > (nlgyn)!/4/n then Pr[«(G) > 2?4+3)]-1 as 
n—00; hence Pr/G is Class 0]>1 as noo. 


F51: [AlGuHul2] If G is a split graph with 6(G) > 3 then G is Class 0. 
F52: [Hul0] The pebbling exponent of the cycle satisfies 


n/2 


n/2 Z 
~ Ign —lIglen 


< 
lgn — en(Cn) 


F53: [PoStYe] If diam(G) = 3 then 1(G) < |[3n/2|+2, which is best possible, as shown 
by the sun S;. 


F54: [PoStYe] If diam(G) = 4 then 1(G) < 3n/2 +c, for some constant c. 
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F55: [Pol2] If diam(G) = d then 1(G) < (2/4/21 — 1)n/[d/2] +c, for some constant c. 


F56: [CzHu03] There is a constant c so that if 6(G;) > en/|gn for i € {1,2} then 
G\o0G>2 is Class 0. 


F57: [CzHu06] Let go(n) denote the maximum number g such that there exists a Class 
0 graph G on at most n vertices with finite gir(G) > g. Then for all n > 3 we have 


| Vea n)/2+1/4 - 1/2] < go(n) <1 + 2lgon. 


RESEARCH PROBLEMS 


RP9: Find infinitely many Class 0 graphs with n vertices and at most 3n/2 + o(n) 
edges. 


RP10: Decide if K(m,t) is Class 0 for all m = 2t+s with s € O((t/Igyt)'/?). 


RP11: Find the smallest k(d) such that G is Class 0 for every diameter d graph G 
with «(G) > k(d). 


Complexity 


Calculating 7(G) and 7(G,r) are polynomially equivalent, but it may be possible to 
calculate 7(G) faster than by calculating 7(G,r) for every r. 


DEFINITION 


D32: We define PEBBLINGNUMBER to be the problem of deciding if 7(G) < k. 


FACTS 


F58: [HeHeHu13] Calculating 7(G) when G is a diameter two graph can be done in 
O(n*) time. 


F59: [AlGuHul2] Calculating 7(G) when G is a split graph can be done in O(n’) time, 
where w & 2.376 is the exponent of matrix multiplication and 6 = 2w/(w+1) = 1.41. 
RESEARCH PROBLEM 


RP12: Is PEBBLINGNUMBER € P when restricted to interval graphs of fixed diameter? 


11.4.3. Optimal Pebbling 


While pebbling can be thought of as a worst-case scenario — we give an adversary 
enough pebbles so that we can solve the graph no matter how he arranges them — 
optimal pebbling can be considered a best-case scenario — we place few pebbles carefully 
so as to solve the graph. 
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DEFINITIONS 


D33: The optimal pebbling number x*(G) is the minimum number ¢ for which 
there exists a solvable configuration of size t. 


D34: Let C be a configuration on G and suppose that deg(v) = 2 and C(v) > 3. A 
smoothing move at v removes two pebbles from v and adds one pebble to each of 
its neighbors. A smooth configuration has no smoothing move available; that is, C is 
smooth if C(v) < 2 whenever v has degree 2. 

D35: For S C V(G), the operation of collapsing S forms a new graph H in which S 


is replaced by a single vertex that is adjacent to all the neighbors of vertices of S that 
are in V — S. (Note that S need not be connected.) 


EXAMPLES 


E13: Figure 11.4.9 displays the upper bound of 7* (Ps) < [2(8)/3]. 


0 2002 00 2 
o—o—__o—_e o_o —_ 00 


Figure 11.4.9: A minimum solvable configuration on the path Px. 


E14: The configuration with 2 pebbles on a single vertex can reach any other vertex 
of the complete graph, and so 7*(K,) = 2 for all n. 


FACTS 
F60: Every graph G satisfies 7*(G) < 2dom(G). 


F61: [Smoothing Lemma] [BuChCrMiWe08] If G has at least 3 vertices then G has 
a smooth minimum solvable configuration with no pebbles on leaves. 


F62: [Collapsing Lemma] [BuChCrMiWe(08] If H is obtained from G by collapsing 
sets of vertices then 1*(G) > 1*(H). 


F63: [BuChCrMiWe08] Every graph G satisfies 7*(G) < [2n/3], with equality for 
paths and cycles. 


F64: [Mo98] The d-cube has (4/3)¢ < 1*(Q¢) < (4/3)4+ 0089, 
F65: [FuSh00, HeHeHul11] For all graphs G and H we have 7*(GoH) < 1*(G)n*(H). 


F66: [BuChCrMiWe08] If G has n vertices and 6(G) = k, then 7*(G) < rae 


F67: [BuChCrMiWe08] For all ¢ > 1, k = 3t and n > k +3, there is a graph G with n 
vertices, 6(G) = k and m*(G) > (2.4 SECT o(1)) gyr- 
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Complexity 


DEFINITION 


D36: We define OPTIMALPEBBLINGNUMBER to be the problem of deciding if 7*(G) < k. 


FACT 


F68: [MiCl06] The problem OPTIMALPEBBLINGNUMBER is NP-complete. 


RESEARCH PROBLEMS 
RP13: Is there a graph G with 1*(G) > 3n(G)/(6(G) + 1)? 


RP14: Does 6(G) > 3 imply that 7*(G) < [n(G)/2]? 


11.4.4 Thresholds 


The probabilistic model of pebbling studies the typical case; that is, small configura- 
tions are usually unsolvable and large configurations are usually solvable — at roughly 
how many pebbles is the transition? 

We assume that all sequences G = (Gi,...,Gx,...) of graphs considered have an 
increasing number of vertices n = nx = n(Gx). 


NOTATION 

The sequences of complete graphs, stars, paths, cycles, and cubes are denoted K, 
S, P, C, and Q, respectively. The sequence of graph products is written GoH = 
(GjoM,,...,GpoH,,...), with G? = GoG. For sets of functions A and B on the in- 
tegers we write A < B to mean that a € O(b) for every ac A,bE B. 


DEFINITION 


D37: Let Cy : [n| NV denote a configuration on V(G;) and, for a function h : N>N 
and fixed n = nx, define the uniform probability space X,,,, of all configurations C; of 
size h = h(n). Denote by P** the probability that C,, is solvable on G;, and let t : N>N 
be any function. We say that t is a pebbling threshold for G, and write 7(G) = O(t), 
if P*—>0 whenever h(n) < t(n) and P}->1 whenever h(n) > t(n). 

EXAMPLE 

E15: Solvability on K;, is equivalent to the labelled version of Feller’s Birthday Problem 
(also hashing collisions in computer science). Thus 7(K) = O(./n). 

FACTS 

F69: [BeBrCzHu03] Every graph sequence G has nonempty threshold 7(G). 


F70: [CzEaHuKa02] Every graph sequence G satisfies T(KC) S 7(G) S 7(P). 
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F71: [CzEaHuKa02, BeBrCzHu03, GoJaSaWi04, CzHu08] For every constant c > 1, 
we have T(P) C 2 (nove ne) are) (noevier 
F72: [BeHu08] The sequence of squares of cliques have threshold 7(K?) = O(,/n). 


F73: [Al03, CzWa03] For all « > 0 the sequence of cubes has threshold 7(Q) € 
Q(n*~*) N O(n/(Iglgn)!~£). 


F74: IfG is a sequence of graphs of bounded diameter then T(G) C O(n). 


F75: [CzHu08] Let ¢; and tz be functions satisfying T(K) S ti < te S$ O(n). Then 
there is some graph sequence G such that t1 S 7(G) S ta. 


F76: [BjHol2] There exist graph sequences G = (Gj,...,Gz,...) and H = (HMj,..., 
Hy...) such that 7(G,) < 7(A;) for all & but T(H) S 7(G). 


F77: [CzHu06] Suppose that t € 7(P) and s € 7(P?). Then s(n) € O (i (vn)’). 


F78: [CzHu03] Define G(n, 6) to be the set of all connected graphs on n vertices having 
minimum degree at least 6 = d(n). Let Gs = {Gi,...,Gz,...} denote any sequence 
of graphs with each G, € G(k,5). For every function n!/? < 5 = d(n) < n—-1, 
T(Gs) © O(n3/?/5). In particular, if in addition 6 € Q(n) then 7(Gs) = O(n'/?). 
REMARKS 


R1: Note the need to rescale threshold functions of products of graph sequences in 
terms of the new number of vertices n(G,aH;,) = n(G,)on(H;,); for example, in Fact 


F72 we have Vn? = /n’. 

RESEARCH PROBLEMS 

RP15: Determine 7(P). 

RP16: Determine 7(Q). 

RP17: Extend Fact F75 to the range Q(n)M 7(P). 


RP18: Suppose that G is any graph sequence, t € 7(G) and s € 7(G?). Is it true that 


s(n) € 0 (t(Vn)")? 


11.4.5 Other Variations 


Here we present a few variations on the pebbling theme and a taste of the main 
results for each. 


NOTATION 
We write kI, for the configuration with k pebbles on v and 0 elsewhere, and J for the 
configuration with 1 pebble on each vertex. Also k.¥ denotes the set of all such k,, and 
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@;, is the set of all configurations of size t. Denote by .@(G) the set of all configurations 
corresponding to dominating sets in a graph G; that is, the configuration corresponding 
to a dominating set has one pebble on each of its vertices and none elsewhere. Next, 
consider the set of all induced paths on d +1 vertices in G and write A} (G) for those 
configurations on such paths with two pebbles on one leaf of the path, no pebbles on the 
other leaf, and one pebble on all other vertices of the path. Finally, for d = (di,...,dm) 
let P* denote the graph Py,+410---oPa,,41. 


DEFINITIONS 


D38: Let G be a weighted graph with edge weights w : E(G)N. For an edge 
{u,v} € E(G), if u has at least w(uv) pebbles on it, then a weighted pebbling step 
from u to v removes w(uv) pebbles from u and places one pebble on v. The corre- 
sponding weighted pebbling number 1(G\,,) is defined to be the minimum number t 
so that every configuration of size t solves any r via weighted pebbling steps. 


D39: Fora graph G and set of configurations J on G the (optimal) pebbling number 
m(G, Y) (resp. 7*(G, Y)) is the minimum t for which every (resp. some) C € &, is D- 
solvable for every D € 9. The k-fold (optimal) pebbling number 1;,(G) = 1(G,k.Z) 
(resp. 12 (G) = 1*(G,k)). 


D40: For a graph G the fractional (optimal) pebbling number is defined to be 
i(G) = limg-yco TWe(G)/k (resp. 7*(G) = limp_,o0 77 (G)/k). 


D41: The cover pebbling number of a graph G is defined to be 7(G, J). A configu- 
ration D is positive if D(v) > 0 for every vertex v. For positive D on G we define the 
function s(G, D) = max, >, D(u)2%*(»), 


D42: For a set of configurations Z the configuration C is weakly Y-solvable if C 
solves some D € Y. The target pebbling number x~(G,Y) is the minimum ¢t for 
which every C' € ©, is weakly D-solvable. 


D43: The domination target pebbling number of G is defined to be 17 (G,.@). 


D44: For given d the distance pebbling number 7a(G) of a graph G is defined to 
be the minimum ¢ such that, for every size ¢ configuration, there is some pebble that 
can move to a vertex at distance d from where it started; in other words, 7a(G) = 


n(G, Pr). 


D45: A rubbling step on a graph G is either a pebbling step or a strict rubbling 
step. A strict rubbling step takes one pebble from each of two neighbors u and w 
of a vertex v and places one pebble on v. The (optimal) rubbling number of G, 
denoted p(G) (resp. p*(G)), is defined to be the minimum t so that every (resp. some) 
configuration of size t can solve any root vertex r via rubbling steps. 


EXAMPLES 


E16: For a positive configuration D on the complete graph we have s = s(K,,D) = 
2|D| — min D. If D(v) = min D then the configuration that places s — 1 pebbles on v 
and none elsewhere cannot solve D. 


E17: For a vertex v € V(G), the configuration that places no pebbles on v and all its 
neighbors and one pebble on every other vertex cannot solve any dominating set of G. 
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E18: The pigeonhole principle implies that 71(G) = n(G) + 1 for every graph G. 


E19: Every configuration of two pebbles on K,, solves every vertex via rubbling steps. 


FACTS 


F79: [Ch89] Given d = (d,,...,dm) we represent the vertices of P¢ by coordinates 
(U1,---; Um) with each 0 < vu; < d; and denote by e; = (0,...,1,...,0) the ith standard 
basis vector. For any w = (wj,.. 1 Wm) define the weight function w(uv) = w; when 
|u — v| = e; and write w? = J], wt’. Then (P92) = w?. 

F80: [Ch89] For every root r of a tree T’ we have m(T,r) = fx (Tr) for all k. 


F81: [HeHeHul3] If G is a diameter two graph with n vertices and m edges then 
™K(G) < n(G) + 4(k — 1). Furthermore, from any configuration of size at least 7(G) + 
4(k—1), k pebbles can be placed on any root vertex r in at most 6n+ min{3t, m} steps. 


F82: [HeHeHul3, HoMaOkZul1] Every graph G satisfies #(G) = 2%), 


F83: (HeHeHu13] If G is a complete graph, cycle, tree, or has 7(G) = 24™©@) then 
1(G, 6) = 74 (G). 


F84: (HeHeHul3] For all d > 0, n > 1, and k > 1 we have 7*(K,) = 2n/(n + 1), 
ft* (Py) = (n+2)/3, a*(Cap) = k2+7/3(2" —1), #*(Corya) = (2h-+1)(2-4)/(3(2"-}) — 
1), #°(Q) = (4/3)4, and #*(P) = 5/2. 

F85: [Stacking Theorem] [Sj05] Every positive configuration D on a graph G has 
m(G, D) = s(G, D). 


F86: [GaGoTeVuWaYe08] The complete r-partite graph K = Kg,,....5, has domination 


bata: 


target pebbling number a7 (K,.@) = 3 if every s; = 2 and max; s;. 


F87: [GaGoTeVuWaYe08] For the path P, on n vertices, 7 (Pn,W@) = 2(2” — 
gn mod 3) /7+ | mod 3|. 


F88: [Kn12] The distance pebbling number of the cycle is 7a(C,) = (24 — 1) |n/d| + 
gn mod a 


F89: [BeSi09] If G has n vertices and diameter d then p(G) < (n-—d+1)(241—1). 


F90: [KaSi] If G has n vertices and diameter 2 ae p(G) < V2n —1+5. Furthermore, 
for all n there is a diameter 2 graph G with p(G > |/2n - 1| +2 
(d+ <p 


F91: [KaSi] If G has n vertices and diameter d then [ 2) /2] 


“(@) S$ [(n+1)/2]. 


REMARK 

R2: Note that the Stacking Theorem implies that the computational complexity of 
calculating pebbling numbers applies only to nonpositive target configurations. 
RESEARCH PROBLEMS 

RP19: Is it true that 7(G,@,) = 7,(G) for every graph G? 


RP20: Is there a constant c such that 7*(G) < ca*(G) for every graph G? 
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11.4.6 Applications 


Graph pebbling arose as a method to prove a conjecture of Erdés and Lemke in com- 
binatorial number theory. It has since produced a more general result in combinatorial 
group theory and another in p-adic diophantine equations. 


FACTS 


F92: [Ch89, ElHu05] Fact F79 implies that, if g1,...,gn is a sequence of elements of 
an abelian group G of size n, then there is a nonempty subsequence (gx)kex such that 
rek Gk = UG and Yo ,.¢% 1/|gx| < 1, where |g| denotes the order of the element g in 
G and 0é is the identity element in G. 


F93: [Kn12] Write n = 2'm, where m is odd. Define d = 1 for odd n (t = 0) and 
d=t+2 for even n (t > 0). If s > #(Cy,d) then, for all integer coefficients a1,..., ds, 
the additive form F(x) = )>;_, a;z? has a nontrivial (not all zero) solution to F(x) = 0 
in the 2-adic integers. 


REMARK 


R3: The pebbling steps studied here have a cost in the loss of pebbles. Various no-cost 
rules for pebbling steps have been studied for years and have found applications in a 
wide array of areas. One version, dubbed black and white pebbling, was applied to 
computational complexity theory in studying time-space tradeoffs, as well as to optimal 
register allocation for compilers. Connections have been made also to pursuit and 
evasion games and graph searching. Another (black pebbling) is used to reorder large 
sparse matrices to minimize in-core storage during an out-of-core Cholesky factorization 
scheme. A third version yields results in computational geometry in the rigidity of 
graphs, matroids, and other structures. 


RESEARCH PROBLEM 


RP21: Prove that Fact F92 holds for all groups G. 
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Glossary for Chapter 11 


alternating path — relative to a matching: a path whose edges alternate between free 
and matched. 
augmenting path, — relative to a matching: an alternating path that starts at one 
free vertex and ends at another free vertex. 
augmenting path, P — in a flow network: a directed path from s to t in the residual 
network. 
__, capacity of — in a residual network Gy = (V, Ey, s,t,uz): denoted Ap and given 
by Ap = minyy,wyez, Tf(U, W)- 
backward arc (v,w) — across a cut (S,T): when v € T andwe 8S. 
bandwidth — of a communication facility: capacity of a communication facility in bits 
per second (bps). 
blossom: an odd length cycle formed by joining two even vertices of an alternating 
path, rooted at a free vertex. 
cartesian product of two graphs G,o0G2: the graph with vertex set V(GyaG2) = 
{(v1, v2) | v1 € V(G1), ve € V(G2)} and edge set E(GioG2) = {{(v1, v2), (wi, we)} | 
(vy = wy and (v2, w2) € E(G2)) or (ve = we and (v1, wi) € E(G1))}. 
circulation — in a cost-flow network G = (V, A,cap,c,b): a flow for the supply vector 
b=0. 
Class 0: the set of graphs having pebbling number equal to its number of vertices. 
complete graph K,,: the graph on n vertices with an edge between every pair of 
vertices. 
configuration on a graph G: a function from the vertices of G to the nonnegative 
numbers that indicates how many pebbles are on each vertex. 
___, capacity of: the number of pebbles in the configuration, i.e., the sum of the 
function values of the configuration. 
cost-flow network G = (V, A, cap,c,b): a directed graph with vertex-set V, arc-set A, 
a nonnegative capacity function cap: A — N, a linear cost function c: A> Z, and 
an integral supply vector b: V > Z that satisfies )>,¢y b(w) = 0. 
__, s-t: a flow network G = (V, A, cap, c, b) that contains two distinguished vertices s 
and t such that b(v) = 0 for all uv € V — {s,t} and b(s) = —b(t) > 0. 
__, extended: an s-t network G’ = (V’, A’, cap’) of G = (V, A, cap, c, b) with vertex- 
set V’ = VU {s,t}, arc-set A’ = AU {(s,v)|b(v) > O} U {(w, t)|b(w) < O}, and 
capacity function cap’ defined by 


cap(v,w), if (v,w) € A; 
cap'(v,w) = ¢ d(v), ifv =s; 
—b(w), ifw=t 


cover pebbling number of a graph G: equals 7(G, J), where J is the configuration 
with one pebble on each vertex. 
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cube graph: see d-dimensional cube graph. 

s-t cut (S,T) — corresponding to a partition (5,7) of V: the set of arcs that have one 
endpoint in one of the sets and the other endpoint in the other set. 

___, capacity of: the sum of the capacities of the arcs crossing the cut in the forward 
direction, i.e., cap(S,T) = > cap(v, w). 
(v,w)E EES, weT 
__, minimum: a cut of minimum value, i.e., min{cap(S,T) : ($,T) is an s-t cut}. 

cycle graph C;,: the path graph P, with an extra edge joining the last vertex to the 
first. 

d-dimensional cube graph Q,: the graph on the 2¢ vertices labeled by all binary 
d-tuples, with an edge between pairs of vertices whose labels differ in exactly one 
coordinate. 

distance pebbling number 74(G): the minimum t such that, for every size t config- 
uration, there is some pebble that can move to a vertex at distance d from where it 
started. 

dual — of a minimum-cost flow problem: see §11.2, Definition D14. 

even vertex — relative to an alternating path: a vertex that is an even distance (number 
of edges) from the root of the path. 

excess e(v) — at vertex vi e(v)= S> f(w,v)- SS. flv,w). 

(w,v)EE (v,w)EE 

exponent: see pebbling exponent. 

facility: a transmission medium (e.g., fibre-optic or wireless) installed on an edge for 
sending data, voice or video signals. 

flow, f — in an s-t flow network G = (V, E,s,t, cap): a function f : E + N which obeys 
three types of constraints: 
capacity constraints: f(v,w) < cap(v, w), for each arc (v,w) € E. 
conservation constraints: 

VM w,yeze fw, 0) = Vw wen f(v, w) for each v € V — {s, t}. 

nonnegativity constraints: f(v,w) > 0, for each are (v,w) € E. 

flow, f — in acost-flow network G = (V, A, cap,c,b): a function f : A > Z that satisfies 
capacity constraints: f(v,w) < cap(v,w) for all (v,w) € A, 
flow conservation constraints: )~[f(v,w) — f(w,v)] = b(v) for each v € V, 


WwW 
nonnegativity constraints: f(v,w) > 0 for all (v,w) € A. 
__, minimum: a flow f with minimum c! f value among all flows. 
__, value of: the total flow into the sink, ie., val(f)= S> f(v,#). 
(v,t)hEE 
flow across cut (S,T): the flow crossing the cut in the forward direction minus the 
flow crossing the cut in the backward direction, i.e., 


f(S,T) = a f(v,w) - iS f(v,w). 


(v,w)E EWES weT (v,w)EEvET wes 


s-t flow network G = (V,E,s,t,cap): a directed graph with vertex set V and arc-set 
FE, two distinguished vertices, a source s and a sink t, and a nonnegative capacity 
function cap: E 4 N. 

flow with gains — in a gain network G = (V,A,cap,y,c,s,t): a function f : A + R 
that satisfies: 
capacity constraints: f(v,w) < cap(v,w) for all (v,w) € A, 
nonnegativity constraints: f(v,w) > 0 for all (v,w) € A, 
flow conservation constraints: 


ewevlf(v, w) — f(w,v) (wu, v)] = 0, for each v € V — {s, t}. 
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___, maximum: a flow with gains that maximizes the amount of flow reaching t given 
an unlimited supply at s. 
__, minimum-cost maximum-: a maximum flow with gains that minimizes 
Yea cle)g(e)- 
flow-over-time network: a flow network G, = (V, A, cap, 7, b), where each arc (v, w) € 
A has an associated transit time Tyy. The transit time Ty, represents the amount of 
time that elapses between when flow enters arc (v,w) at v and when the same flow 
arrives at w. 
flow-over-time: see $11.2, Definition D26. 
k-fold r-solvable configuration: a configuration from which it is possible to place at 
least k pebbles on vertex r via pebbling steps. 
forward arc (v,w) — across a cut (S,T): when v € S and we T. 
free edges — of a matching M: the edges of the graph not in M. 
free vertices — of a graph G: the vertices of G not incident on a matched edge. 
gain network G = (V,A,cap,y,c,s,t): a network G = (V,A,cap,c,b) with positive- 
valued gain function y : E — R* and supply function b, = 0 for all v € V — {s, t}; 
the gain factor y(e) > 0 for arc e enforces that for each unit of flow that enters the 
arc, y(e) units exit. For standard network flows, the gain factor of every arc is one. 
graph power G*): the graph formed from G by adding edges between every pair of 
vertices of distance at most k in G. 
increasable arc (v,w) — in network G for a given flow: f(v,w) < cap(v, w). 
Kneser graph K(m,t): the graph having as vertices all t-subsets of {1,2,...,m}, with 
edges between every pair of disjoint sets. 
Lemke graph: a graph that does not have the 2-pebbling property is a Lemke graph; 
the Lemke graph is the smallest among them. 
matched edges — of a matching M: the edges of M. 
matched vertices — of a graph G: the vertices of G incident on a matched edge. 
matching — in a graph G: a set of pairwise nonadjacent edges. 
__, complete — of a bipartite graph G = (X UY, E): a matching that meets each 
vertex of X; also called X-saturating. 
maximum-size: a matching M having the largest size |M|. 
maximum-weight: a matching M having the largest weight wt(M). 


o) 


perfect — of a graph G: a matching that meets each vertex of G exactly once. 
, Size of: the number of edges in the matching. 
__, weight of: the sum of the weights of edges in the matching. 

maximum flow problem: given a flow network G = (V,E,s,t,cap), find a flow of 
maximum value. 

maximum multicommodity flow problem: for each commodity 7, find a flow f; of 
value (demand) val(f;) such that ae val(f;) is maximized. 

maximum r-path partition of a tree T: a sequence of paths P[1],...,P[h] that 
partitions the edges of T so that r is a leaf of P{1], T; = Ui_,P{j] is a tree for all 
1<i<h, and P/i] is a maximum length path in T — T;_1, among all such paths 
with one endpoint in 7;_;, for all 1 <i <h. 

multicommodity flow — in a multicommodity flow network G = (V, E, K,u): a set of 
k =|K| functions f; : E — Rt satisfying the following conditions: 


’ 


joint capacity constraints: ah fi(v,w) < cap(v,w), for each arc (v,w) € E. 
conservation constraints: ))(,,.,)cr fi(W,¥) = iw,w)er Si(v, w) for each vertex 


vEeV— {s;, ti} and i = ul oe sth and aun er fiw, ti) oa te wer filti, w) = d; 
for each i=1,...,k. 
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nonnegativity constraints: f;(v,w) > 0, for each arc (v,w) € E and each 
aa eens 

multicommodity flow network G = (V,E,K,u): a directed graph with vertex-set 
V and arc-set FE, commodity set K, and a nonnegative capacity function u: E> N. 
We adopt the convention that if arc (v,w) € E then the reverse arc (w,v) ¢ E. The 


commodities are indexed by the integers 1,2,...,k. 
near pyramid graph: a pyramid graph minus one of the edges of its inner triangle 
(a,b,c). 


odd vertex — relative to an alternating path: a vertex that is an odd distance (number 
of edges) from the root of the path. 

optimal pebbling number 7*(G): the minimum size of a solvable configuration on 
G. 

parent of a vertex in a rooted tree: the unique neighbor that is closer to the root 
of the tree. 

path graph P,: the graph with a sequence of n vertices and n — 1 edges joining 
consecutive pairs of vertices. 

pebbling exponent e,(G) of a graph G: the minimum & such that the graph power 
G) is Class 0. 

pebbling number 7(G) of a graph G: the maximum rooted pebbling number 7(G,r) 
over all root vertices r. 

2-pebbling property of a graph G: the property that every configuration C of size 
more than 27(G) — |o(C)| is 2-fold r-solvable for all r. 

pebbling step: the removal of two pebbles from one vertex, along with the placement 
of one pebble on an adjacent vertex. 

pebbling threshold of a graph sequence G,G2,...: a function t : MN such 
that a random configuration on G,, of asymptotically more (less) pebbles than t(n) 
is almost surely solvable (unsolvable). 

power: see graph power. 

preflow: a relaxed version of a flow, a function f : E ~ Z+ which obeys three types 
of constraints: 
capacity constraints: f(v,w) < cap(v,w), for each are (v,w) € E. 
relaxed conservation constraints: 

Uwe f(w,0) — Viewer f(v,w) 2 0 for each vertex v € V — {s, t}. 

nonnegativity constraints: f(v,w) > 0, for each arc (v, w) € E. 

pyramid graph: the graph on 6 vertices formed by the union of the 6-cycle (r, a, p, c, q, d) 
and the (inner) triangle (a, b,c). 

pyramidal graph: a graph G that contains an induced (near-) pyramid, having 6-cycle 
C and inner (near-) triangle K, and can be drawn in the plane so that the edges 
of K are drawn in the interior of the region bounded by C' and every other edge of 
G can be drawn inside the convex hull of exactly one of the sets {r,a,b}, {p, a, c}, 
{q, b,c}, or {a, b,c}. 

reducible arc (v,w) in network G for a given flow: f(v,w) > 0. 

residual capacity r;(v,w) — of arc (v,w) in residual network Gy for a given flow or 
preflow f: see §10.1, Definition D10. 

residual network, — for a maximum-flow network: see §11.1, Definition D10. 

residual networkg — for a cost-flow network: see §11.2, Definition D9. 

rooted pebbling number 7(G, D) of a graph G: the minimum number t of pebbles 
such that every size t configuration is D-solvable. 

shrunked blossom: obtained by collapsing a blossom into a single vertex. 

solvable configuration: a configuration that is r-solvable for every vertex r. 
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D-solvable configuration: a configuration from which it is possible via pebbling steps 
to reach another configuration having at least as many pebbles on each vertex as the 
configuration D. 

r-solvable configuration: a configuration from which it is possible to place a pebble 
on vertex r via pebbling steps. 

split graph: a graph whose vertices can be partitioned into a clique and an independent 
set. 

Steiner tree problem: given a weighted graph in which a subset of vertices are 
identified as terminals, find a minimum-weight connected subgraph that includes all 
the terminals. 

strategy: a weight function on a rooted tree with the property that, for every nonroot 
vertex v, the parent of v has weight at least twice that of v. 

support o(C) of a configuration C on a graph G: the set of vertices of G that 
have at least one pebble. 

switch — in a communication network: node equipment for routing and processing 
communication traffic. 

thread: a set of degree two vertices that form a path in a graph. 

threshold: see pebbling threshold. 

transshipment network: a cost-flow network G = (V,A,cap,c,b) in which all arcs 
have infinite capacity. 

__, associated: the transshipment network obtained by replacing each arc e = (v, w) 
by three arcs (v,2e), (Ye, Ve), and (ye, w) having infinite capacity and with costs 
Ce, 0, and 0, respectively; the supplies at the new nodes x, and y, are defined to 
be b(a-) = —cape and b(ye) = cape. 

__, completion of: the complete transshipment network obtained by adding all 
missing arcs and giving each of them infinite capacity and arc cost M +1, where 
M =Y- c(e)cap(e). 

unsplittable flow problem: multicommodity flow problem with the additional re- 
striction that each commodity must be routed on one path. 

vertex cover — of a graph G: a set of vertices incident on all edges of G. 

weight of a path P — relative to a matching M: the sum of the weights of the free 
edges in P minus the sum of the weights of the matched edges in P, denoted wt(P). 
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INTRODUCTION 


The study of complex networks analyzes graph-theoretical properties arising in real- 
world networks, ranging from technological, social, and biological. Web pages and their 
links, protein-protein interaction networks, and on-line social networks such as Facebook 
and LinkedIn are some of the commonly studied examples of such networks. 

Never before have we confronted graphs of not only such tremendous sizes but also 
extraordinary richness and complexity, both at a theoretical and a practical level. Nu- 
merous problems arise. For example, what are basic structures of such large networks? 
How do they evolve? What are the underlying principles that dictate their behavior? 
How are subgraphs (that we observe) related to the large (and often incomplete) host 
graphs? What are the main graph invariants that capture the myriad properties of such 
large graphs? 

To deal with these questions, graph theory comes into play. There have been a great 
many advances in the field over the past thirty years in combinatorial, probabilistic, and 
spectral methods. Still, the traditional random graphs mostly consider the same degree 
distribution for all nodes or edges while real-world graphs are uneven and clustered. The 
classical algebraic and analytic methods are efficient in dealing with highly symmetric 
structures, whereas real graphs are quite the opposite. Guided by examples of complex 
networks, many new and challenging directions in graph theory emerge. Here we include 
several selected topics that have been developing. This article is based in part on the 
surveys [BoTil2, Ch10], and further references can be found in [Bo08, ChLu04b]. 
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12.1.1 Examples of Complex Networks 


Nowadays we are surrounded by various information networks that are of pro- 
hibitively large size. Dealing with graphs arising from these networks, the usual graph 
parameters such as the exact number of nodes are no longer as important. Instead, only 
partial or locally available information can be obtained. This leads to a whole range 
of new graph parameters and problems, which are motivated by examples of complex 
networks. Some of these complex networks we will mention here. Our main focus in 
this survey will be on the web graph and on-line social networks. 


EXAMPLES 


E1: The web graph has nodes consisting of web pages, and edges corresponding to 
links between them. The web graph may be viewed as directed or undirected, depending 
on the context. 


E2: The collaboration graph has nodes as co-authors and two authors are connected 
by an edge if they have written a joint paper together. The collaboration graph (accord- 
ing to the Math Reviews database) has 401,000 nodes and 676,000 edges. The reader 
is referred to the website of Grossman [Gr23] for many interesting properties of the 
collaboration graph. 


E38: On-line social networks have nodes consisting of users on some social net- 
working site such as Facebook, and edges consisting of friendship links between them. 
Twitter may be viewed as a directed graph, where users follow each other, but following 
may not be reciprocal. 


E4: Protein-protein interaction networks have nodes consisting of proteins in a 
living cell, with two proteins joined if they share some biochemical interaction. For a 
survey of protein-protein interaction networks, see [Pr05]. 


E5: Other important examples of complex networks are router graphs (nodes are 
routers and edges correspond to physical connections between them), call graphs 
(nodes are phone numbers and directed edges correspond to calls placed between them), 
citation graphs (nodes are academic papers in a given discipline and there are directed 
edges between cited papers). 


12.1.2 Properties of Complex Networks 


Complex networks arise is diverse arenas but have a completely unexpected coher- 
ence. The prevailing characteristics of the complex graphs are large-scale, small-world, 
and possessing power-law degree distribution, which we will now describe. Throughout, 
graphs are taken as finite and undirected, unless otherwise stated. 


DEFINITIONS 


D1: The order and size of complex networks vary considerably, but all of them are 
large-scale. 
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The web graph has over a trillion nodes, with billions of pages appearing and dis- 
appearing each day. Facebook [Fal2] has over 955 million users, and over 70 billion 
friendship links. Some of the nodes of Twitter corresponding to well-known celebrities 
including Lady Gaga and Justin Bieber have out-degree over 26 million [Tw12]. 

Because of the large-scale property of complex networks, an examination of almost all 
nodes, called a sweep or crawl, is costly and rarely done, and only performed off-line. 
However, on-line and local computation should ideally be computed in constant time or 
cost (independent of n, the number of nodes), or of order O(log n) or O(log log n). 


D2: Complex graphs are often sparse, which means that the graphs have a linear 
number of edges (that is, |E(G) < c|V(G)|, for some small constant c. In fact, the 
constant is usually less than 10 in most examples.) Some networks, such as on-line 
social networks, however, tend to be dense graphs. 


In extremal graph theory and random graph theory, dense graphs are quite well un- 
derstood, partly due to Szemeredi’s regularity lemma. To deal with certain complex 
graphs with billions of nodes such as the web graph, research has focused primarily on 
the study of sparse graphs. 


D3: The small world phenomenon can be regarded as a combination of small 
distances and clustering. Namely, there is a short path joining any two nodes and if two 
nodes share a common neighbor, they are more likely to be adjacent. 


There are two different notions of “small distance.” The diameter of a connected 
graph G is the maximum distance between a pair of nodes, and is written as diam(G). 
The average distance, denoted by adist(G), is defined by 


d(u, v) 
|F| ° 


adist(G) = S- 


Uu,veF 


where F is the set of pairs of distinct nodes u, v of G with the property that the distance 
d(u, v) between u and v is finite. The directed analogue of this parameter, where distance 
refers to shortest directed paths (that is, with no back edges), is denoted by adisty(G). 


FACT 


F1: As evidence of the small world property for the web graph, in [AlJeBa99] it was 
reported that some subgraph of the web graph in 1999 has average distance 19, while 
some other values were reported earlier in [Br-etal00]. The collaboration graph of Math 
Reviews has diameter 23 and average distance 7.64 (see [Gr23]). This can be phrased 
as, “eight degrees of separation” among mathematicians. In some complex networks 
such as on-line social networks and citations networks, distances have been observed to 
decrease over time; see [LeK]Fa05]. In a recent study of 700 million users of Facebook 
in [Ba-etall2], the average distance of that network was given as 4.74. 


There are also a number of graph parameters associated with clustering. A direct way 
is to define the clustering coefficient of G to be 


2 E(N(w)) 
<<, dea(v)(deg(v) — 1)’ 


n 
rE 


where E(N(v)) denotes the number of edges in the subgraph induced by the neighbor- 
hood of v in G. 
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D4: A useful isoperimetric graph invariant related to clustering is the Cheeger con- 
stant, which can be defined as the maximum Cheeger ratios h(S) over all subsets S$ 
of nodes in G. The Cheeger ratio, written h(S), is defined by 


|E(S, S)| 
Ose ot Ree ee 
vol(s)<vol(G) /2 


where we define the volume vol(S) = >>. deg(v), vol(G) = vol(V(G)) and E(A, B) 
denotes the set of edges with one endpoint in A and one endpoint in B. We note that 
in the literature, the isoperimetric number i(S') = ao ignores the uneven degree 
distributions of complex graphs and is therefore less effective for capturing clustering in 


complex networks. 


D5: Ina graph G on n nodes, let Ny = N,(n) denote the number of nodes of degree 
k. The degree distribution of G follows a power law in some range of k if Nz is 
proportional to k~°, for a fixed exponent b > 2. If G is directed, then we may consider 
(possibly different) power laws for the in- and out-degree distributions by defining Nj", 
and NPG, respectively, in the obvious way. 


Power law degree distributions are sometimes called heavy-tailed distributions, since 
the real-valued function f(k) = k~® exhibits a polynomial (rather than exponential) 
decay to 0 as k tends to co. If G possesses a power law degree distribution, then we 
simply say G is a power law graph. 


FACTS 


F2: By taking logarithms, power laws can be expressed as 


log( Nx) ~ log(t) — 6 log(k). 


Hence, in the log-log plot, we obtain a straight line with slope —(. 


F3: Based on their crawl of the domain of Notre Dame University, Albert, Barabdsi, 
and Jeong [AlJeBa99] claimed that the web graph exhibited a power law in-degree dis- 
tribution, with 6 = 2.1. An independent crawl corroborating the findings of [AlJeBa99] 
was reported in [Ku-etal99], which considered 40 million web pages from 1997 data. The 
exponent of 3 = 2.1 was further corroborated by a larger crawl of the entire web (includ- 
ing 200 million web pages) reported in Broder et al. [Br-etal00]. There were several other 
reports concerning slightly different power law exponents 8 = 2.45 and 6 = 2.72 for 
Web graphs [AlJeBa99, Br-etal00]. Kumar, Novak, and Tomkins [KuNoTo06] studied 
the evolution of Flickr and Yahoo!360, and found that these networks exhibit power-law 
degree distributions. Power law degree distributions for both the in- and out-degree 
distributions were documented in Flickr, YouTube, LiveJournal, and Orkut [Mi-etal07], 
as well as in Twitter [Ja-etal07, Kw-etal10]. 
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DEFINITION 


D6: Ina graph G, let 0 = Ay < AV: +: < An_1 < 2 denote the eigenvalues of the 
normalized Laplacian £L = I — D~'\/? AD~'/? where A is the adjacency matrix and 
D is the diagonal degree matrix of G. The eigenvalue \, is intimately related to the 
Cheeger constant hg by the Cheeger inequality (see [Ch97]): 


h2 
2hg =A, = 5 


FACTS 


F4: In 1999 Faloutsos et al. [FaFaFa99] made an experimental study of an autonomous 
systems graph, finding a power law distribution for the highest eigenvalues of the adja- 
cency matrix. 


F5: Mihail and Papadimitriou [MiPa02] showed that the largest eigenvalues of the 
adjacency power law graphs are themselves distributed according to a power law. This 
phenomenon is sometimes referred to as the ezgenvalue power law. In the other 
direction, the eigenvalues of the normalized Laplacian of a random graph with general 
degree distribution satisfy the semi-circle law. 


F6: Social networks often organize into separate clusters in which the intra-cluster 
links are significantly higher than the number of inter-cluster links. In particular, social 
networks contain communities (characteristic of social organization), where tightly knit 
groups correspond to the clusters [NePa03]. As a result, it is reported in [Es06] that 
social networks, unlike other complex networks such as the web graph, possess bad 
spectral expansion properties realized by small spectral gaps in their adjacency matrices. 


12.1.3. Random Graphs with General Degree Distri- 
butions 


To describe a complex graph (or indeed, any graph), we wish to use as few parameters 
as possible. This is exactly what graph models are supposed to do. For example, the 
Erdés—Reriyi random graph model G(n, p) uses only two parameters to describe a family 
of quite complicated graphs by selecting each pair of nodes to be an edge with probability 
p independently. We note that a random graph in G(n, p) has the same expected degree 
at every node, and therefore, G(n,p) does not capture some basic behaviors of complex 
networks. Still, the methods and approaches of classical random graph theory are 
instrumental in the study of general random graphs. 


DEFINITION 


D7: The G(w) model for random graphs with expected degree sequence w is defined 
by selecting an edge between v; and v; independently with probability p;; where p;; is 
proportional to the product w,w,;. For example, G(n,p) is a special case of G(w) by 
taking w to be (pn, pn,...,pn). The G(w) with a power law distribution w is called a 
power law random graph. 
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FACT 


F7: [ChLu02] Suppose that a random power law graph G in G(w) has n nodes and 
expected degree sequence w following a power law with exponent 2 < 8 < 3. Let G 
have average degree d > 1 and maximum degree m satisfy 


log n 


] ———.. 
Ree log logn 


Then, for all values of 8 > 2, asymptotically almost surely the graph G is connected 
and the diameter satisfies 
diam(G) = O(log n), 


and the average distance satisfies 


log log n 
log(1/(8 — 2) 


Many probabilistic and spectral properties of power law random graphs have been stud- 
ied (see [ChLu04b] for details). 


adist(G) < (2 + o(1)) 


DEFINITION 


D8: An important random graph model for general degree sequences is the configura- 
tion model, which is a spin-off of random regular graphs. One way to define random 
regular graphs G;, of degree k on n nodes is to consider all possible matchings in a 
complete graph Ky,. Note that a matching is a maximum set of vertex-disjoint edges. 
Each matching is chosen with equal probability. We then obtain a random k-regular 
graph by partitioning the nodes into subsets of size k. Each k-subset then is associated 
with a node in a random regular graph G,. Although such a random regular graph might 
contain loops (that is, an edge having both endpoints the same node), the probability 
of such an event is of a lower order and can be controlled. Now, instead of partitioning 
the set of nodes of the large graph into equal parts, we choose a random matching of a 
complete graph on }°, d; nodes which are partitioned into subsets of sizes dj, d2,...,dn. 
Then we form the random graph by associating each edge in the matching with an 
edge between associated nodes. Clearly, in the configuration model, there are nontrivial 
dependencies among the edges. 


FACTS 


F8: A useful tool in a configuration model is a result of Molloy and Reed [MoRe95, 
MoRe98}]. For a random graph with (7; + o(1))n nodes of degree i, where 7; are non- 
negative values which sum to 1 and n is the number of nodes, the giant component 
emerges when Q = )7,s, i(i— 2); > 0, provided that the maximum degree is less than 
n'/4-© (where € > 0) and some “smoothness” conditions are satisfied. Also, there is 
asymptotically almost surely no giant component when @ = 7s, i(i — 2)y; < 0 and 


the maximum degree is less than n!/8~*, 
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F9: [AiChLu00] The evolution of a random power law graph with exponent ( is as 
follows: 


1. When 8 > 69 = 3.47875..., the random graph asymptotically almost surely has 

no giant component where the value 69 = 3.47875... is a solution to 

((8 —2)— 2¢(8 — 1) =0. 
When £8 < Go = 3.47875..., there is asymptotically almost surely a unique giant 
component. 

2. When 2 < 8 < Bo = 3.47875..., the second largest component is asymptotically 
almost surely of size O(logn). For any 2 < x < O(logn), there is asymptotically 
almost surely a component of size x. 

3. When £ = 2, asymptotically almost surely the second largest component is of size 
@(128"_). For any 2 < x < O(_°8"_), there is asymptotically almost surely a 


loglog n loglog n 
component of size zx. 


4. When 1 < 6 < 2, the second largest component is asymptotically almost surely of 
size O(1). The graph is asymptotically almost surely not connected. 

5. When 0 < 6 < 1, the graph is asymptotically almost surely connected. 

6. When 6 = 6 = 3.47875..., the situation is complicated. It is similar to the 
double jump of the random graph G(n,p) with p = 1. For 8 = 1, there is a 
nontrivial probability for either case that the graph is connected or disconnected. 


12.1.4 On-Line Models of Complex Networks 


Models of complex networks are usually (but not always) stochastic, and graphs 
evolve over time. As there are a large number of models, our survey is far from ex- 
haustive. We focus on three models which have applications to modelling the web 
graph and on-line social networks. Other models which have been rigorously ana- 
lyzed are competition-based preferential attachment models [Be-ctal04], for- 
est fire models {[LeK1]Fa05], ranking models [FoF1Me06, JaPr09, LuPr06], growth- 
deletion models [ChLu04b], Kronecker graphs [Le-etal05], Multiplicative At- 
tribute Graph model [KiLe12], planar power law graphs [FrTs12], and models 
based on algorithmic game theory [Bo-etal12)]. 


Preferential Attachment Model 


Arguably the most commonly studied complex network models are ones incorporating 
some form of preferential attachment. The first evolving graph model explicitly designed 
to model the web graph was given by Barabasi and Albert [BaA199], and this model is 
now referred to as the preferential attachment. A rigorous analysis of a preferential 
attachment model for the case of 3 = 3 was given in [Bo-etal01]. Here we give a general 
definition for a preferential attachment model for 8 > 2 introduced and analyzed in 
[AiChLu01]. 


DEFINITION 


D9: The preferential attachment model G(p) has one parameter 0 < p < 1 and 
involves two possible operations: 


e Vertex-step: Add a new node v, and add an edge uv from v by randomly and 
independently choosing u in proportion to the degree of u in the current graph. 

e Edge-step: Add a new edge rs by independently choosing nodes r and s with 
probability proportional to their degrees. 
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We start with an initial graph Go (which can usually be taken to be the graph formed 
by one node having one loop). At time ¢, the graph G; is formed by modifying G;_1 as 
follows: with probability p, take a vertex-step, otherwise take an edge-step. 


FACT 


F10: [AiChLu01] For the preferential attachment model G(p), asymptotically almost 
surely the number of nodes with degree k at time t is 


Mgt + O(2\/Ftin(), 


Ewe P 
where My = #2 and My = PE SAU PE? — O(k-+2's)), for k > 2. In other 


words, asymptotically almost surely the graphs generated by G(p) have a power law 
degree distribution with the exponent 8 = 2+ pare 


The Copying Model 


The copying model was introduced in Kleinberg et al. [Kl-etal99]. 
DEFINITION 


D10: The copying model has parameters p € (0,1), d € N*, and an initial directed 
graph H with constant out-degree d. Assume that the graph G at time t has constant 
out-degree d. At time t + 1, we add a new node v4; and the d out-link of vz41 is 
generated as follows: We choose a “prototype” node wu; from the set of all existing 
nodes at random. For each of the d out-neighburs w of u; with probability p, add a 
directed edge (v441, 2), where z is chosen uniformly at random from V(G;), and with 
the remaining probability 1 — p add the directed edge (v441, w). 


FACTS 


F11: [Ku-etal00] If k > 0, then in the copying model with parameter p, asymptotically 
almost surely 


0 (er ). 


In particular, the copying model asymptotically almost surely generates directed graphs 
G, whose in-degree distribution follows a power law with exponent 


F12: [Ku-etal00] Let N;;,; denote the expected number of distinct K;,;’s which are 
subgraphs of G;. Asymptotically almost surely in the copying model with constant 
outdegree d, for i < logt, 

Nii = Q(t exp(—i)). 


This abundance of bipartite cliques, which model community structure, is provably 
absent from the preferential attachment model. 
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Iterated Local Transitivity Model 


We describe a recent model for on-line social networks based on transitivity in social 
networks (that is, friends of friends are often friends). The model is deterministic and 
simple to describe, but leads to complex behaviour over time. 


DEFINITIONS 


D11: In the Iterated Local Transitivity model [Bo-etall1], given some initial graph 
as a starting point, nodes are repeatedly added over time which clone each node, so that 
the new nodes form an independent set. The only parameter of the model is the initial 
graph Go, which is any fixed finite connected graph. Assume that for a fixed t > 0, the 
graph G; has been constructed. To form G;+1, for each node x € V(G;), add its clone 
x’, such that x’ is joined to x and all of its neighbors at time t. Note that the set of new 
nodes at time t + 1 forms an independent set of cardinality |V(G;)|. 


D12: We write deg,(x) for the degree of a node at time t, n; for the order of G;, and 
ez for its number of edges. Define the volume of G; by 


vol(G;)= 5° deg,(x) =2¢. 
rzeEV(G:) 


FACT 


F13: [Bo-etall1] For ¢ > 0, the average degree of G; equals 


3) (ean) 


Note that the average degree tends to infinity with n; that is, the model generates 
graphs satisfying a densification power law. In [LeKIFa05], densification power laws 
were reported in several real-world networks such as the physics citation graph and the 
internet graph at the level of autonomous systems. 


DEFINITION 


D13: Define the Wiener index of G; as 


1 
WG)=5 Dd) aay). 
x,yeV (Gt) 
FACTS 
F14: [Bo-etall1] For t > 0, 


4t (W (Go) + (eo + no) (2 fe (3)')) 


L(G) = 4tn2 — 2'ng 


Note that the average distance of G; is bounded above by diam(Go) + 1 (in fact, by 
diam(Go) in all cases except cliques). Further, for many initial graphs Go (such as large 
cycles) the average distance decreases. 
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F15: [Bo-etall1] If we let ny =n (so t ~ logy n), then 


CG) = nlese(7/8) +01). 


In contrast, for a random graph G(n,p) with comparable average degree 
pn = ((3/2)") = @(nle# 0), 


F16: [Bo-etall1] Define \(G;) to be the spectral gap of the normalized Laplacian of 
G;. For t > 1, A(G;) > 4. This represents a drastic departure from the good expansion 
found in random graphs, where \ = o(1); see [Ch97]. 


12.1.5 Geometric Models for Complex Networks 


In geometric random graph models, nodes are identified with points in some 
metric space S, and edges are determined via a mixture of probabilistic rules and dis- 
tance between nodes in S. Complex networks do not usually live in physical space, but 
live in some feature space, where nodes are associated with vectors of features, and 
nodes with similar features are more likely to be adjacent. For example, we may view 
the web graph as residing in topic-space, where pages are close if they have common 
topics. On-line social networks are embedded in social-space (sometimes called Blau 
space; see [Mi83]), where nodes with similar social attributes are close. 

Several geometric models for complex networks have been proposed, such as geo- 
metric preferential attachment [FIFrVe07], random dot product graphs [ScYo08], 
and spatially preferred preferential attachment [Ai-etal09, CoFrPr12]. We focus 
on the geometric Protean model which was introduced as a model of an on-line social 
network, in particular, as it generates dense graphs. 


DEFINITION 


D14: [BoJaPr12] The geometric Protean model, written GEO-P(a, 3,m, p), pro- 
duces a sequence (G; : t > 0) of undirected graphs on n nodes, where ¢ denotes time. 
We write G; = (V;, E;). There are four parameters: the attachment strength a € (0,1), 
the density parameter 8 € (0,1— a), the dimension m € N, and the link probability 
p € (0,1). Each node v € V; has rank r(v,t) € [n] = {1,2,...,n}. The rank function 
r(-,t) : Vi > [n] is a bijection for all t, so every node has a unique rank. The highest 
ranked node has rank equal to 1; the lowest ranked node has rank n. The initialization 
and update of the ranking is done so that the node added at time ¢ obtains an ini- 
tial rank R,, which is randomly chosen from [n] according to a prescribed distribution. 
Ranks of all nodes are adjusted accordingly. Let S be the unit hypercube in R™, with 
the torus metric d(-,-) derived from the L,, metric. More precisely, for any two points 
x and y in R”™, their distance is given by 


d(a,y) = min{||@— y+ ulloo : we {-1,0,1}"}. 


To initialize the model, let Go = (Vo, Eo) be any graph on n nodes that are chosen from 
S. We define the influence region of node v at time t > 0, written R(v,t), to be the 
ball around v with volume 


|R(v, t)| = r(v,t)-on- 8 . 
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For t > 1, we form G; from G_, according to the following rules. 


1. Add a new node v that is chosen uniformly at random from S. Next, indepen- 
dently, for each node u € V;_1 such that v € R(u,t—1), an edge vu is created with 
probability p. Note that the probability that wu receives an edge is proportional to 
pr(u,t — 1)~%. The negative exponent guarantees that nodes with higher ranks 
(r(u,t— 1) close to 1) are more likely to receive new edges than lower ranks. 


2. Choose uniformly at random a node u € V;_;, and delete u and all edges incident 
to wu. 


3. Vertex v obtains an initial rank r(v,t) = Rz, which is randomly chosen from [n] 
according to a prescribed distribution. 


4. Update the ranking function r(-,t) : Vi > [n]. 


FACTS 


F17: Since the geometric Protean process is an ergodic Markov chain, it will converge 
to a stationary distribution. 


F18: [BoJaPr12] The geometric Protean model asymptotically almost surely generates 
power law graphs. More precisely, let Nz = Nz(n,p,a, 3) denote the number of nodes 
of degree k, and No = Dij5; Ni. Let a € (0,1), 8 € (0,1—a), mEN, pe (0, 1], and 


ni-e-B log!/? n <k< ni-2/2-B loge ri, 
Then asymptotically almost surely GEO-P(a, 6,m, p) satisfies 
a: a “Bhs te 
Nsx = (1+ O(log Le n)) sae a ae a 


F19: [BoJaPr12] Asymptotically almost surely the average degree of GEO-P(a, 3,m, p) 
is 


ni-e-B. 


d=(1+0(1)) 


Note that the average degree tends to infinity with n; that is, the model generates 
graphs satisfying a densification power law. 


F20: [BoJaPr12] Let a € (0,1), 8 € (0,1—a), m € N, and p € (0,1). Then 
asymptotically almost surely the diameter D of GEO-P(a, 8,m, p) satisfies 


D = O(n0-2 logi-a7 7). (12.1.1) 


F21: We note that in a geometric model where regions of influence have constant 
volume and possess the same average degree as the geo-protean model, the diameter is 
(nom ). This is a larger diameter than in the geometric Protean model. If m = Clog n, 
for some constant C > 0, then by (12.1.1) asymptotically almost surely we obtain a 


diameter bounded above by a constant. 


F22: The dimensions of social space quantify user traits such as interests or geography; 
for instance, nodes representing users from the same city or in the same profession would 
likely be closer in social space. The Logarithmic Dimension Hypothesis [BoJaPr12] 
conjectures that the dimension m of an on-line social network is best fit by about log n, 
where n is the number of users in the on-line social network. The motivation for the 
conjecture comes from both the geometric protean and multiplicative attribute models 
(see [KiLe12] for more on the latter model). 
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12.1.6 Percolation in a General Host Graph 


Most information networks that we observe are subgraphs of certain host graphs for 
which only partial and incomplete information is available. A natural question is to 
deduce properties of the host graph from its random subgraphs and vice versa. It is of 
interest to understand the connections between a graph and its subgraphs. What graph 
invariants of the host graph are preserved in its subgraphs? Under what conditions can 
we predict the behavior of the host graph from some samples of subgraphs? 


DEFINITION 


D15: One way to deal with a random subgraph of a given graph G is a type of 
(bond) percolation problem. For a positive value p < 1, we consider G,, which is 
formed by retaining each edge independently with probability p and discarding the edge 
with probability 1 — p. A fundamental problem of interest is to determine the critical 
probability for which G, contains a giant connected component. As an example, if 
the host graph is the complete graph K,,, Gp is the same as the random graph model 
G(n,p). 


FACTS 


F23: In the application of epidemics, the host graph is taken to be a contact graph, 
consisting of edges formed by pairs of people with possible contact. The question of 
interest is to determine the critical probability which corresponds to the problem of 
finding the epidemic threshold for the spreading of diseases. 


F24: Percolation problems have long been studied in theoretical physics, especially for 
the case of the host graph taken to be the lattice graphs Z* or special families of Cayley 
graphs [Bo-etal05, Bo-etal05a, Bo-etal06, MaPa02], and dense graphs [Bp-etal10]. 


F25: For a general host graph G, it was shown in [ChLu02] that under some (mild) 
conditions depending on its spectral gap and higher moments of its degree sequence of 
G, for any « > 0, if p > (1+ .)/d, then asymptotically almost surely the percolated 
subgraph G, has a giant component. In the other direction, if p < (1 — e)/d, then 
asymptotically almost surely the percolated subgraph G,, contains no giant component. 
We note that the second order average degree d is d = >, &/(2, dv), where d, denotes 
the degree of v. The volume of the giant component in G, is given in [ChLu06]. 


F26: Suppose the host graph G has a large spectral gap A, defined by 
A = min{Ay, 2 — An-1}, 


where 0 = Xp < AL <... < An_-1 are eigenvalues of the normalized Laplacian of G. It 
was shown in [ChHo07] that the random subgraph H in G, preserves the spectral gap 


as follows: 
S | logn (log ae 
An =A o( pdmin pdrmin (log log n)3/2 ) , 


F27: Ina paper by Liben-Nowell and Kleinberg [LiK108] it was observed that the 
tree-like subgraphs derived from some chain-letter data seem to have relatively large 
diameter. In the study of the Erdés—Rényi graph model G(n, p), it was shown that the 
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diameter of a random spanning tree is of order \/n, in contrast to the fact that the 
diameter of the host graph K,, is 1. Aldous [A190] proved that in a regular graph G 
with a certain spectral bound oa, the expected diameter of a random spanning tree T of 
G, denoted by diam(T), has expected value satisfying 


CONE 2 iain) OU EE 

logn Jo 
for some absolute constant c. In [ChHoLu12], it was shown that for a general host graph 
G, with high probability the diameter of a random spanning tree of G is between c,/n 
and c’,/nlogn, where c and c’ depend on the spectral gap of G and the ratio of the 
moments of the degree sequence. 


12.1.7 PageRank for Ranking Nodes 


The notion of PageRank, first introduced by Brin and Page [BrPa98], forms the basis 
for Google’s web search algorithms. Furthermore, PageRank can be used to measure 
the global importance of nodes, and capture quantitative correlations between pairs or 
subsets of nodes. Roughly speaking, PageRank is the stationary distribution of some 
random walk on a given directed graph. In other words, the PageRank value of a node 
is the probability that, at any given moment, a random surfer is visiting this node 
following a random walk whose diffusion is under control (by a specified constant a 
which we will define). 


DEFINITIONS 


D16: In a directed graph G without vertices with zero outdegree, the transition 
probability matriz P for a typical random walk on G is defined as 


_ f 1/degt(u) if (u,v) € E(G) 
Ds { 0 : otherwise 


where degt(u) denotes the out-degree of vertex uw. 


D17: Fora teleportation constant a € (0, 1], the PageRank matriz of G is defined 
by 
Py =ad,/n+ (1—-a)P 


where J, is the all 1’s matrix of size n x n and n is the number of vertices in G. 


FACTS 


F28: [BrPa98] For a teleportation constant a, the PageRank matrix P, is stochastic 
(that is, its row sums are all 1) and hence is a transition probability matrix of a Markov 
chain. The teleportation constant a@ is sometimes called the damping or diffusion 
constant. 


F29: [BrPa98] The PageRank Markov chain with transition probability matrix Py 
converges to a stationary distribution pr, which is called the PageRank vector (with 
teleportation constant a) and the entry associated with vertex v is the PageRank of 
the v in G. 
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DEFINITION 


D18: [Be06, JeWi03] A useful generalization of PageRank is defined as follows. Given 
a digraph G, consider a typical random walk on G with the transition probability matrix 
defined by W = D~1A, where D is the diagonal degree matrix and A is the adjacency 
matrix. Personalized PageRank vectors are based on random walks with two govern- 
ing parameters: a seed vector s, representing a probability distribution over V(G), 
and a teleportation constant a, controlling the rate of diffusion. The personalized 
PageRank pr(a,s) is defined to be the solution to the following recurrence relation: 


pr(a,s) = as+(1—a)pr(a,s)W 


where the PageRank is taken to be a row vector. 


FACTS 


F30: The original PageRank pr, is the special case where the seed vector s is the 
uniform distribution. 


F31: Personalized PageRank has found a number of applications, such as finding graph 
clusters and drawing graphs [ChTs10], devising nearly linear time local partitioning 
algorithms [AnChLa06, AnCh07], and to graph sparsification and graph partitioning 
[(ChZh10]. 


F32: PageRank has lead to the discovery of a number of ranking algorithms in various 
complex networks (beyond web search) such as TwitterRank [We-etall0], Social- 
PageRank [Ba-etal07], and ProteinRank [Fr07]. 


F33: Because of the close connection of PageRank with random walks, there are very 
efficient and robust algorithms for computing and approximating PageRank [AnChLa06, 
Be06, JeWi03). 


F34: The usage of PageRank leads to numerous applications including the basic prob- 
lem of finding a “good” cut in a graph with small Cheeger ratio. Some of the most 
widely used approximation algorithms are spectral partitioning algorithms with their 
performance guaranteed by the Cheeger inequality: 
hy. he 
2he =A = = 2 3° 
where hy is the minimum Cheeger ratio among subsets which are initial segments in 
the order determined by the eigenvector f associated with the spectral gap ». Instead 
of using eigenvector (which requires expensive computation), one can use PageRank 
vector to derive a local partitioning algorithm [AnChLa06], based on the local Cheeger 
inequality for a subset S' of vertices in a graph G: 


2 
hs hs 


> 
vol(S') ~ 8log vol(S)’ 


hg > XA5 > 
i © = log 


where Ag is the Dirichlet eigenvalue of the induced subgraph on S, hg is the local 
Cheeger constant of S defined by hg = minrcg h(T), hg is the minimum Cheeger ratio 
over initial segments determined by PageRank vectors g with the seed as a vertex in S, 
and a is appropriately chosen depending only on the volume of S. 
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12.1.8 Network Games 


In morning traffic, drivers choose the most convenient way to get to work without 
necessarily paying attention to the consequences of their decisions on others. The In- 
ternet network can be viewed as a similar macrocosm which functions neither by the 
control of a central authority nor by coordinated rules. The basic motivation for each 
individual can only be deduced by greed and selfishness. Every player chooses the most 
convenient route and use strategies to maximize possible payoff. In other words, we face 
a combination of game theory and graph theory for dealing with large networks both 
in quantitative analysis and algorithm design. There has been a great deal of progress 
in algorithmic game theory; see [Ni-etal07]. Of special interest are games on networks 
that involve graph theory as well as internet economics. 


DEFINITION 


D19: Classical chromatic graph theory can be examined from the perspective of 
game theory. The chromatic number of a graph G, written (G), is the minimum 
number of colors needed to properly color the nodes of G so that adjacent nodes have 
different colors. Suppose there is a payoff of 1 unit for each player (represented by 
a node) if its color is different from all its neighbors. A proper coloring is a Nash 
equilibrium, while no player has an incentive to change their color. 


FACT 


F35: Kearns et al. [KeSuMo06] conducted an experimental study of several coloring 
games on specified networks. A multiple round model of graph coloring games was 
analyzed in [ChChJa08]. 


DEFINITIONS 


D20: The analysis of selfish routing arises naturally in network management. The 
“price of anarchy” refers to the ratio of the average travel time of the decentralized 
selfish routing versus the coordinated routing respecting the collective welfare. 


D21: Braess’ paradoz is the counter-intuitive observation that, if the travelers were 
behaving selfishly it was possible to improve everyone’s travel time by removing roads. 


FACTS 


F36: Valiant and Roughgarden [VaRo10] showed that in a random graph G(n, p) 
with edge density p > n—'/?+©, Braess’ paradox occurs asymptotically almost surely. 
Chung and Young proved that Braess’ paradox occurs in a sparse random graph with 
p> clogn/n. In fact, it was shown [ChYoZh12] that Braess’ paradox is ubiquitous in 
expander graphs. 


F37: There has been extensive research done on selfish routing [RoTa02]. The reader 
is referred to several surveys and some recent books on this topic [Ro06]. 
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INTRODUCTION 


A communication network can be modeled by a connected graph with vertices repre- 
senting nodes of the network and edges representing communication links between them. 
A low level operation in a communication network is the transmission of a message from 
one node to an adjacent node. A collection of such transmissions can be combined to 
achieve higher level goals such as broadcasting, in which information originating at one 
node in the network must be distributed to all other nodes of the network. A protocol 
specifies the transmissions and the order in which they are made to achieve a higher 
level goal and can be described by a labeled subgraph. We consider two broad categories 
of problems — finding the best network for a given high level goal and finding the best 
protocol for a given goal and network. 

We assume that the nodes of the network are synchronized and that transmitting a 
message from a node to its neighbor takes one unit of time. A protocol can be specified 
by a sequence of sets, each containing the transmissions made during a particular time 
unit. We represent this in a graph by assigning labels to each edge indicating the time(s) 
during which that edge transmits a message. In this model, a message originating at 
vertex u is received by a vertex v if there is a path from u to v with increasing edge 
labels. 

We assume that every vertex can send a message to at most one neighbor during a 
given time unit. In the labeled graph, this means that no two edges incident on a single 
vertex have the same label. We will discuss variants of this model at the end of this 
section. 
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12.2.1 Broadcasting 


In this subsection, we consider broadcasting. We begin by looking in more detail 
at the notion of broadcast time in this model, giving some basic definitions and simple 
facts. 


DEFINITIONS 


D1: Broadcasting is the goal of sending a single message from a particular vertex 
(called the originator) to all of the other vertices in the graph. 


D2: A broadcast protocol for originator (source) s is represented by a graph 
G = (V,F) on n vertices such that every edge e is labeled with at most one positive 
integer 7 and the labels satisfy the following constraints: 


e the set of labels on the edges incident on any vertex are disjoint, 
e there is exactly one path with increasing labels from s to each of the other vertices 
in the graph. 


D3: A vertex v is informed at time t by a broadcast protocol for originator s if 
the last edge in the path from s to v is labeled with t. The completion time of a 
broadcast protocol for originator s is the least integer t such that every vertex v in the 
graph is informed by time ft. 


D4: A broadcast protocol for a graph G is a collection of broadcast protocols for 
each originator s € V(G). 


D5: The broadcast time of a broadcast protocol for a graph G is the maximum 
completion time for any originator s € V(G). 


D6: The broadcast time b(G) of a graph G is the minimum broadcast time for any 
broadcast protocol for G. 


D7: Let b(n) be the minimum 0(G) over all graphs G with n vertices. 


EXAMPLE 


E1: Figure 12.2.1 shows examples of two broadcast protocols for a given originator s. 


Figure 12.2.1: Two broadcast protocols for originator s. 
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REMARK 

R1: To broadcast quickly, the labeled edges incident on s should have labels 1,2,...,2 
and for each other vertex v, one of its incident edges is labeled with the time ¢ at 
which v is informed, and the other labeled edges incident on v should be labeled 
ey ae ete eee o 

FACTS 


F1: Due to the restrictions on the labels, at most 2° vertices can be informed at time 
t > 0 by any broadcast protocol for any originator. 


F2: b(n) = [flogn]. The previous fact implies b(n) > [logn]. Protocols showing 
b(n) < [logn] are discussed next. 


Minimum Broadcast Trees 


Here we discuss those trees which correspond to minimum time broadcast protocols. 


DEFINITIONS 


Ds: A minimum broadcast tree is a rooted tree on n vertices with root s for which 
there exists a broadcast protocol for originator s with completion time [log n]. 


D9: The binomial tree on 2" vertices is a rooted tree defined recursively as follows. A 
single vertex is a binomial tree on 2° vertices. A binomial tree on 2" vertices is formed 


from two binomial trees on 2*~! vertices by joining their roots with an edge and making 
one of them the root of the binomial tree on 2” vertices. 


EXAMPLE 


E2: Figure 12.2.2 shows a binomial tree and two of its subtrees that are minimum 
broadcast trees. 


Figure 12.2.2: Binomial tree (left) and two subtrees. 
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FACTS 
F3: The binomial tree on 2* vertices is a minimum broadcast tree [Prosk]. 


F4: Every n vertex connected subtree, 2*-! < n < 2*, of the binomial tree on 2* 
vertices that includes the root is a minimum broadcast tree and these subtrees are the 
only minimum broadcast trees on n vertices [Prosk]. 


B(n) and Minimum Broadcast Graphs 


One research goal has been to determine the graphs on n vertices with the fewest edges 
that allow minimum time broadcasting from each vertex. 


DEFINITIONS 


D10: A graph G = (V, £) on n vertices is a broadcast graph if there is a broadcast 
protocol with completion time [logn] for each originator v € V. In other words, for 
every v € V there is a spanning subgraph of G that is a minimum broadcast tree rooted 
at v. A broadcast graph on n vertices is a minimum broadcast graph (MBG) if 
the number of its edges is the minimum over all broadcast graphs on n vertices. The 
number of edges in an MBG on n vertices is denoted B(n). 


D11: A k-dimensional binary hypercube H,, is a k-regular vertex-transitive bipar- 
tite graph with n = 2* vertices and k-2*—! edges. Each vertex is labeled with a different 
binary string 71%2---x,, and there is an edge between two vertices if and only if their 
labels differ in exactly one bit position 1 < ¢< k. Such vertices are called dimension @ 
neighbors and the edge connecting them is a dimension ¢ edge. The set of 2*~! dimen- 
sion £ edges is a perfect matching in Hz. Thus, Hx, can be defined recursively as two 
copies of H,z_1 joined by a perfect matching corresponding to dimension k. 


D12: The Cayley graph D, is a (k—1)-regular vertex-transitive bipartite graph with 
n = 2" — 2 vertices and (k — 1) - (2*-! — 1) edges. The vertices are labeled with the 
integers mod 2* — 2. There is an edge between two vertices with labels i and j if and 
only if («+ 7) (mod 2* — 2) = 2 —1 for some 1 < ¢ < k—1. Such vertices are called 
dimension ¢ neighbors and the edge connecting them is a dimension ¢ edge. The set of 
2*-! _ 1 dimension ¢ edges is a perfect matching in Dz. 


This definition is specifically for Cayley graphs based on dihedral groups. 


D13: The recursive circulant graph G,4, d > 2 has n vertices labeled with the 
integers mod n and an edge between two vertices with labels i and j if and only if 
i+d‘=j (mod n) for some 0 < ¢< flogyn] — 1. 


D14: The Knédel graph W,,« is a A-regular bipartite graph, 2 < A < |logn], with 
n vertices, n even, and ne edges. The vertices are labeled (i,j), i= 1,2,0<j < $-1. 
There is a dimension ¢ edge, 0 < @ < A—1, between every pair of vertices (1,7) and 
(2,7 + 2-1 (mod #)),0 < j < A—1. The set of 2 dimension £ edges is a perfect 


matching in W,,a. 
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EXAMPLES 
E3: Figure 12.2.3 shows three non-isomorphic MBGs on 16 vertices. 


Figure 12.2.3: H4 (left), Giga (center), Wie,4 (right). 


E4: Figure 12.2.4 shows two isomorphic MBGs on 14 vertices. 


Figure 12.2.4: Heywood graph Dy (left), Wi4,3 (right). 


FACTS 


F5: The set {H, | & = 1,2,3,...} of binary hypercubes is an infinite family of 
MBGs [FHMP]. Hz, is a broadcast graph because there is a spanning subgraph isomor- 
phic to a binomial tree on 2" vertices rooted at each vertex. One broadcast protocol is 
to use dimension order: each vertex that is informed at time ¢ informs its dimension 
t+1,t+2,...,k neighbors during time steps t+1,¢+2,...,k, respectively. Hy, is an 
MBG because the number of informed vertices must double during each of k time units 
to inform all 2* vertices and this requires that the originator has at least k neighbors. 
Any vertex can be the originator, so the minimum number of edges is k - 2*~!. 


F6: The set {Gx 4 | k = 2,3,4,...} of recursive circulant graphs is an infinite family of 
MBGs. Go 4 is isomorphic to Hy for k = 2 and non-isomorphic to Hy for k > 3 [FR98}. 
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F7: The set {Wor | k = 2,3,...} of Knédel graphs is an infinite family of MBGs. 
Wx is isomorphic to H; for k = 2,3 and non-isomorphic for k > 4 [FR98, FPO1]. It 
is isomorphic to Gax 4 for k = 2 and non-isomorphic for k > 3 [FR98}. 


F8: If G, and G2 are MBGs on 2" vertices, then the graph that results from adding 
any perfect matching between the vertices of G; and Gz is an MBG on 2**! vertices. 
This construction can be applied recursively with different matchings at each stage to 
construct many non-isomorphic MBGs [FR98]. 


F9: The set {Dy | k = 2,3,4,...} of Cayley graphs is an infinite family of MBGs with 
2* — 2 vertices and (k — 1) - (2*-! — 1) edges [DFF91, KH90]. One broadcast protocol 
is a dimension order protocol similar to the hypercube protocol except that there are 
k; — 1 dimensions instead of k. The originator and its dimension 1 neighbor are idle 
during the k*” time step because they have no uninformed neighbors after k — 1 time 
steps; every other informed vertex informs its dimension 1 neighbor during the k“” time 
step. D;, is an MBG because any originator with degree less than k — 1 cannot inform 
2* — 3 other vertices in k time units, so every vertex must have degree at least k — 1. 


F10: The Cayley graph D; is isomorphic to the Knédel graph W2x_2;,_, for k > 3 
[FR98, HMP97], so {Woe_2 4-1 | k =3,4,...} is an infinite family of MBGs. 


F11: Table 12.1 shows the known values of B(n) for small n, indicated by asterisks, 
and the best upper bounds currently known for other small n. Some of the graphs that 
verify these values are the results of ad hoc constructions, while others are the results of 
construction methods that have produced several MBGs. The contents of this table are 
based on [BFP92, BHLP92b, DFF91, FHMP, KH90, Labahn, MS94, MitHed, Sac96, 
VenWen, WenVen95, WenVen, XiaWan, ZZO01]. 


Bin) | n Bin) | n Bi(n)| n Bi(n)| n Bin) | n Bin) | rn Bin) 
O* | 10 12* | 19 25* | 28 48* | 37 57 | 46 82 | 55 111 
1* | 11 13* | 20 26* | 29 52* | 38 57 | 47 83 | 56 111 
2* | 12 15* | 21 28* | 30 60* | 39 60 | 48 83 | 57 126 
4* 13. 18* | 22 31* | 31 65* | 40 60 49 94 58 121* 
5* | 14 21* | 23 34 | 32 80* | 41 65 | 50 95 | 59 124* 
6* | 15 24* | 24 36 | 33 48 | 42 66 | 51 100 | 60 130% 
8* | 16 32* | 25 40 | 34 49 | 43 71 | 52 99 | 61 136% 
12* | 17 22* | 26 42* | 35 51 44 72 53 107 | 62 155* 
10* | 18 23* | 27 44* | 36 52 | 45 81 | 54 108 | 63 162* 


CO} G0] MI] @] oY} A] eo] DO] BA] S 


Table 12.1: Upper bounds on B(n) for small n. Values that are known to be optimal 
are indicated by asterisks. 


F12: B(n) € O(L(n)n), where L(n) is the number of consecutive leading 1’s in the 
binary representation of n—1 [GP91]. 


F13: B(n) < (n flogn])/2 [Far]. 
F14: B(n) < (n |logn]|)/2 for even values of n [FR98]. 


F15: B(n) is monotonically non-decreasing for any n in the first quarter of the range 
between any two consecutive powers of 2. More precisely, B(n) < B(n+1) for 2™-1+1 < 
n< 2-1 49m-3 _ 1 [HLOS). 
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RESEARCH PROBLEMS 


RP1: Is B(n) monotonically non-decreasing between two consecutive powers of 2? 
This question has only been solved for the first quarter of each range (see F15). 


RP2: Ina broadcast graph on n = 2" — 1 vertices, every vertex must have degree at 
least k — 1 and if the originator has degree / — 1 then it must have at least one neighbor 


of degree k. This gives a lower bound B(2* — 1) > | $(n(k-1)+ | tx])]- MBGs 
matching this lower bound are known for k < 7 [BHLP92b, FHMP, H08, Labahn]. The 
question is open for k > 7. A good starting point to explore this problem would be the 
values of k that are one less than a prime because the lower bound formula is exact for 


these values by Fermat’s little theorem (i.e., the ceilings disappear). 


RP3: B(n) is only known for a few infinite families of graphs and some small values of 
n. The general problems of determining B(n) and of finding more infinite families are 
open. MBGs on n = 2" and n = 2" —2 vertices are k-regular and (k—1)-regular graphs, 
respectively. For most other values of n the MBGs will not be regular and the difficulty 
of proving upper and lower bounds seems to increase the farther that n is from a power 
of 2. MBGs for n = 2* — 1 vertices are known for small k (see RP2). General lower 
bounds for n = 2* —1, 1 = 3,4,5,6, k > 4 were proved in [Sac96] and MBGs matchings 
these lower bounds are known for k = 4,5,6. It is unknown if there are MBGs matching 
these bounds for k > 7. The degree of any originator in an MBG on n = 2° — 2? —r 
vertices is at least k — p [GarVac] and this gives a general lower bound on B(n). 


Construction of Sparse Broadcast Graphs 


As MBGs have proven to be difficult to construct, numerous methods have been pro- 
posed to construct sparse broadcast graphs that have a small number of edges. Almost 
all of these methods are combinations and variations of a few techniques. 


DEFINITIONS 


D15: Given a broadcast protocol in a broadcast graph G on n vertices, a vertex wu is 
idle at time t < [logn] if and only if u is aware of the message at (the beginning of) 
time step t and u does not communicate with any of its neighbors during time step t. 


D16: A subset of vertices C in a broadcast graph G is a solid 1-cover if and only if 
C is a vertex cover of G, and for each u ¢ C, there is a broadcast protocol for u such 
that at least one neighbor of wu is idle at some time during the broadcast. 


D17: Let G = (V(G), E(G)) and H = (V(#), E(A)) be two graphs. The compound 
of G into H relative to a set S C V(G), denoted Gg[H], is obtained by replacing 
each vertex x of H by a graph G, isomorphic to G and adding a matching between two 
sets S, and S, if x and y are adjacent in H. More precisely, the matching between S; 
and S, connects each vertex in S; with its copy in Sy. For any vertex u € S, we use 
#, to denote the graph isomorphic to H that interconnects the copies of u € V(G). 
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EXAMPLES 


E5: Figure 12.2.5 shows a broadcast protocol for Cg (left) in which the vertex to the 
left of the originator is idle during time step 3, and two copies of Cg joined by a partial 
matching between their solid covers of size 3 (right). 


Figure 12.2.5: Partial matching of two copies of Cé. 
E6: Figure 12.2.6 shows a solid 1-cover of the Heywood graph. 


Figure 12.2.6: Solid 1-cover of the Heywood graph D4. 
FACTS 


F16: If G = (V,E) is a broadcast graph on n vertices and C is a solid 1-cover of G, 
then B(2n) < 2|E|+|C| [BFP92, KH90]. Construct G’ by joining two isomorphic copies 
of G, G, and G2, with a perfect matching between their solid covers C, and C2. Let 
ui € V(G1) be the originator of the broadcast. If wu: € C1, then wu; sends the message 
to its copy ug, and then u; and ug broadcast in G; and Go, respectively. If ur ¢ C1, 
then wu initiates a broadcast in G; and whenever a vertex v1 € Ci is informed, it first 
informs its copy v2 and then both copies of v continue to broadcast in their copies of 
G. Cy is a solid cover so uz can be informed by an idle vertex. See Figure 12.2.5 for an 
example. 


F17: Let G = (V(G), E(G)) and H = (V(#), E(A)) be broadcast graphs on ny 
and ng vertices, respectively, and let C be a solid 1-cover of G. If [logning|] = 
flog n1|]+[log nz], then Go[H] is a broadcast graph and therefore B(nin2) < n2|E(G)|+ 
|C||E(H)| [BFP92]. A specific case is seen in F16, where H is Ho. 


F18: B(2* — 2?) < 2h (2k —p-—1) for any 1 <p<k-—2 and k > 4 [KH90]. This 
also follows from F17. 


F19: Vertex deletion was introduced in [XiaWan]. If G is a broadcast graph with 
n vertices, 2*—! +1 <n < 2", e edges, and a vertex v of degree d, then B(n — 1) < 
e+4d(d—3). Construct G’ on n—1 vertices by deleting v and adding enough edges to 
form a clique among the former neighbors of v. Each broadcast protocol for G can be 
modified so that the vertex u that informed v in G informs the former neighbors of v 
in the same order that v informed them in G. 
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REMARKS 


R2: Vertex addition was introduced in [BHLP92b]. In this construction method, a 
vertex is added to a broadcast graph on n — 1 vertices and is connected to some of 
the original vertices, yielding a broadcast graph on n vertices. This method is used 
in [BHLP92b, H08, HL12]. 


R3: Solid 1-covers are vertex covers for which the broadcast protocols have enough 
idle vertices to “cover” all paths of length 1 (i.e., edges). They can be generalized to 
solid h-covers which cover all paths of length h. Solid h-covers with h > 1 are used 
in [BFP92, KH90, MS94]. 


R4: m-way splits were introduced in [ChaLie] (for m = 5,6,7) and generalized 
in [BFP92]. The construction is similar to graph compounding and uses a graph G 
that has a perfect matching with certain broadcast properties, and graphs that have 
even adjacency splits. (A graph on ¢ vertices has an even adjacency split if it has 
two dominating sets of size [5] and |S |.) The method produces sparse broadcast 
graphs on n = mi+ vertices for which 7 is not a power of 2, 0 < 7 < m, and 
flog(mi + 7)] = Jlogm] + [logi]. This method produces broadcast graphs for more 
values of n than solid h-covers, which require that n = mi for m,i € N. 


R5: Several other construction methods have been developed. Most are combinations 
of matchings, partial matchings, compounding, m-way splits, vertex deletion, vertex 
addition, and operations on hypercubes. The methods in [Di99, VenWen, WenVen95] 
are similar to the solid covers method with vertex deletion and give similar results. The 
methods in [Che, GarVac] use solid covers, hypercubes, and vertex deletion. Match- 
ings were used in the first construction methods [Far] and early uses of solid covers, 
vertex addition, and vertex deletion appear in [BHLP92a]. The methods in [HL99] in- 
clude compounding, merging vertices, and deleting edges based on a broadcast graph 
construction using binomial trees. 


R6: The constructions in [GP91] use hypercubes and generalized Fibonacci numbers 
to produce asymptotic bounds on B(n). 


Bounded Degree Broadcast Graphs 


If we restrict our attention to graphs of given maximum degree, the time required to 
broadcast increases. Researchers have developed efficient broadcast protocols for some 
bounded degree graphs that are of interest to network designers. 


DEFINITIONS 


D18: The k-dimensional cube-connected cycles graph CCC; is derived from Hy 
by replacing each vertex = ©1%2...X%~ of Hy by a cycle of length & with vertices 
labeled (i,x), 0<i<k-—1. Each of the k vertices on a cycle that replaces x inherits 
one of the k edges that were incident to x in H,. In particular, vertex (2, 2) is connected 
to its neighbors (i+ 1,2) and (¢—1,2) on its cycle and to vertex (i, v1%2-++-%iqi ++: Lr) 
where %j+1 is the complement of the binary value x;+1. 


D19: The k-dimensional (wrapped) butterfly graph BF; is derived in a similar 
way to CCCy. BF; has the same vertex set as CCC;. The difference is that BF; has two 
edges corresponding to each former hypercube edge instead of one: (7,2) is connected 
to (@+1,a,02---Taq-:-a,) and (¢— 1,2 ,22-+-Tj-++ ey). 
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D20: The k-dimensional binary shuffle-exchange graph SE; has 2" vertices 
with the same labels as the vertices of Hz. Each vertex x1%2---x, is connected to 
its shuffle neighbor r2%3---Xp21 and its unshuffle neighbor xpx1---Xp_1, and to its 
exchange neighbor «1 %2-+--X,. Parallel edges and loops are removed. 


D21: The k-dimensional binary de Bruijn graph UB, has the same vertex set 
as H, and SE. As in SE;,, each vertex 11 %2-+-- x, is connected to its shuffle neighbor 
%2%3°++£,x, and its unshuffle neighbor x,.21---x,p_1. It is also connected to its shuffle- 
exchange neighbor 2223 :++ Xp and its unshuffle-exchange neighbor xp.21 +++ XR—-1. Par- 
allel edges and loops are removed. 


D22: The de Bruijn graph UBa, with degree 2d and diameter k has d* 
vertices with labels that are strings of length k over the alphabet {0,1,...,d— 1}. 
There is an edge between vertex 2%1,%2,...,2,% and each vertex %2,%3,...,%%,A and 
A, %1,%2,-..,€e-1 With A € {0,1,...,d—1}. UBo,, is the special case UB, in D21. 


D23: Let b(n, A) be minimum broadcast time over all graphs on n vertices with 
maximum degree A. 
EXAMPLE 


E7: Figure 12.2.7 shows CCC3, BF3, SE3, and UB3. 


000 100 O10 110 OO1 101 O11 111 
O 


010 oll 


000 ~=—(«001 110 111 


100 101 


Figure 12.2.7: CCC3 (top left), BF3 (top right), SE3 (bottom left), and UB; (bottom 
right). 
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Table 12.2 lists some parameters of hypercubes and bounded-degree approximations of 
hypercubes. 


vertices | edges degree diameter 
Fe | OF ee regular k k 
COG, | -K2* 3. k2k regular 3 2k — 1+ max(1, | “5?]) 
BFR. |" 2+ Kae regular 4 | 3 | 
SE, | 2" 3.2 —3+4(k (mod 2)) | maximum 3 | 2k —1 
UB, | 2* 2.9% 3 maximum 4 | k 


Table 12.2: Some parameters of hypercube-derived graphs. 


FACTS 


F20: Let aS be the maximum number of vertices that can be informed after t time 
units in any graph with maximum degree A. There is one informed vertex at time 
t = 0 (the originator), and aS < 2-a, for t > 1 (see F1). Furthermore, at time 
t+ A-—1, t > 1, all vertices that were informed by time t have no more uninformed 
neighbors, so a®_, < a ,_,+(aA_,_;—af) = 2a:4a-1—a}. Based on this recurrence, 
1.440 log, n—1.769 is a lower bound on b(n, 3) and 1.137 log, n—0.637 is a lower bound on 
b(n, 4) [LP88]. Closed form lower bounds are not known for A > 4, but asymptotically 
b(n, A) > p(n, A) where p(n, A) = (1+ 82°) [BHLP92a]. 


F21: 6(CCC,) = [2£] — 1 [LP88]. 


F22: 1.7417k < b(BF,) < 2k —1. The upper bound may be improved to 2k — 
4 log log k + c for some constant c and a sufficiently large k [KMPS94]. 


F23: b(SE,) = 2k —1 [HJM93]. 


F24: 1.3171k < b(UB;,) < 3(k +1). The upper bound is from [BP88] and the lower 
bound is from [KMPS94]. 


F25: b(n, A) < p(n, A) where p(n, A) ~ (1 + 244°) logy n [BHLP92a]. This bound 
is obtained using graph compounds H[U6,q;,| where H is a hypercube and UByz, is a 
de Bruijn graph. Better upper bounds for particular values of A can be obtained by 
compounding different graphs in de Bruijn graphs [BHLP92a]. See F20 for a lower 
bound on p(n, A). 


12.2.2 Gossiping 


In this subsection, we turn our attention to gossiping — a related problem that was 
first investigated before broadcasting. Results on broadcasting provide lower bounds on 
gossiping and graphs constructed for broadcasting are also useful for gossiping. 


DEFINITIONS 


D24: Gossiping is the goal of sending a unique message from each vertex to all of 
the other vertices in the graph. 


1488 Chapter 12. Communication Networks 


D25: A gossip protocol is represented by a graph G = (V,£) on n vertices such 
that every edge e is labeled with a set of positive integers ce and the labels satisfy the 
following constraints: 


e the sets of labels on any two edges incident on a vertex are disjoint, 
e there is a transmission path between any two vertices u and v; in particular a 


path of edges (e1,e€2,...,e%) such that there exist elements of the corresponding 
label sets t; € ce, such t) < tg < +++ < ty. Such a transmission path “ends” at 
time tp. 


D26: A vertex v is informed at time t by a gossip protocol if there exists a 
transmission path from every other vertex to v that ends at or before time tf. 


D27: The completion time of a gossip protocol is the least integer t such that every 
vertex v in the graph is informed at time t. 


D28: The gossip time g(G) of a graph G is the minimum completion time of any 
gossip protocol for G. 


EXAMPLE 


E8: Figure 12.2.8 shows an example of a gossip protocol for a unicyclic graph. 


Figure 12.2.8: A gossip protocol. 


REMARK 


R7: Implicit in the above definitions is a basic model of communication. This model 
allows “calls” between two vertices such that all messages currently known to both 
vertices are exchanged. The model allows this unbounded exchange of information to 
occur in one time unit. Further, the model allows calls on disjoint edges, e, and e€2, to 
occur during the same time unit t, when t € ce, and t € ce,; each time unit is a “round.” 
When these assumptions are found to be untenable, researchers have augmented the 
basic model in a variety of ways that are too numerous to consider within the bounds 
of this paper. 


FACTS 


F26: Clearly g(G) > b(G) since from a gossip protocol a broadcast protocol for each 
vertex of G can be inferred that has a completion time which is at most the completion 
time of the gossip protocol. 


F27: Given a gossip protocol, the time-reversed protocol derived from it is also a valid 
gossip protocol. In particular, if for each edge e we replace each t € ce with g(G)—t+1, 
transmission paths still exist for all pairs of vertices [T71]. 
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Optimal Gossip Graphs 


Here, we discuss the structure of gossiping in terms of the number of calls made without 
regard to the time needed to make the calls. 


DEFINITIONS 


D29: The size of a gossip protocol for a graph G is the total number of calls used. 
In particular, the size is }) cna |¢e|. Let f(G) be the minimum size of any gossip 
protocol for G. A protocol of size f(G) is an optimal gossip protocol for G. 


D30: The optimal gossip size f(n) is the minimum f(G) over all graphs with n 
vertices. 


REMARKS 


R8: As remarked above, the basic model of communication allows calls to occur “at 
the same time.” For the purpose of understanding f(G) this is de-emphasized; it is as 
if parallel calls have been linearized. Bandwidth consumed by a protocol is measured 
by this quantity, not time. 


FACTS 


F28: f(n+1) < f(n)+2 for n > 1 [T71]. This follows from the following observation: 
Given a graph G with f(G) = f(n), add an edge from any vertex v of G to a new 
vertex v’ to form a new graph G’. A valid gossip protocol for G’ can be obtained by 
adding a call between v and v’ both before and after a valid gossip protocol for G. 
This observation can be used to recursively construct gossip protocols from protocols 
for smaller graphs. 


F29: f{(G) < 2n—3 for any (connected) graph G. This follows from the previous fact, 
building a protocol recursively using a trivial protocol for the subgraph of two adjacent 
vertices as the basis. For some graphs G with n vertices we have f(G) > f(n); for 


example, the graph which is a path of four vertices requires, by exhaustive analysis, 5 
calls, while f(n) = 4. 


F30: f(n) = 2n—4 for n > 4, while f(1) = 0, f(2) = 1, and f(3) = 2. Various 
researchers independently proved this fact using very different proof techniques [BS72, 
HMS72, T71]. Other proofs appeared later as corollaries to stronger theorems. The 
difficulty was the lower bound; the upper bound follows from the next fact. 


F31: f(G) = 2n —4 for any graph G which contains C, as a subgraph; C4 is a cycle 
of four edges. Note that f(C,) = 4. This fact follows from the above facts, building a 
protocol recursively using a protocol for the Cy subgraph as the basis. An example of 
such a protocol was seen in Figure 12.2.8. 


F32: f(G) = 2n—4 if and only if the graph G contains Cy as a subgraph. This resisted 
proof for many years even though it was widely conjectured. (The difficulty is that a C4 
subgraph is necessary even though the protocol in the previous fact can be replaced by 
a fundamentally different protocol.) Eventually two independent proofs appeared [B81, 
KS80]. Later proofs used deeper results about “information flow” [K96, L95]. 
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Minimum Gossip Graphs 


In this subsection, we return our focus to minimum time gossiping and consider the 
graphs on n vertices with the fewest edges that allow minimum time gossiping. 


DEFINITIONS 


D31: Recall that g(G) is the gossip time for G, the number of rounds necessary to 
gossip in G. The minimum gossip time g(n) is the minimum g(G) over all graphs 
with n vertices. 


D32: Let M, = {G|G has n vertices and g(G) = g(n)}. A minimum gossip graph 
G (MGG) is a member of M/,,, for some n, and the number of its edges is the minimum 
over all graphs in M,,. The number of edges in an MGG on n vertices is denoted G(n). 


FACTS 
F33: As noted above g(G) > b(G) for every G, so it follows that g(n) > b(n). 


F34: g(n) = [log.n] if n is even and g(n) = [log.n] + 1 if n is odd. The lower 
bound follows from the bound on b(n) and the fact that with odd n some node does not 
participate in the first round. The upper bounds come from constructions [K75]. This 
result was anticipated in earlier literature where the problem was posed in a different 
context, e.g. [B50]. 


F35: It follows from the above that G(n) > B(n) for even n [F00]. 
F36: For any tree T with n vertices g(T) > 2[log,n] — 1 [L86]. 


F37: For alln = 2", k > 1, G(n) = % logy n. In particular, the set {H; | k = 1,2,3,...} 
of binary hypercubes is an infinite family of MGGs for such n [L93]. Hy has a gossip 
protocol that makes all calls across each dimension at the same time and proceeds in 
dimension order; it will broadcast from each originator in the same manner as was 
discussed for broadcasting. For n = 16, all MGGs have been characterized; they are all 
formed from H4 by perturbing Cy subgraphs [LP91]. Further, when k > 2 the families 
Wr x and Gor 4 are also MGGs [FR98]}. 


F38: For all n = 2*—2, k > 3, G(n) = $ (flog, n]—1). In particular, the set {D, | k = 
3,4,5,...} of Cayley graphs is an infinite family of MGGs for such n [DFF91, L93]. 
One gossip protocol is a dimension order protocol similar to the hypercube protocol 
except there are k — 1 dimensions instead of k; after calling across all dimensions the 
first dimension is repeated. 


F39: For all n = 2° — 4, k > 6, G(n) = #([log.n] — 1). In particular, the infinite 
family W,,~-1 are MGGs for such n [FR98, L93). 


F40: G(n) < (n|logn])/2 for even values of n [BHLP97]. 
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F41: Table 12.3 shows the known exact values of G(n) for small n, indicated by 
asterisks, and the best upper bounds currently known for other small n. These are 
apparently the only known values, beyond the families in the above facts. Some of the 
graphs that verify these values are the results of ad hoc constructions, while others are 
the results of construction methods that have produced several MBGs. The contents of 
this table are based on [F00, FLO0, FR98, L93]. 


n G(n)|n G(n)| n Gin)| n Gin)) n Gin)| n G(n)| n G(n)| nn Gin) 
1 O* |5 5* | 9 9* 113 17 [17 20 [21 27 [25 32 129 45 
2 1* |6 6* {10 138* |14 21* |18 25 | 22 36 |26 52 | 30 60* 
3. 2* |7 7* |11 11* |15 19* |19 22 23 29 |27 34 |31 47 
4 4* )8 12* |12 18* |16 32* | 20 28 | 24 36* | 28 56* | 32 80% 


Table 12.3: Upper bounds on B(n) for small n. Values that are known to be optimal 
are indicated by asterisks. 


12.2.3. Other Variations of Broadcasting and Gossip- 
ing 

Many variations of broadcasting and gossiping have been investigated. Most of 
these involve changes to the model that we have been using, considering digraphs, hy- 
pergraphs, multiple originators, multiple messages from a single source, random trans- 
missions, or restricted protocols. The model described here is somewhat unrealistic for 
gossiping, as it assumes that combined messages can be sent in a single transmission in 
constant time. Many of the other papers on gossiping use different models that account 
for the size of the combined messages sent. 

For more information on these and other variations, see [FL94, HHL88, HKMP96, 
HKPRUO5]. 
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INTRODUCTION 


Telecommunication network design (along with logistics and supply chain configura- 
tion, electricity distribution, and road infrastructure development) represents a major 
application of graph theory. Although telecommunication network design problems 
come in many different flavors that correspond to different telecommunication applica- 
tion contexts, the common theme across all such problems is that we need to connect a 
set of locations (for example, customers, computers, cities, or communication switches) 
using transmission links (which may or may not be capacitated) in order to satisfy 
the demand (consisting of voice, video, or data traffic) between pairs of locations at 
minimum cost. Specific application contexts may warrant additional constraints. While 
simple to state, network design problems are challenging from the modeling, algorithmic, 
and computational perspectives, and even the simplest network design problems belong 
to the class of NP-hard problems. Many classical graph theory problems, for exam- 
ple, the Steiner Tree Problem, the Traveling Salesman Problem, and the two-connected 
problem, are special cases of the general network design problem. 


In this chapter, we first present a general network design model and then consider 
a number of its special cases. Since the vast scope of network design prevents us from 
being comprehensive, our objective is to introduce some central network design models, 
along with their important structural properties and solution algorithms. 
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12.3.1 General Network Design Model 
Preliminaries 


DEFINITIONS 


D1: A network is a graph G = (N, E), where N is the set of nodes (or vertices) and 
E is the set of edges. The nodes denote geographical locations where communication 
demands originate or terminate, or locations where routing hardware and software are 
installed. The edges denote potential transmission links. Throughout this section, 
|N| =n and |E| =m. 


D2: A commodity, denoted by k, refers to a communication demand of value 
d; (measured in bits per second (bps)) between the origin node O(k) € N and 
destination node D(k) € N. Let K denote the set of all commodities and K = |X]. 


D3: A communication facility, or simply a facility, is a transmission medium (e.g., 
a copper cable, a fiber-optic cable, or a cellular tower-to-satellite connection) installed 
on a network edge. In our discussion, we assume that the facilities are undirected 
facilities, that is, if a facility of capacity C’ is installed on an edge, then it permits a 
total flow of C on the edge in each direction. (Facilities that permit flow only in one 
direction are said to be directed facilities.) 


D4: If the facility allows unlimited commodity flow, it is said to be a uncapacitated 
facility, otherwise it is said to be a capacitated facility. The capacity measured in 
bps of a facility is referred to as its bandwidth. 


NOTATION: Let £ = {1,2,..., 2} denote the set of facility levels ordered by facility 
capacity; the capacity of the facility level 7 is C;, 1 = 1,2...,£, where Cy < Co < 
ies SS Gp: 

NOTATION: The (nonnegative) fixed cost of installing a facility of capacity C; on the 
edge {i,j} € E is a, for] = 1,2,...,0, with a, < ai The (nonnegative) variable 
cost of sending one unit of commodity k from node i to node j on edge {%, 7} is denoted 
bk. 


D5: <A switch routes or processes the data signal and can be installed on each node 
in a subset M C N. Each switch has capacity T, and for each node i € M, t; denotes 
the switch installation cost. 


D6: The network design problem is to install capacity, install switches, and route 
the commodities at minimum cost. 


D7: A (linear) mixed-integer program (MIP) is a problem consisting of a linear 
multi-variable function to be minimized or maximized subject to linear inequalities 
involving those variables, where some of the variables are restricted to integer values. 
An integer program (IP) is one in which all variables are restricted to be integer- 
valued, and a linear program (LP) is one in which no variables are restricted to be 
integer (i.e., all variables are continuous variables). 


NOTATION: The optimal value of any mathematical program P defined above (LP, IP, 
or MIP) is denoted zp. 
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SUMMARY OF NOTATION 
e Network G = (N, E), n nodes and m edges. 
e Set K of commodities, k = 1,2,...,K. 
e Each commodity k has origin O(k), destination D(k), demand d, bps, and a 
variable cost bf, of sending one unit from node i to j on edge {i,j}. 
e Set £ of facility levels, = 1,2,...,L. 
e Each facility level / has capacity C; and fixed installation cost ai, on edge {i,j}. 
e Each node i € M CN has a switch installation cost t;. 


General Edge-Based Flow Model 


The network design problem defined above can be modeled by the mixed integer program 
shown below. 
NOTATION: The edge design decision variables, upper bounds, any restrictions on the 
network topology are defined as follows: 
e The continuous decision variable Sf denotes the flow of commodity k on edge 
{i,j} from i to j. 
e The integer decision variable u! ; denotes the number of facilities of level / to install 
on edge {7,7}, with ul < Migs the upper bound. 
e The binary decision variable v; equals one if a switch is installed on node i © M 
and 0 otherwise. 
e For each facility level 1, U' is a set specifying any topological restrictions on the 
design (for example, U' might specify that the chosen edge design variables Ui; 
define a cycle). The m-component vector of uj,’ is denoted u!. 


General Model: Edge-Flow [GM:EF] 


Minimize { S- al jut; + S- bE. fk + a tivib 


{i,j}E EEL {i,j} EB,kEK ieM 
subject to: 
—dy if i= O(k) 
So fi - So fh=4d ifi=D(k) Wie N, VkEK (FC) 
GEN Hi gj}EE JEN Ht jSEE 0) otherwise 

SOE < SoCui; =n} EE EF2) 

kek leEL 
SOE < SoCui; = =Vin J} EE EF3) 

keK leEL 
SS >> fR< TH VieM EF4) 

kEK jEN:{ij}EE 
wevU WleEL EF5) 
Z>0 V{i,j}e€E, VREK EF6) 
0< us; < th, and integer Vij} ek, VleL EF7) 
v4=O0orl VieM EF8) 
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REMARKS 


Ri: The constraints (FC) are the usual flow conservation constraints for each com- 
modity at each node. Constraints (EF2) and (EF3) limit the total flow of all commodi- 
ties in each direction of an edge by its installed capacity, and (EF4) constrains the total 
flow through nodes in M. Constraints (EF5) restrict the topology of the chosen design. 
Constraints (EF6) through (EF8) define the nonnegativity and integrality requirements. 


R2: This formulation assumes that the initial capacity of the edges is zero. By 
modifying the right hand side of constraints (EF2) and (EF3), we can model situations 
where edges have some positive level of initial capacity. 


R3: Constraints (EF2) and (EF3) limit the total flow of all commodities on an 
edge; hence, we refer to them as the bundle capacity constraints. In addition, some 
applications limit the flow for individual commodities on each edge. For an edge {i,j}, 
if Ck denotes the maximum allowable flow of commodity k on the edge from 7 to J, 
then the individual capacity constraints are of the form: 


fi <CK VEER, V{i,j} € E. 


R4: Formulation [GM:EF] allows fractional flows, although some applications require 
integral flows. The formulation also allows a commodity to flow over multiple origin- 
destination paths, called a bifurcated flow. Some applications might require non- 
bifurcated flow where each commodity is required to flow on only one origin-destination 
path. 


R5: Parallel edges in the graph can refer to different transmission technologies, for 
example, copper cables, fiber-optic cables, or wireless transmission. 


R6: In subsequent models that have the flow conservation constraints, we simply 
write “Flow Conservation Constraints (FC)” instead of rewriting those constraints each 
time. 


General Path-Flow Model 


Instead of using the edge-based flow variables as defined in formulation [GM:EF] above, 
an alternate approach for modeling the network design problem is to use path-based flow 
variables. We replace each edge {i, 7} in the original network by two oppositely directed 
arcs (7,7) and (j,i) having the same costs as edge {i, j}. 
NOTATION: The relevant notation for the model given below is as follows: 

e For each commodity k = 1,2,...,K, Py, denotes the set of directed paths from 

origin O(k) to destination D(k). 
e The flow (in bps) of commodity k on path p € P, is denoted Ge 
e The cost of sending one unit of flow of commodity k from origin O(k) to destination 


D(k) on path p is denoted ch = > bf. 
GI)EP 
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General Model: Path-Flow [GM:PF] 


Minimize { we a, Me + S- SS Ca + x tj ivi} 


{i,j} EB,lEL kEK pe Pr iE€M 


subject to: 
So gh =dp WheK 


pe Pr 


EE bs Cent, and 


REK pEPr:(t,j)ep leL 


SS SM gk < Vici, Vij ek 


kEK pe Px:(3,i)€p leL 


So SO ok < Tuy, Vie M 


kEK pEPpiep 
weU' VleEL 
G20 V{i,j} EE, VREK 
O<ui, < pl, and integer V{i,j}e EB, VIEL 
v,=0orl Wie M 


EXAMPLE 


El: Suppose there are seven nodes (n = 7), two facility levels (L=2) with capacities 
C, = 1 and Cy = 10, U! = U? = , and the internodal demand is as follows. 


1} 2)]3 |4)5)6]7 
1} 0/0] 0 ;0}0]0)0 
2/10} 0 | 0)0/)0/0}0 
3/10) 10} 0 |0)0/0/0 
4} 20] 0 | 30)0/0/0|0 
5} 1} 0} 0 4)0)0/0)0 
6} 1] 0/0 )0)0/0)0 
7;/1]0/040)/0/;0/0 


In the Figure 12.3.1 below, which gives a feasible solution for this situation, each thick 
line represents ten units of capacity and each thin line represents one unit of capacity. 


REMARKS 


R7: Nodes 1, 2, 3, and 4 in Figure 12.3.1, represented by squares, are hub nodes; 
they are typically 2-edge connected by high capacity facilities and form the backbone 
network. Nodes 5, 6, and 7, represented by circles, are end-office or terminal nodes. 
Each end-office node is uniquely assigned to a hub. Each subset of end-office nodes 
assigned to the same hub forms the local access network, which is usually a tree. 


R8: Because configuring the entire telecommunication network simultaneously is 
computationally difficult, a decomposition approach that separates the full problem 
into the backbone network and the local access network design problems can be used. 
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Figure 12.3.1: Example showing a capacitated network. 


12.3.2 Uncapacitated Network Design 


In certain applications, we can assume that the edge facilities and nodes switches 
are uncapacitated. Three such scenarios are: 


(1) If ope dk < C1, then installing the lowest capacity facility on an edge permits 
us to send the maximum possible flow on it. 


(2) In the telecommunications setting, because fiber-optic cables have high transmis- 
sion capacity, their capacity may be practically unlimited for some applications. 


(3) We may be initially interested in designing the topology of the network only. (A 
later analysis, if necessary, will determine the edge capacities and flow routes.) 


DEFINITIONS 


D8: The uncapacitated network design model (UND) is the network design 
model that results from the assumption that the edge facilities and node switches are 
uncapacitated. 


D9: A linear programming relaxation (LPR) of an integer or mixed-integer 
program is the linear program that results from relaxing the integrality requirement. 


D10: = Let IP1 and IP2 denote two minimization integer or mixed-integer programs 
for a discrete optimization problem. If LP1 and LP2, respectively, denote their linear 
programming relaxations, then the LP1 relaxation is said to be at least as strong as the 
LP2 relaxation if zp p, > zypo. 


D11: The integrality gap for a minimization integer (or mixed-integer) program IP 
is the quantity (z1p a ZLPR)/ZLPR- 
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Uncapacitated Network Design [UND] 


For any of the three scenarios mentioned above, we are able to make certain assumptions 
about our design problem. The last of the three listed below is an additional assumption 
that we make for the purpose of this presentation. 


ASSUMPTIONS 
e The design will use only (the least cost) facility 1. 
e We can scale all the commodity demands dx to one, and correspondingly, we can 


scale the variable flow costs to by, co bE dk Vk EK,V{i, 7} © E, and set C) =C. 


e We assume U! = 9) for all 1 € £, and we suppress the facility index I. 


Minimize { > ajuy+ So ob ght 


{i,j}EE {i,j}EE,kEK 
subject to: 
Flow Conservation Constraints (FC) 

So fE < Cui and (UND1) 
kEK 

SO FE < Cui V{i,j} eB (UND2) 

kek 

fi 20 V{i,j} eB, VREK (UND3) 

uj =Oorl V{i,j} EE (UND4) 
FACTS 


F1: When all the flow costs are zero, K = n—1,O(k) = k, and D(k) = n for all 
k © K, the above formulation models the (polynomially solvable) minimum spanning 
tree problem. 


F2: If the values of the design variables u;; are known, then solving problem UND 
amounts to finding K shortest paths, one for each commodity k using cost bf on the 
directed graph G(N, A), where A is the set of directed arcs (i, 7), (7,2) for which u,; = 
1, {i,j} € E. 


F3: Replacing the aggregate “forcing” constraints (UND1) and (UND2) by the dis- 
aggregate constraints 
fi <u and (UND5) 
fii Suz VUij} eB, VkEK (UND6) 
results in an equivalent integer program, [UNDStr]. However, the linear programming 
relaxation obtained by eliminating the integrality restriction on the u,; variables for 
formulation [UNDStr] is at least as strong as the linear programming relaxation of 
formulation [UND]. 


F4: Suppose the costs satisfy the mild flow-cost conditions: 
(a) bf, = bf, = b,; for all k € K and for all edges {i,j} € E; 
(b) bij + bys > 0 for all {i,j} € EB. 
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In this case [BaMaWo89], if two commodities kj and kz share the same origin or the 
same destination, then there exists an optimal [UND] solution in which k; and kg flow 
in the same direction on every edge, that is, 


fe + fF? <uig Vika, ke (O(k1) = O(k2) or D(k1) = D(ke)), V{i,j} € BE (UND7) 


F5: Balakrishnan et al. [BaMaWo89] develop a very effective dual ascent method 
that approximately solves the dual of formulation [UNDStr] and includes constraints 
(UND7). Using local search (add-drop) heuristics on an initial solution provided by 
the dual heuristic results in low performance gaps on large-scale, randomly generated 
problems. 


F6: The results in [BaMaWo89] indicate that the integrality gap of formulation [UND- 
Str] with constraints (UND7) is, on average, small for randomly generated problems. 
[BaMaMi98] conduct a theoretical analysis of the uncapacitated network design model 
and develop heuristics with a worst-case integrality gap of VK. 


EXAMPLES 


E2: Consider a complete graph on three nodes 1, 2,3 with three commodities defined 
by O(1) = O(2) = 1, O(3) = 2, D(1) = 2, D(2) = D(3) = 3. Let the fixed costs equal 
one and the flow costs equal zero for all three edges, i.e., aig = a13 = a3 = 1 and 
bi = b13 = bo3 = 0. Setting Uj12 = U13 = U23 = 1/3 and bee = ts = i235 = 1 
gives a feasible solution of cost 1 to the linear programming relaxation of formulation 
[UND]. However, this solution is not feasible to the linear programming relaxation of 
formulation [UNDStr]. The optimal solution to the linear programming relaxation of 
formulation [UNDStr] costs 3/2 (wie = w13 = ue3 = 1/2, fe = fi = fe = SR = FE 
f33 = f33 = f3, = fe, = 1/2), which shows, in this case, that the relaxation of [UNDstr] 
is stronger than the relaxation of [UND]. 


E3: Constraints (UND7) strengthen the linear programming relaxation of formulation 
[UNDStr]. Consider a 3-node, 2-commodity network with O(1) = O(2) = 1,D(1) = 
2, D(2) = 3. Asin Example E2, ai2 = a13 = a23 = i. and by = b13 = bo3 = 0. 
The solution to the linear programming relaxation of the UNDStr model sets all three 
design variables to 1/2; if the model is enhanced by adding constraints (UND7), the 
linear programming relaxation obtains the optimal solution by setting uy2 = uji3 = 1. 


Multi-Level Network Design 


Another uncapacitated network design that occurs in many telecommunication and 
transportation settings is one in which the nodes are classified into a hierarchy of groups 
based on their importance, and different grades of facilities are available. The more crit- 
ical nodes need to be connected using higher grade (level) facilities. The Multi-Level 
Network Design (MLND) model considers topological network design applications in 
such hierarchical settings. 


DEFINITIONS 
D12: Ina multi-level network design (MLND) problem, the facility levels are 


1 = 1,2,...,£, the node-set N is partitioned into N,, No,..., Nz non-empty levels 
(groups), and we assume that the higher indexed node groups are more critical than the 


Section 12.3. Communication Network Design Models 1503 


lower indexed ones and the facility levels are indexed in increasing order of grade (and 
expense). The objective is to assign a facility level to each selected design edge subject 
to the constraint that every pair of nodes 7 € Ny and j € Nj can communicate along 
a path that uses only facilities at level at least min(I’,/’’). We assume that there are no 
flow costs and that the facilities are uncapacitated. 


l 


NOTATION: Installing a level-I facility on each edge {i, j} of the network costs a;;, where 


1 2 E 
ay; Sai; Ss Say. 


D13: The MLND problem has a proportional cost structure if the ratio (al; / aa); 
1 =2,...,L, is the same for all edges. 


D14: The Steiner tree problem: Given a weighted graph in which a subset of ver- 
tices is identified as terminals, find a minimum-weight connected subgraph that includes 
all the terminals. In an optimal solution, the non-terminal nodes are called Steiner 
nodes. 


Multi-Level Network Design Model [MLND] 


In the formulation of the MLND problem shown below [BaMaMi94a], the variable Uj 
equals 1 if a level-l facility is installed on edge {i,7} and 0 otherwise. We also define 
commodities k = 1,2...,n—1 such that for each commodity k, O(k) = n, D(k) = k, 


and dy, = 1 Vk. 


L 
areeie bol 
Minimize ) ) Qj Wis 


{i,j}EE l=1 
subject to: 
Flow Conservation Constraints (FC) 
ths wy ad 
IXV<L 
fs Soul; V{i,j}¢ B, Vk: D(k) EN 
IXV<L 
k . . 
fi; = 0 V{i,j} © B, VkEK 
ui, =Oor 1 Vii,g} © BE, VWi=1,2,...,L 
FACTS 


F7: Since the edge costs are nonnegative, the edges chosen by the optimal MLND 
solution define a tree. 


F8: For each 1,! = 2,3,...,Z, the subtree in the optimal solution defined by facilities 
at level J is embedded in the (sub) tree defined by facilities at level | — 1. 


F9: When L = 2 and a; = 0, the MLND problem is equivalent to the Steiner tree 
problem with Nz defining the terminal nodes and Nj, defining the Steiner nodes. 


F10: Since the MLND problem generalizes the Steiner problem, the MLND problem 
is NP-hard. The problem continues to be NP-hard when L = 2,|N2| = 2 and (i) the 


. on 2 Pa 1 — . . 
costs are proportional or (ii) aj; = 1 and a;; = 0 or 1 for all {i,j} € B [Or9]]. 
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MLND Composite Heuristic 


The following composite heuristic [BaMaMi94a] for L = 2 takes the better of two 
heuristic values to develop a worst-case performance bound for the heuristic solution 
value relative to the optimal solution value. 

Step 1. Minimum Spanning Tree (Forward) heuristic: Treat the level-1 nodes as 
level-2 nodes, and find the minimum spanning tree in G(N, £) using costs aj;. Set 

; ; = 1 on all edges of this tree to get a feasible solution. 

Step 2. Steiner Overlay (Backward) heuristic: Find the Minimum Spanning Tree 

TuMLNp spanning all nodes using costs Gi; Using the incremental costs aj, = ai; for all 


U 


edges in the minimum spanning tree, and as, otherwise, solve a Steiner tree problem 
with N2 as the terminal nodes and Nj as the Steiner nodes. Install level-2 facilities on 
the edges of the Steiner tree and level-1 facilities on the remaining edges of Tyyztnp if 
they do not create a cycle. 


COMPUTATIONAL NOTE: If p < 2 denotes the worst-case performance ratio of the 
Steiner tree solution method used in Step 2, then the worst-case performance ratio of 


4 
the MLND composite heuristic is Z for proportional costs and p+ 1 for general 


costs. 


COMPUTATIONAL NOTE: Although, for simplicity, we have described the heuristic for 
L = 2, it generalizes to an exponential run-time recursive heuristic for the general 
MLND problem (the composite heuristic iterates O(2") times). For the case when 
L = 3,p = 1 and costs are proportional, this generalized recursive heuristic provides a 
worst-case performance ratio of 1.52241 [Mi96]. 


REMARKS 


R9: As with the uncapacitated network design problem, we can “directize” the 
edges of the MLND model (i.e., replace each undirected edge by two oppositely directed 
arcs). A solution approach based on the dual of this directized formulation solves large, 
randomly generated test cases, producing solutions that are within 0.9 percent of the 
optimal solution for Z = 2 and within 6 percent of the optimal solution for L = 5. 


R10: The MLND model can be generalized by imposing more restrictive connectivity 
requirements. For example, we may require that the subgraph defined by the I-level 
facilities be kj-connected for a pre-specified integer k,, Vl. The extant literature has not 
addressed this generalization even when L = 2, ky = 1, and kz = 2. 


12.3.3. Survivable Network Design (SND) 


Communication networks designed purely from a cost minimization perspective to 
satisfy commodity demand tend to be sparse (due to economies of scale), and an edge 
(representing a transmission facility) or a node (representing a communication switch) 
failure can lead to interruptions in communication service. Therefore, network designers 
build in redundancy by providing alternate communication paths in the network, so that 
the network can continue to satisfy communication demands even after a failure. The 
degree of network redundancy depends on the trade-off between network cost and the 
importance of maintaining the required connections between pairs of nodes. Since the 
probability of simultaneous failure of two or more elements (edges or nodes) is very small, 
it is generally assumed for network planning purposes that only one failure occurs at a 
time. 
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Uncapacitated Survivable Network Design [SNDUnc] 


The mathematical program given below is based on the concept of an edge-cut, a set of 
edges whose deletion disconnects the network. Edge-cuts and their relation to connec- 
tivity and internally disjoint paths (Menger’s theorem) are discussed in §4.1 and §4.7. 
The connection between edge-cuts and the algebraic structure of a graph is presented 
in 86.4, and their role in finding maximum flows in networks is discussed in §10.1. 


DEFINITIONS 


D15: A network is said to be a survivable network if it can continue to satisfy 
demand even when one of its edges or nodes fails. 


D16: The connectivity requirement between nodes i,7 € N,i 4 j, denoted by a 
nonnegative integer r;;, is the minimum number of edge-disjoint paths needed between 
i and j. 


D17: When the maximum connectivity requirement is no more than two, the problem 
is said to be a low connectivity survivable [LCS] network design problem. 


D18: Let G = (N, E) be a network and let S be a proper nonempty subset of the 
node-set N. The (edge-)cut defined by S, denoted (5, N\S), is the set of edges defined 
by 

(S,N\S)={{i,j} ee E|ieSandjeNn\S} 


NOTATION: For node subsets A and B, let [A, B] denote the set of node-pairs given by 
[A, B] = {i,j |7€ A andj € B} 
REMARKS 


R11: Since the graph is undirected, we can assume that the connectivity requirements 
are symmetric, that is, rij = rj. 


R12: When both i and j are backbone nodes, r;; is at least two, and when either of 
them is a local access node, r;; is typically one. In addition, r;; = 0 if either 7 or 7 is 
an optional (Steiner) node that the network can, but is not required to, use. 


Cut-Based Formulation of SND [SND-CUT] 


If S CN, the basic cut formulation of the survivable network design (SND) problem is 
given below. 


Minimize ; Ajj Uis 
{i,j} Ek 


subject to: 


iGo PF for all : ty SCN 
- Ui > le gt j} for all proper nonempty 
{i,j} €(S,N\S) 


uy =Oorl forall {i,j} eB 
FACTS 


F11: If there exist node levels r;,i € N such that r;; = min(r;,r;), then an optimal 
solution to SND has at most one 2-connected component. 
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F12: For a slightly more general connectivity requirement function, using a primal- 
dual solution approach, [WiGoMiVa95] develop a worst-case performance bound of 2R, 
where R denotes the maximum connectivity level; [GoGoP]ShTaWi94] improve this 


1 1 1 
bound to 2(1 4 mic 


EXAMPLE 


E4: Figure 12.3.2 shows an example of a survivable network with R = 3 and with 
two 2-connected components. 


(>) (10) Connectivity requirement between 


pairs of nodes = 3 
pairs of /\ nodes = 2 
pairs of @ nodes = 1 


Figure 12.3.2: Solution to an SND problem. 


SND Iterative Rounding Heuristic 


[Ja01] develops a linear programming based iterative rounding heuristic, presented be- 
low, that has a (tight) worst-case performance bound of two relative to the linear pro- 
gramming relaxation value of formulation [SND-CUT]. The two-step strategy repeats 
until the heuristic finds a feasible integer SND solution. 

Step 1. Find an optimal basic solution u,;, {i,j} € E to the linear programming relax- 
ation of formulation [SND-CUT]. (Comment: Except for the first iteration, some of the 
uij variables would have been set to one in Step 2.) 


Step 2. For all edges {7,7} with u,; > 5 fix us; to one. 
FACTS 


F13: Using insightful arguments, Jain shows that in any optimal basic solution to 
the formulation [SND-CUT], at least one uj; variable is at least 4. Rounding this 
edge to one at most doubles its contribution to the solution value. Jain departs from 
traditional rounding heuristics that solve one linear program and then simultaneously 
round all fractional solutions suitably. By setting only those variables with value at 
least a half to one and resolving the linear program, he finds the “valuable” edges in 
each iteration, and provides the first known SND heuristic with a constant worst-case 


performance bound. 
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F14: For specific connectivity requirements, or for specific cost parameters, or when 
defined over special classes of graphs, the SND problem is polynomially solvable. For 
example, when r;; = 1 for all pairs of nodes 7 and j in N, the SND problem simplifies 
to the minimum spanning tree problem, and when r;; = 1 for a specific pair of nodes 
i and j in N amd 0 otherwise, the SND problem is the shortest path problem and is 
thus polynomially solvable. The SND problem defined on Halin and on series-parallel 
graphs is polynomially solvable for several situations, such as when all connectivity 
requirements are even, or are equal. See [KeMa05] for an excellent survey of these as 
well as other results relating to the SND problem. 


F15: [DikKeMa08] provide a complete linear description of the integer polytope and 
show that the SND problem on a subclass of series-parallel graphs, including outerplanar 
graphs, is polynomially solvable when the connectivity requirements are either one or 
two. 


NOTATION: Let G = (N, FE) be a network with requirements r;; as defined above, and let 
A,B CN. Let [A, B],>2 denote the set of node-pairs i, 7 whose connectivity requirement 
is at least 2. That is, [A, B]->2 = {1,7 |i € A,j € B and rj; > 2}. 


FACT 


F16: The following constraint models the redundancy requirement that the network 
contain at least two node-disjoint paths between every pair of nodes i and j for which 


Ui = 1 
{i,j}EF : ije[S, (N\{SU{z}})] 
VzEN, for all nonempty SZ N\{z} : [S, (N\{SU {z})]-o2 #0 


Flow-Based Formulation of SND [SND-FLOW] 


By using flow variables, we can model the SND problem as a special case of the general 
network design model. 


NOTATION: For every pair of nodes with r;; > 0, define a commodity k such that 
dy =7ij, O(k) =i, D(k) = 7, and set L = 1. 
Minimize ey Ajj Uig 
{i,j}EE 


subject to: 
Flow Conservation Constraints (FC) 


fk <j and 

fi <uy V{ij}eB, VkEK 
k os 

fi; 2 0 V {i,j} © E, VkEK 
usg =O or 1 V{Uij}eEk 
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REMARKS 


R13: Using a novel connectivity upgrade strategy, [BaMiNa09] develop several classes 
of valid inequalities to strengthen this flow-based formulation and test their computa- 
tional effectiveness for the case when the SND solution has at most one 2-connected 
component. 


R14: [MaRa05] develop another flow-based formulation, doubling the number of 
commodities with connectivity requirements at least two and show that this formulation 
is stronger than formulation [SND-FLOW] when some connectivity requirements are at 
least one. 


Survivable Network Design: Bounded Cycles 


A limitation of both the SND formulations discussed above is that they allow long 
cycles. Thus, in an extreme case, the solution to an LCS network design problem might 
be a minimum cost Hamiltonian cycle through all the nodes. In such a solution, any 
edge failure requires the rerouting of the affected demands using long alternate paths. 
To prevent long cycles, we can impose the condition that every chosen edge belong to 
at least one cycle with length bounded by a specified constant. 


NOTATION: We use the following notation to impose this requirement: 


e For each edge {i,j} € E, Y;; denotes the set of cycles that contain {i,j} and 
satisfy the length bound. 


e For edge {i,j} € E and cycle C€ Y;,, VG denotes a binary variable that is one if 
cycle C is included in the solution and zero otherwise. The following (exponentially 
sized) set of constraints imposes the bounded cycles condition. 


Sou zu Vij} € E 
CeY;; 
So ng Susy VET EE, ft, 7} E\ {fas} 
CEYi; {i',7/}EC 
yG =Oorl V{i,j}EE, CEeYy 


REMARKS 


R15: — Fortz et al. [FoLaMa00] design a branch-and-cut approach for the design of 
minimum-cost bounded-cycles networks that contain two node-disjoint paths between 
every pair of nodes. 


R16: Another way of making a network robust to failures is to limit the number of 
edges (also called the number of hops) in the path used to satisfy the demand for each 
commodity (e.g., ([BaA192], [Go98}). 
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12.3.4 Capacitated Network Design 


In our discussion so far, we have not considered the capacity constraints appearing 
in the general models of §12.3.1. Many practical applications require activation of these 
constraints; such capacitated situations result in very hard optimization problems even 
when there are no node capacities and the edge capacity levels may be chosen from a 
continuous range (see, for example, [Mi81]). We describe three models below in each of 
which the capacities are available at discrete levels. 


Network Loading Problem [NLP] 


In telecommunication settings, we often encounter situations where we have L different 
types of facilities available such that their capacities are modular, that is, the capacity of 
a level-(1+ 1) facility is a multiple of the capacity of a level-(1) facility, ] = 1,2,..., 2-1. 
Moreover, we can install (load) any number of facilities on an edge. We refer to this 
problem as the Network Loading Problem (NIP). The number of different types 
of facilities is often small (less than five) in practical situations. For example, in some 
telecommunications settings, T1 and T3 facilities may be the available transmission 
facilities. A T3 facility has 28 times the capacity of a T1 line, but costs less than 28 
times the cost of a T1 line. This results in economies of scale in the fixed costs structure 
for any edge. Figure 12.3.3 below depicts an illustrative cost structure when we have 
two facilities and the capacity of the higher level facility is 12 times the capacity of the 
lower level facility. Note that, in this example, the break-even point is eight. 


Total Cost 


2: 8 12 20 


Edge Capacity 


Figure 12.3.3: Cost structure for a network loading problem. 


The following formulation models the Network Loading Problem when L = 2 with 
C, =1 and Co =C. 


Minimize { > (aj 4; + az ;us,) + S- eta 
{i,j}EE {i,j}EE, kek 
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subject to: 
Flow Conservation Constraints (FC) 

fi < uj;+Cuz, and 
kEK 
Sof < ui t+ Cul, V{i,j}eE 
kEK 

f& > 0 V{i,j} © E, VREK 
0< OF < pe and integer V{i,jg} € E,l=1,2 

EXAMPLE 


E5: Consider the single-commodity case (i.e., K = 1) with L = 1,C, =C. Ifd denotes 
the commodity demand, intuition suggests that the solution will use at most two paths: 
one carrying ({d/C] — 1)C units of flow, and the other carrying d— ({d/C] —1)C units 
of flow. The following example ({[Mi89]) (shown in Figure 12.3.4 below) shows that this 
intuition is not correct. In this example, the commodity origin is node 1, the commodity 
destination is node 4, d= 3 and C = 2. 


Fixed cost Variable cost 


Flow value 


Cost Structure Optimal Flows 


Figure 12.3.4: An optimal solution with three flow paths. 


FACTS 


F17: [MaMiVa95] and [BiGu95] develop and implement a polyhedral approach for 
solving the network loading problem, and [MaMiVa92] provide a complete polyhedral 
description of two of its core subproblems. 


F18: The single-commodity network loading problem with one facility type and flow 
costs, or with two facility types and zero flow costs is NP-hard ([ChGiSa98]). [At02] 
develops a complete linear description for the single facility, single commodity cut-set 
polytope. 
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Capacitated Concentrator Location [CCL] 


The design of local access networks (recall that local access networks are frequently 
trees) requires connecting the end-user nodes (terminals) on the local access tree to 
consolidation points, called concentrators on the tree. These concentrators compress 
the data signal and transmit it directly to the backbone network hub to which the local 
access tree is connected. 


DEFINITION 


D19: In the capacitated concentrator location problem (CCL), we are given 
the local access tree G(N, E) with node-set N and edge-set EF, the demands d; at the 
terminals i € N, aset M C N of possible sites for locating concentrators, the cost t; 
associated with installing a concentrator of capacity T’ at node 7 € M, and the cost aj; 
of connecting terminal 7 to concentrator 7. We want to determine the minimum cost 
location of the concentrators and assign each terminal to exactly one concentrator. The 
assignment should not violate the concentrator capacity constraint, nor the contiguity 
constraint: if terminal 7 is assigned to concentrator j, then all terminals on the unique 
path in G, P,;, from node 7 to concentrator j are also assigned to the concentrator at 
node 7. 


NOTATION: Let vj equal one if a concentrator is located at node 7 € M and 0 otherwise, 
and let u;; equal one if terminal i € N is served by a concentrator at site j € M. 


Capacitated Concentrator Location Model [CCL] 


The (CCL) problem can be formulated as the following 0-1 integer linear program: 


Minimize { S- » O45 Wig + S> tv, 


i€N jEM jEM 
So uj=1 WEN (CCL1) 
JEM 

So diaiyg< Tv; WiEeM (CCL2) 


iEeN 
Wig SV; VWiEN, VIE M 
Uij > uz; Wi €P,; 
uj =Oorl WieNVWjiEM 
vj =Oorl Wye M 


REMARKS 


R17: In formulation [CCL], the first set of constraints, CCL1, ensures that each ter- 
minal is connected to exactly one concentrator, and the CCL2 models the concentrator 
capacity constraint. The CCL3 constraints ensure that terminal 72 can be assigned to a 
concentrator only if it has been installed. The CCL4 constraints model the contiguity 
requirement. 


1512 Chapter 12. Communication Networks 


R18: This problem can be considered to be a generalization of the bin-packing 
problem but with an additional cost incurred for making the item and bin assignments 
and the contiguity constraint on the items. 


R19: Another problem that arises in local access networks is the capacitated minimum 
spanning tree problem (AlGa88g], [Ga91]). In this problem, there are n — 1 commodities 
and one of the nodes of G(N, E), say node n, is the destination node for all commodities. 
The origin node for commodity k is k, k = 1,2,...,n —1 and its demand is dy. The 
cost of using edge {7,7} is a;; and all flow costs equal zero. We need to select the least 
cost subset of E such that (i) the subset of edges form a spanning tree of G(N, E), and 
(ii) the sum of the demands of the nodes included in each subtree formed by deleting 
all the chosen edges incident to node n does not exceed a pre-specified capacity limit. 
A number of heuristic and optimization based approaches have been developed for this 
problem. 


Survivable Network Design (Capacitated) 


When we are designing capacitated networks, one way of improving survivability is to 
have two edge-disjoint paths, a working path and a backup path from O(k) to D(k), each 
with a dedicated capacity of d, for meeting the demand for commodity k. While this 
approach, called 1+1 Diverse Path Protection, provides the necessary survivability 
instantaneously, it does so at a high cost because it more than doubles the capacity 
(the backup path is typically longer than the working path) in the network. Therefore, 
network designers have devised a number of other ways of imposing the survivability 
condition in capacitated networks. We discuss two of these approaches. The first 
uses self-healing rings (SHRs) (which may still provide dedicated protection capacity 
against a failure) and the second one limits the amount of disrupted flow by using a 
diversification and reservation strategy. 


DEFINITIONS 


D20: = Self-Healing Rings Approach. Self-healing rings (SHRs) are cycles in the 
network formed by groups of nodes. Different SHRs may share edges (and thus be 
connected to each other); together the SHRs cover all the demand nodes. A switching 
device (called Add-Drop Multiplexer) is placed at the nodes that connect two SHRs and 
allows the signal to be transferred between the SHRs. Each edge in an SHR permits the 
signal to flow in both directions. Hence, each pair of nodes in an SHR is connected by 
two edge and two node disjoint paths. Therefore, any signal flowing through a ring is 
protected against a single edge or a single node failure on that ring. A number of different 
design problems arise when SHRs are used (see, for example, [SoWySeLaGeFo98)). 


D21: Diversification and Reservation Approach. Diversification splits the flow 
of commodity & such that no more than a fraction 6, flows through any edge or node 
(except O(k) or D(k)). Reservation reserves enough spare capacity in the network 
such that it can reroute at least a fraction of p, of commodity k if an edge or a node 
fails. 
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EXAMPLE 


E6: [PaJoAlGrWe96] Figure 12.3.5 illustrates the difference between diversification 
and reservation. For this example, K = 1,O(1) = 1, D(1) = 4, and d; = 2. 


Edge. Edge 
Capacities Gepanites 


PCE nCe 


Diversification: Failure State. 
Failure of any edge still allows 1/2 
the original demand to be satisfied 


Diversification: Normal State 


Edge. Edge 
Capacities Capatities 


Og BOs 


Reservation: Normal State Reservation: Failure State. Failure 
Working Path: 1-3-4 of any edge on the working path still 
Backup Path: 1-2-4 allows 1/2 the original demand to 


be satisfied 


Figure 12.3.5: Two ways of enhancing network survivability. 


Diversification and Reservation Model [DR] 


In the model given below, all flow costs are zero. 
NOTATION: The model uses the following notation: 
e 6 € O denotes the operating state of the network, where 6 = 0 denotes the normal 
operating state (when all edges and nodes are operational). 
e The state when node i, i € N, breaks down is denoted 6 = i, and 0 = {i,j}, 
{i,j} © E, denotes the state when edge {7,7} breaks down. 
e G(O™) = (N(@), E(@)), where N(@) and E(@) are the sets of nodes and edges, 
respectively, that are still operating under state 6. 
e P,(0) denotes the set of feasible paths from O(k) to D(k) under operating state 6. 
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L 
a gatos Toe 
Minimize y y Us 


subject to: {i,j}eB l=1 
S- S- gk (0) < SO Cui, = V {i,j} € B, VOC O (DR1) 

KEK pe P, (0):{4,j}€p leL 
S” gh(0) = dy WhEK (DR2) 

pe Px (0) 
SS” gh(9) = prdx VkEK, V0EO\ {0} (DR3) 
pePy (8) 
SS” gk) < Onde = VRE K, Wie N \ {O(k), D(k)} (DR4) 
pe Px, (0):i€p 
gh(0) < dede Wk EK,p={O(k), D(k)} (DR5) 
g(@) > 0 V{i,j} EE, kEK, VOEO (DR6) 
uj; > Oandinteger V {i,j}E BLEL (DR7) 
REMARK 

R20: In the formulation above, (DR1) are the capacity constraints. Constraints 


(DR2) ensure that the full demand of each commodity is routed under normal operating 
conditions (no failures), and constraints (DR3) ensure that at least a fraction py, of 
commodity k is routed under all other operating states. The next set of constraints 
(DR4) ensures that no node (and hence no edge other than the direct edge {O(k), D(k)}) 
carries a flow of more than 6,. Constraints (DR5) ensure this diversification for direct 
edges. The remaining constraints are nonnegativity and integrality constraints. See 
[StDa94] and [AlGrJoPaWe98] for a discussion of cutting plane approaches for solving 
such models. 


p-Cycle Protection Model [PP] 


DEFINITIONS 


D22: Directed version ofG. Given an undirected graph G(N, E), let the correspond- 
ing directed graph be G'pi,(N,A) where A, the set of directed arcs, is {(i,7),(j,2) : 
i,j € E}. 


D23: p-cycle. A p-cycle (or, a pre-configured protection cycle) is a simple, directed 
cycle in a directed graph Gpir(N, A) with at least three arcs. 


EXAMPLE 


E7: Installing sufficient capacity on a p-cycle permits rerouting the flow in case of 
an edge failure. In Figure 12.3.6, the p-cycle is the cycle defined by {1,2,4,5,7,6}. In 
case an edge corresponding to a p-cycle arc, say edge {1,2} fails, the flow from node 1 
to node 2 is rerouted on the path 1— 6 —7—5 —4-—2 and the flow from node 2 to node 
1 is is rerouted on the path 2—4-—5—7-—6-—1. Ifa chord of the p-cycle, say edge 
{1,7} fails, the flow from node 1 to node 7 is is rerouted on the path 1—2—4—5-—7 
and the flow from node 7 to node 1 is is rerouted on the path 7 — 6 —1. Note that the 
same p-cycle can protect against the (non-simultaneous) failure of mupltiple edges. 
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Figure 12.3.6: Directed p-cycle for re-routing flows. 


NOTATION: Model [PP] [AtRa08] uses the following notation: 


e Let © denote the set of p-cycles of Gpi,(N, A) with at least three arcs. 
e Let a denote the amount of capacity reserved for p-cycle w € W. 
e Let ay .= 1 if arc (7,7) belongs to p-cycle w € W and 0 otherwise. 


e Let ae = 1 if edge {7,7} is a chord of p-cycle | € WV and 0 otherwise. 


Minimize { Se ayuyt ok fi} 


{i,j} EE lEL {i,j}EE,kEK 
subject to: 
Flow Conservation Constraints (FC) 
LA S Liajwt Bie Wipe k 
kEK pew pew 
fi S Lieut DBiew Wipe k 
kEK pew pew 
Vi + Vokws Lew, vijpe# 
kEK pew leL 
Lehi + Lakes DCm; Vide # 
kEK pew leL 
k bo 8 
fi; 20 V{i,j} © BE, VREK 
Zp = 0 Vwew 
0< ui, < thy and integer V{ii,j} ek, VleL 
REMARK 
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(PP1) 
(PP2) 
(PP3) 
(PP4) 
(PP5) 


(PP6) 
(PP7) 


R21: The objective function in this formulation minimizes the sum of the variable 
flow and the capacity installation costs. In addition to the flow conservation constraints, 
the formulation’s constraints (PP3) and (PP4) ensure that flow can be rerouted on the 
associated p-cycles if an edge failure occurs. Constraints (PP5) and (PP6) ensure that 
the installed capacity on an edge exceeds the sum of the commodity flows and the 
reserved capacity on the p-cycles associated with that edge. [AtRa08] develop a cutting 
plane approach for solving the capacitated survivable network design problem using 


p-cycles. 
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INTRODUCTION 


We start by making a distinction that graphs are elements of pure or abstract math- 
ematics and networks are modeling tools in applied mathematics. There is insight into 
these structures obtained by thinking about their roles in light of the above statement. 

One way to differentiate a network from a graph is in the complexity of the def- 
inition. In graph theory, a mathematically rigorous, elegant, and simple definition of 
the graph structure is sought and used: a graph is a set of nodes and and a set of 
edges [ChZh05, Di97, We01]. In network science, a more complex, multi-perspective, 
multi-element definition is sought and used for a network. To show some of that com- 
plexity, a common way to define a network is to establish its components (its graph — 
nodes and edges, processes, and data/attributes associated with nodes and edges), its 
properties (dynamic, layered), and its application (social, propagating, organizational). 
This definition requires more than an adjacency matrix to meet all of these definitional 
elements. A network also typically has some kind of database of attributes and/or a pro- 
cess algorithm. Another approach often used to define a network is to use the concept 
of a mathematical graph (its nodal-link structure) with its topological features and then 
classify the various types of graphs that occur (random, scale free, small world, scale 
rich, etc.). A foundational research management report on network science [CoNS05] 
defined networks using a layered architecture of network roles — physical, communica- 
tion, informational, biological, and social/cognitive — that connect the layers together 
to produce the overall network framework. See Figure 12.4.1 for a schematic of that 
architecture. 
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Figure 12.4.1: The layered elements of a network. 


These approaches and elements of the various definitions show both the power and 
complexity of networks as a modeling paradigm. Network Science analyzes the nodes 
and edges with their associated attributes (weights, direction, roles, capabilities) along 
with the additional element that the network itself has a systematic process to meet its 
mission or purpose. Like graphs, networks are often represented by matrices (adjacency, 
weighted) and can possess specialized topological classifications [ChLu06]. However, an- 
other distinction is the dynamical nature of networks. Understanding and modeling the 
changing spatial-temporal behavior of the network structure is a significant component 
of network science [B03]. This element of dynamics perhaps produces the most signif- 
icant challenges for the network scientist. Choices in representing and measuring the 
changing elements of the network can significantly influence the nature and effectiveness 
of a network model [NBW0O6]. 
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12.4.1 Network Measures and Properties 


No matter what definition or framework is used, one of the basic steps in network 
modeling and analysis is to determine, calculate, or measure basic network properties. 
Most networks are sufficiently complex that simply relying on visualization can produce 
counter-intuitive conceptions of the network capabilities and performance or worse yet 
complete misunderstanding of the properties such as the relative importance or roles 
of certain network nodes and edges. Precisely computing and measuring well-defined 
properties counter those potential visual misperceptions and this in turn improves and 
strengthens network modeling and analysis [BE05]. 

Since terminology is important in mathematics and science, especially to expedite 
progress in an emerging discipline, network science gives thanks to its well-established 
graph theoretic foundation. Many of the measures developed in network science (often 
from graph theoretic roots) are very precise with careful definitions which contain helpful 
normalizations and clear notations. 

In this section, we define and discuss several mathematical measures for networks 
and, in particular, try to provide insights and definitions for basic network properties. 
Network properties can be classified many ways: structural (nodal and link), process, or 
data; local, global, or regional; discrete or continuous; dynamic or static. We attempt 
to sort out these classifications and use them in our network measures schema. 


Structural Measures: Centrality 


Let’s consider the underlying system used to measure structural (geometrically related) 
properties of networks. These basic structural measures are important aspects of net- 
works as they make up the bulk of the measures in the literature [VCLC08]. We will 
discuss the most basic elements such as degree, closeness, betweenness, and eigenvector 
centralities. For these network measures, centrality refers to the geometric center or the 
level of importance (as in playing a central role). Unfortunately, even this broad defini- 
tion may be neither sufficient nor enlightening, since network scientists have pronounced 
most network measures as centralities. Since these definitions are precise, we will in- 
troduce them using graph theoretic notation. All graphs in this section are undirected 
and simple. 


DEFINITION 


D1: Given a graph G = (V, EF) and a node v, the degree centrality of v is simply its 
degree, i.e., Cpo(v) = deg(v). 


REMARK 


R1: Degree centrality provides some measure of the importance of each node. 


DEFINITIONS 
D2: Given a node v, its farness is F(v) = oy cy_y Adu, v). 


D3: Given a node v, its closeness centrality is Co(v) = 1/F(v). 
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REMARKS 


R2: Closeness centrality provides some measure of how fast information can spread to 
the rest of a network from a given node. 


R3: The given definition is not meaningful for unconnected graphs; the literature 
contains many modifications and alternative definitions to address this problem. 
DEFINITIONS 


D4: Given a graph G = (V, £) and distinct nodes s and t, we define o,; to be the total 
number of shortest paths in G from s to t. 


D5: Given a graph G = (V, £) and distinct nodes s, t, and v, we define o,;(v) to be 
the total number of shortest paths in G from s to t that pass through v. 


D6: Given a graph G = (V, £) and a node v, the betweenness centrality of v is 


Ca(v) = S- ay) 


Ost 


REMARKS 


RA: The betweenness centrality of a node v measures the sum of the fraction of shortest 
s—t paths that pass through v. 


R5: There are several ways this metric can be normalized to provide a means of 
comparing C’g across graphs of differing order. 


DEFINITION 


D7: Given a graph G = (V,E), we can compute the eigenvector centrality by 
solving the system ry = + Duev QuyLu Where « = (21,22, %3,..., Xn] is a vector and 2; 
is the eigenvector centrality of vertex 7. 


Here the a;; values are the entries in the adjacency matrix A. 
REMARK 


R6: While there can be several values of A that admit solutions for the vector x in 
the above definition, the standard approach is to use the largest eigenvalue of A for » 
as this results in an x that has all positive components. 


12.4.2 Other Structural Measures 


We look at the concept of structural balance within local regions of a network to 
determine network properties that are not nodal or edge-based centralities. The idea 
is to determine if analyzing the regional structural relationships can provide measures 
for valuable properties of the network. We will do so by looking at 2 nodes at a time 
(dyads), 3 nodes (triads), and other size subnetworks (groups). We also present the 
process of finding structurally related nodes by looking at clustering algorithms. 


Section 12.4. Network Science for Graph Theorists 1523 


Reciprocity on Dyads 


In this section we consider directed networks. One basic question for dyads is: if one 
node is linked to another, is that relationship reciprocated by a link back? In more 
precise terms, if there is a link from node wu to node v, is there a link from node v back 
to node u? This characteristic is only relevant to directed networks. The idea is to 
aggregate the results for all the individual nodes to determine statistically the overall 
symmetry level of the network [WaFa94]. A major motivation for studying this property 
is to understand mutual relationships in social networks. The overall data analysis for 
a directed network of n nodes with k links is based on its density AG’ There are 
D = (5) possible dyads that can be classified as mutual (/) (reciprocated), asymmetric 
(A) (nonreciprocated), or null (V) (not present) with M+A+N = D. There are several 
classic examples of calculations for a measure called the Index of Mutuality (I) based on 
whether there are fixed or free choices for the number of outbound arcs at each node. A 
fixed number of links could come from a survey on a network of friends where the survey 
asks each person (node) in the network to list his or her top d friends for fixed d. Free 
choice links come from similar survey data where there is no restriction on the number 
of friends that can be listed. By calculating the random independent probability that 
a dyad is reciprocated for a fixed choice of d friends, an expected value of M can be 
calculated. 


DEFINITIONS 
D8: For a fixed-choice social network, the index of mutuality 


2(n — 1)M — nd? 


ae nd(n — 1-4) 


D9: For a free-choice social network, the index of mutuality 


2(n —1)?M — L? + Le 
L(n— 1)? — 124+ Ly 
where L is the sum of the number of friends chosen and Lz is the sum of the squares of 
the number of friends chosen. 


REMARK 


R7: A convenient feature of this measure for both fixed and free choice is that the 
values of I are benchmarked and normalized by —oo < I < 1, where the maximum 
reciprocity level is given when J = 1 (meaning wv € E <> vu € E) and reciprocity less 
than that expected by random ties gives I < 0. 


The following two examples show a network with low Mutuality Index under free choice 
and a network with high Mutuality Index under fixed choice. 
EXAMPLES 


E1: The graph in Figure 12.4.2 shows a free choice directed social network that results 
in J = —0.087. 


E2: The graph in Figure 12.4.3 shows a fixed choice directed social network that results 
in J = 0.72. 
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2 3 


Figure 12.4.2: A free-choice social network with J = —0.087. 
2 3 


Figure 12.4.3: A fixed-choice social network with I = 0.72. 


The Mutuality Index is very helpful in comparing networks or judging networks to 
known norms for reciprocity. Network modelers can use the reciprocity measure I to 
predict other behaviors in a social network, such as effective communication or decision 
making. 


Transitivity on Triads 


We now turn our attention to transitivity. Here we seek to understand the extent to 
which uv, vw € EF => uu € E. For social networks, the adage is “the friend of my friend 
is also my friend.” We seek to measure the level of transitivity in the entire network 
by finding the portion of transitive triads to the total number of triads in the network. 
This is equivalent to finding the proportion of closed triads in the network and can be 
found for both directed and undirected networks. One such measure is the network’s 
clustering coefficient [WaFa94]. 


DEFINITION 
D10: A network clustering coefficient C is the fraction of paths of length two that 


are closed. 


C= the number of closed paths of length 2 


the number of paths of length 2 
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REMARK 


R8: If C = 1, the network has perfect or maximal transitivity, or uwv,vw € E(G) > 
uw € E(G). C = 0 equates to no transitivity in the network, or wv, vw € E(G) > uw ¢ 
E(G). Many familiar network structures, such as trees and rectangular lattices, have 
C=0. 


Another measure is the local clustering coefficient which measures transitivity as a 
local property. 


DEFINITION 


D11: The local clustering coefficient C;(v) is the fraction of pairs of neighbors of 
a vertex v that are themselves neighbors. 


i jEN(v) i<j US 
(im Cl) 
2 
Here aij are the entires in the adjacent matrix A, and N(v) denotes the open neighbor- 
hood of v. 


Ci(v) = 


REMARKS 


R9: Nodes with lower than average values of C; lead to structural holes in the network, 
and statistical analysis of the values of C; in a network can provide additional insight 
into its structure. 


R10: Some network scientists define a clustering coefficient that is the mean of the 
local cluster coefficients. Nodes with less than 2 neighbors are typically ignored for this 
purpose. This value is different than the clustering coefficient C' defined above. 


Balance on Triads 


In some networks, the node connections are signed (positive or negative). Obvious 
examples of signed connections in a social network are friendship (positive) and ani- 
mosity (negative). These networks are called signed networks and some signed network 
structures exhibit balance and others do not. Lack of balance may lead to network 
instability or poor network performance. Situations that lead to balance are “the friend 
of my friend is my friend” and “the enemy of my enemy is my friend.” But when nodes 
have situations where “my enemy and my friend are friends,” the network exhibits im- 
balance. Network modelers often seek to measure the level of balance in the entire 
network by finding the fraction of triads that are balanced. 


REMARK 


R11: Analysis of cycles and other structural components can be performed on signed 
networks, both directed and undirected, to determine conditions for balance and imbal- 
ance. Another known result is that when a network is in balance it can be divided into 2 
friendly clusters (all positive connections within the cluster) with no positive friendship 
connections between the clusters (negative connections between nodes in the different 
clusters). 
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E3: Figure 12.4.4 shows a network with signed edges. The triad {3,5,6} is an example 
of a balanced triad; this represents three mutual friends. The triad {1,2,4} is also 
balanced; here friends 2 and 4 are both enemies of 1. The triad {4, 6,7} is unbalanced; 
7 is friends with both 4 and 6 but the latter two are mutual enemies. 


5 


y 


1 3 6 
: NZ 
+ + 
Figure 12.4.4: A network with signed edges. 


Communities and Partitions 


Finding structural properties by identifying natural groups (subnetworks) or dividing 
the network into communities or partitions are network science objectives. We distin- 
guish grouping or clustering (not all nodes are included) from community detecting and 
partitioning where we seek to place all nodes in the network. Partitions differ from com- 
munities since the number and/or size of the partitions are fixed in partitioning and not 
set a priori in community detecting. Graph partitioning algorithms often find the best 
division of the network given the set conditions of number or size of the partitions. The 
goal in community detection is to understand the structure of the network even though 
that sometimes results in a situation where no viable communities are discovered. 


REMARK 


R12: The usual goal of clustering is to find groups such that the densities of connections 
within the groups are substantially more than the densities of connections between 
groups or to the nodes outside the group. 


Large cliques (complete subgraphs) are not all that common in social networks. 
It is more common that groups, even cohesive ones, form at best near-cliques. Since 
some members of the group may not be acquainted with everyone in the group, but 
with almost everyone, a new construct was needed for “near-cliques,” giving rise to the 
notion of a k-plex. 


DEFINITION 

D12: A k-plex of order n is a subgraph of n nodes such that each node is adjacent to 
at least n — k of the others. 

REMARK 

R13: A k-plex with k = 1 is a clique. 
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DEFINITION 


D13: A k-core is a subgraph of n nodes such that each node is adjacent to k other 
nodes in the induced subgraph. 


REMARK 
R14: A k-core of n nodes is an (n — k)-plex on n nodes. 


In general, k-cores are easy to find through iteration. Start by removing all nodes of 
degree less than k, along with edges incident with those nodes. Repeat until all nodes 
have degree k or higher. The remaining network is either empty (in which case the 
original network has no k-core), or the remaining network consists of k-core(s). 


EXAMPLE 


E4: Figure 12.4.5 shows examples of cliques, k-plexes, and k-cores. The subgraph 
induced by {1,2,3,4} is a 4-clique. The subgraph induced by {1,2,3,4,7,8,9} is a 
3-plex on 7 nodes or, equivalently, a 4-core on 7 nodes. 


Figure 12.4.5: A graph containing a 4-clique and a 3-plex on 7 nodes. 


We will not cover the details of partitioning algorithms since partitioning is a main- 
stay of graph theory covered in other sections. Partitioning algorithms can be computa- 
tionally demanding and complex in their implementation. Community detection is also 
challenging and we merely give general descriptions of these algorithms. One commu- 
nity detection algorithm attempts to find sets of nodes with high modularity scores (a 
measure of the extent that similar nodes are connected to one another). Determining 
modularity can also be demanding from a computational standpoint. Another algorithm 
for community detection is hierarchical clustering, which is an agglomerative method. 
The main idea of this method is to form groups with high similarity and then iteratively 
combine groups to form larger ones. 
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12.4.3. Attribute or Data Measures 
Similarity 


In this section we look at the similarity of the attributes of nodes, not just their geomet- 
rical or structural similarities. However, we start with a similarity measure involving 
common neighbors of a node [N10]. 


DEFINITION 


D14: Cosine similarity is a measure of commonality of neighbors. It is computed 
by 


CSuy = Nuy 


V deg(u)deg(v) 


where ny, is the number of common neighbors of u and v that have a specified attribute. 


REMARKS 


R15: The specified property can either be structural, or a statistical attribute from 
the network’s database. For an example of the latter, we might be interested in cosine 
similarity related to friends who smoke cigarettes in a social network. In this case, ny» 
would count smokers who are friends of both u and v. 


R16: An example of a purely structural application of this measure is to let ny» count 
common neighbors (of all types). In this case, the measure is 


_ IN@OaN(e) | 


CSw 
deg(u)deg(v) 


R17: In the case of the previous remark, if C’S,,, = 1, then u and v have exactly 
the same neighbors, i.e., N(w) = N(v). If CSy, = 0, then u and v have no common 
neighbors, i.e., N(u)N N(v) = 90. 


Homophily 


A basic notion in network science is homophily — the principle that nodes tend to be 
similar to their neighbors. The adage “birds of a feather flock together” describes the 
idea. One measure of homophily is called modularity. 


DEFINITION 


D15: Modularity is a measure of homophily. It is computed by 


1 
Q=—(L— E(L)). 
m 
Here m is the number of edges in the network, L is the number of “like” edges, and 
E(L) is the expected number of like edges. 
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REMARKS 


R18: Vertices are categorized based on structural (related to the underlying graph) 
or other (related to the database mentioned in the introduction, or some other source) 
properties at the discretion of the modeler. An edge between two vertices that are 
sufficiently similar, based on the vertex categorization, are said to be “like.” E(L) is 
computed assuming edges exist independently of the properties that create like edges. 


R19: It is clear that —1 < Q <1. Relatively high values of Q show high homophily. 


R20: For social networks the tendency of people to form friendships with others similar 
in some characteristic(s) to themselves is termed selection. When people modify their 
behaviors/characteristics to bring themselves in alignment with their friend it is called 
social influence. 


12.4.4 Process Measures 


We will be less precise for process measures which are more like utility or perfor- 
mance functions. Modelers often attempt to design process measures (utility functions) 
in models that are simply well-behaved. That is, they are generally quantitative and 
consistent. However, when we have human preference involved even these basic proper- 
ties are difficult to require because human-based preferences are not easily represented 
by rationality-based utility function measures. Other process measures relate to the 
flow of something (physical, informational, cognitive, emotional) through the network. 


12.4.5 Modeling with Networks 


It should be noted that constructing and calculating effective measures can be dif- 
ficult for modelers. Finding the “right” measure could be called a facet of the art 
of modeling [EK10]. Some commonly used measures have very little value and often 
serve to obscure rather than enlighten. In organizational models, this is referred to 
as measurement inversion. For example, some traditional measures are merely what 
organizations find immediately measurable from already available data, even if they are 
low value and not compatible with the real property that is under investigation. Addi- 
tional modeling ineffectiveness occurs when modelers ignore high value or more precise 
measurements simply because they seem more difficult to calculate. 


12.4.6 Process Dynamics 


Equally important, yet often not considered, are the network processes and their 
measures related to network dynamics. As Barabasi pointed out in 2003, “we must 
move beyond structure and topology and start focusing on the dynamics that take 
place along the links [B03].” Some researchers, especially those modeling resource or 
information systems would classify the process as “flow”. However, these processes are 
often more complex than flow, which connotes direction and continuity, but operates 
on an unchanging underlying structure. 
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12.4.7 Structural Classification 


Two well-known and much studied classes of complex networks are scale-free net- 
works and small-world networks. Both are characterized by similar special structural 
features: inverse power-law degree distributions for the former and short path lengths 
and high clustering coefficients for the latter. These networks provide models for many 
network applications to include the internet, power grids, and airline travel and route 
maps. 

The small-world network is characterized by the properties that the shortest path 
lengths between nodes are proportional to the logarithm of the number of nodes in the 
network and the presence of many near-cliques and highly connected hubs [W99]. For 
a large social network, this results in strangers being connected by only a few links of 
acquaintances. In [W03], Duncan Watts describes his work on small-world networks and 
the application of small-world network theory to sociology. In pop culture, the trivia 
game “Six Degrees of Kevin Bacon” is based on the small-world network concept. In this 
game, movie buffs challenge each other to find the shortest path between an arbitrary 
actor and Hollywood actor Kevin Bacon through a network where vertices are actors 
and edges between two actors are present when they acted together in a movie. A similar 
phenomenon exists in mathematics, where research mathematicians try to connect each 
other to prolific research mathematician Paul Erddés through a co-authorship network. 

The defining property of a scale-free network is its degree distribution following an 
inverse power law or Pareto distribution: 


F(k) ~k-? 


where F'(k) is the fraction of nodes with k connections and the parameter p is often de- 
termined to be between 2 and 3 [BA99]. Modelers have conjectured that these networks 
form through a phenomenon called preferred attachment, where the common phrase 
used to describe this is “the rich get richer.” In this concept, new nodes are more likely 
to attach to already popular or high density nodes and therefore build a heavier tail 
in the statistical distribution of node degrees than expected from random connections. 
Barabasi and Alberts studied this generative model for web pages on the internet. Other 
researchers, such as West and Grigolini [WG11], have used this model for many network 
studies and cultural phenomena. 


12.4.8 Future Directions 


Network Science has had tremendous growth and development as a research area 
and discipline over the last decade. The NRC study of 2005 provided network science 
a baseline framework, along with attention from research funding agencies [CoNS05]. 
Several factors, including the rising importance of cybersecurity in the National Defense 
Strategy of the United States and other nations may accelerate this trend. 
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Glossary for Chapter 12 


adjacency matrix: given a graph G on n nodes, the n x n matrix with a 1 in entry 
(i, 7) if there is an edge from nodes i to j. 
average distance: for a graph G, the term 


adist(G) = y aa 


P| 
Uu,veF 


where F is the set of pairs of distinct nodes u,v of G with the property that the 
distance d(u,v) between u and v is finite. The directed analogue of this parameter, 
where distance refers to shortest directed paths, is denoted by adista(G). 

b(n): minimum broadcast time of any graph with n vertices. 

b(n, A): minimum broadcast time over all graphs on n vertices with maximum de- 
gree A. 

B(n): the number of edges in an MBG on n vertices. 

backbone network: a two-connected network containing high capacity edges and 
nodes. 

bandwidth: capacity of a communications facility in bits per second (bps). 

betweenness centrality: a measure of the extent to which a given vertex is on a 
shortest path between all pairs of other vertices. 

binomial tree: a recursively defined rooted tree on 2" vertices. A binomial tree on 
2" vertices is formed from two binomial trees on 2*~! vertices by joining their roots 
with an edge. 

Braess’ paradox: the counter-intuitive observation that, if the travelers were behaving 
selfishly it was possible to improve everyone’s travel time by removing roads. 

broadcast graph: a graph on n vertices for which there is a broadcast protocol with 
completion time [logn] for each vertex of the graph. 

broadcast protocol — for originator: a group of message transmissions that deliver a 
message from the originator to all of the other vertices in the graph. 

broadcast protocol — for a graph: a collection of broadcast protocols, one for each 
originator. 

broadcast time — of a broadcast protocol for a graph: maximum completion time for 
any vertex of the graph. 

broadcast time — of a graph: minimum broadcast time for any broadcast protocol for 
the graph. 

broadcasting: sending a single message from a particular vertex (the originator) to all 
of the other vertices in the graph. 

capacitated facility: a facility with limited bandwidth. 

Cayley graph D;: a graph with 2" — 2 vertices labeled with the integers mod 2” — 2. 
There is an edge between two vertices with labels i and j if and only if (¢ + 7) 
(mod 2* — 2) = 2-1 for some 1<@<k-1. 
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Cheeger ratio: given a subset S of nodes in a graph G, the terms 


|E(S, $)| 
h(S) = ——— he = ACS 
(S) vol(S) and = he ee (S) 
vol(s)<vol(G) /2 
where we define the volume vol(S) = >°,-,deg(v), vol(G) = vol(V(G)) and 


E(A, B) denotes the set of edges with one endpoint in A and one endpoint in B. 
closeness centrality: a measure of how close a vertex is to all of the other vertices in 
a connected graph. 
clustering coefficient — of a graph G: the term 


where E(N(v)) denotes the number of edges in the subgraph induced by the neigh- 
borhood of uv in G. (See $12.1.) 

clustering coefficient: a measure of the extent to which the network exhibits transi- 
tivity. (See $12.4.) 

collaboration graph: the graph whose nodes are co-authors and two authors are 
connected by an edge if they have written a joint paper together. 

completion time — of a broadcast protocol: the time at which the last vertex of a 
graph is informed of the message being broadcast. 

completion time — of a gossip protocol: the time at which the last vertex of a graph 
is informed of all of the messages. 

complex network model: a stochastic or deterministic model which generates graphs 
which simulate one or more properties of complex networks. Examples include the 
preferential attachment and copying models. 

compound - of G into ZH relative to a set S C V(G): the graph obtained by replacing 
each vertex x of H by a graph G, isomorphic to G and adding a matching between 
two sets S, and S, if a and y are adjacent in H. 

cosine similarity: a measure of commonality between neighbors. 

degree centrality: the degree of a vertex. 

de Bruijn graph UBa, with degree 2d and diameter k: a graph with d® ver- 
tices with labels that are strings of length k over the alphabet {0,1,...,d— 1}. 
There is an edge between vertex 21, %2,...,@, and each vertex %9,%3,...,%%,A and 
A,%1,02,-.-,U~-1 with A € {0,1,...,d—1}. 

degree matrix: given a graph G, the diagonal matrix with the degree of node 2 in the 
(i, 7) entry. 

diversification: splitting of flow between an origin and destination so that no more 
than a pre-specified fraction flows on any one edge or node (except the origin or the 
destination node). 

eigenvector centrality: a centrality measure based on eigenvectors of a graph’s adja- 
cency matrix. 

facility: a transmission medium (e.g., fibre-optic or wireless) installed on an edge for 
sending data, voice or video signals. 

G(n): the number of edges in an MGG on n vertices. 

gossip protocol: a group of message transmissions that complete gossiping. 

gossip time g(G) — of a graph G: the minimum completion time of any gossip protocol 
for G. 
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gossiping: sending a unique message from each vertex to all of the other vertices in 
the graph. 

idle: a vertex is idle at a given time in a broadcast protocol if it has learned the message 
but does not communicate with any of its neighbors at that time. 

informed by a broadcast protocol: a vertex is informed at any given time in a 
broadcast protocol if it has received the message being broadcast. 

informed by a gossip protocol: a vertex is informed at any given time in a gossip 
protocol if it has learned all of the messages. 

integrality gap: for the minimization case, the difference between the optimal ob- 
jective function value of an integer program and its linear programming relaxation 
expressed as a fraction of the optimal objective function value of the linear program- 
ming relaxation. 

index of mutuality: a measure of the extent to which reciprocated arcs appear in a 
directed graph. 

k-core: a way to specify a “near-clique” subgraph. 

k-dimensional binary de Bruijn graph U/B;: a graph with 2" vertices, each labeled 
with a different binary string of length k. Each vertex 71x%2---x, is connected to 
oLZ++*LpL1, LyX, ++ Lp_1, LoLZ+++ T_T, and LET +++ TR_1- 

k-dimensional binary hypercube H;: a graph with 2" vertices, each labeled with 
a different binary string of length k. There is an edge between two vertices if and 
only if their labels differ in exactly one bit position. 

k-dimensional binary shuffle-exchange graph S€;: a graph with 2" vertices, each 
labeled with a different binary string of length k. Each vertex 7122 --- x, is connected 
tO €o%3°++ LEX, Lp, +++ LR—-1, and © XQ--+- Tp. 

k-dimensional cube-connected cycles graph CCC;: a graph derived from the k- 
dimensional binary hypercube H, by replacing each vertex of H;, by a cycle of length 
k. There are edges between each vertex and its two neighbors on its cycle. There is 
one edge between each pair of cycles that correspond to neighbors in the hypercube. 

k-dimensional (wrapped) butterfly graph BF;,: a graph derived from the k- 
dimensional binary hypercube H, by replacing each vertex of H, by a cycle of 
length k. There are edges between each vertex and its two neighbors on its cycle. 
There are two edges between each pair of cycles that correspond to neighbors in the 
hypercube. 

k-plex: another way to specify a “near-clique” subgraph. 

Knédel graph W,,a: a A-regular bipartite graph with n vertices. The vertices are 
labeled (i, 7), i =1,2,0 <j < }—1. There is an edge between every pair of vertices 
(1,7) and (2,7 +2°—1 (mod #)),0<j <A—1, for eachO<@< A-1. 

linear programming relaxation: the linear program obtained by dropping the inte- 
grality requirement of an integer or a mixed-integer program. 

local access network: a network, usually a tree, for transmitting the data signal from 
the backbone network to the end user. 

Logarithmic Dimension Hypothesis: the conjecture that the dimension of an on- 
line social network is best fit by about logn, where n is the number of users in the 
on-line social network. 

minimum broadcast graph (MBG): a broadcast graph on n vertices with the min- 
imum number of edges over all broadcast graphs on n vertices. 

minimum broadcast tree: a rooted tree on n vertices for which there exists a broad- 
cast protocol for the root as originator with completion time [log n]. 

minimum gossip graph (MGG): a graph for which there is a minimum time gossip 
protocol and which has the minimum number of edges of any such graph. 
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minimum gossip time g(n): the minimum gossip time over all graphs with n vertices. 

modularity: a measure of homophily of a network. 

normalized Laplacian matrix: the matrix £ = J — D~'/2AD7~'/? where A is the 
adjacency matrix and D is the diagonal degree matrix. 

on-line social networks: graphs whose nodes consisting of users on some social net- 
working site such as Facebook, and edges consisting of friendship links between 
them. 

optimal gossip protocol for a graph: a gossip protocol with the minimum comple- 
tion time of any protocol for the graph. 

optimal gossip size f(n): the minimum /f(G) over all graphs with n vertices. 

originator: initial sender of a broadcast message. 

p-cycle: a simple, directed cycle in a directed graph with at at least three arcs — used 
in an approach for designing survivable networks. 

PageRank matrix: for a graph G and teleportation constant a € (0, 1], the matrix 


Py = adn/n+ (1—-a)P 


where J,, is the all 1’s matrix of size n x n and n is the number of vertices in G. 

PageRank vector: the stationary distribution of the PageRank Markov chain. 

price of anarchy: the ratio of the average travel time of the decentralized selfish 
routing versus the coordinated routing respecting the collective welfare. 

power law graph: a graph G on n nodes such that for some range of degrees k, Nz is 
proportional to k~°, for a fixed exponent b > 2, where Nj; is the number of nodes of 
degree k. If G is directed, then we may consider (possibly different) power laws for 
the in- and out-degree distributions by defining NV Hae and N, RS respectively, in the 
obvious way. 

protein-protein interaction networks: graphs whose have nodes consisting of pro- 
teins in a living cell, with two proteins joined if they share some biochemical inter- 
action. 

recursive circulant graph G,,_4 for d > 2: an n vertex graph with vertices labeled 
with the integers mod n and an edge between two vertices with labels 7 and 7 if and 
only if i+d*’ = 7 (mod n) for some 0 < ¢ < [loggn] — 1. 

reservation: reserving spare capacity in a survivable network for re-routing flow in 
case of an edge failure. 

size of a gossip protocol for a graph: the total number of calls used in the gossip 
protocol. 

small world phenomenon: a combination of small distances and clustering. Namely, 
there is a short path joining any two nodes and if two nodes share a common neighbor, 
they are more likely to be adjacent. 

solid 1-cover: a subset of the vertices in a broadcast graph that is a vertex cover of 
that graph, and for each originator not in that subset there is a broadcast protocol 
such that at least one neighbor of the originator is idle at some time during the 
broadcast. 

survivable network: a network in which traffic disrupted by a node or edge failure 
can be rerouted using spare capacity. 

switch: node equipment for routing and processing communication traffic. 

two-level network: a network where edges can have two types of facilities (say, fiber- 
optic or wireless). 

uncapacitated facility: a facility with practically unlimited bandwidth relative to the 
demand. 
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web graph: the graph whose nodes consisting of web pages, and edges corresponding 
to links between them. 
Wiener index: for a graph of G, the term 


W(G) = ; S- d(x, y). 


x,yEeV (Gt) 
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INTRODUCTION 


Chemical graph theory (CGT) is a branch of mathematical chemistry which deals 
with the nontrivial applications of graph theory to solve molecular problems. In general, 
a graph is used to represent a molecule by considering the atoms as the vertices of the 
graph and the molecular bonds as the edges. Then, the main goal of CGT is to use 
algebraic invariants to reduce the topological structure of a molecule to a single number 
which characterizes either the energy of the molecule as a whole or its orbitals, its 
molecular branching, structural fragments, and its electronic structures, among others. 
These graph theoretic invariants are expected to correlate with physical observable 
measures by experiments in a way that theoretical predictions can be used to gain 
chemical insights even for not yet existing molecules. In this brief review we shall 
present a selection of results in some of the most relevant areas of CGT. 


13.1.1 Basic Definitions 


DEFINITIONS 


D1: A molecular graph G = (V,£) is a simple graph having n = |V| nodes and 
m = |E| edges. The nodes v; € V represent non-hydrogen atoms and the edges 
(u;,v;) € E represent covalent bonds between the corresponding atoms. In particular, 
hydrocarbons are formed only by carbon and hydrogen atoms and their molecular graphs 
represent the carbon skeleton of the molecule. 


D2: An alternant conjugated hydrocarbon is a hydrocarbon with alternant mul- 
tiple (double and/or triple) and single bonds, such as the molecular graph is bipartite 
and the edges of the graph represent C= C and =C—C=orC=C and=C-C= 
bonds only. 
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13.1.2 Molecular Energy 


FACTS 


F1: In the Htickel Molecular Orbital (HMO) method for conjugated hydrocarbons the 
energy of the j” molecular orbital of the so-called 7-electrons is related to the graph 


spectra by : 
a — E,(j) 
B 2 
where 4, is an eigenvalue of the adjacency matrix of the hydrogen-depleted graph rep- 


resenting the conjugated hydrocarbon and a, 3 are empirical parameters [CoOlMa78, 
GrGuTr77, Ku06, Ya78}. 


dj = 


F2: The total 7 (molecular) energy is given by 


n 
EL; = ANe + BY? gy; + BE, 
j=l 
where ne is the number of z-electrons in the molecule and g; is the occupation number 
of the j” molecular orbital. 


F3: For neutral conjugated systems in their ground state [Gu05], 


2 >> A; if n is even, 


=1 
f(n) = (m41)/2 
2 > Aj + A(j+1)/2 if n is odd. 
j=l 


REMARKS 


R1: In most of the conjugated molecules studied by HMO n is an even number. In 


n 
such cases, E can be expressed as E = 5° |A,|. 


j=1 


R2: The concept of graph energy is defined for any graph as E = 5° |A,| [Ni07]. In this 
j=1 


case this term is not related to any “physical” energy but the index can be considered 
as a graph-theoretic invariant. 


R3: £6 < 0, then in representing the energy of molecular orbitals ¢; = a+ BA, it is 
assumed that the largest eigenvalue represents the minimum energy, then the second 
largest, and so forth [CoO1Ma78, GrGuTr77, Ku06, Ya78]. 


R4: Because an alternant conjugated hydrocarbon has a bipartite molecular graph: 
vj = —An—j41 for all j = 1, Ds eee TN. 


EXAMPLE 


E1: The molecule of 1,3-butadiene is a conjugated hydrocarbon whose molecular graph 
is the path graph with four nodes Py. The energy of the four molecular orbitals in 1,3- 
butadiene, expressed by the eigenvalues of the adjacency matrix are -1.618, -0.618, 0.618, 
1.618. The total energy of the molecule is -4.472. 
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FACTS 


F4: Let G be a graph with n vertices and m edges. Then [Mc71], 


[2m +n(n—1)(detA)”"/2 < E< V/mn. 
F5: Let G be a graph with m edges. Then, 2\/m < E < 2m. 


F6: Let G be a graph with n vertices. Then, E > 2\/n — 1, where the equality holds 
if G is the star graph with n vertices. 


F7: [KoMo01] E < 2m/n+ \/(n—1)(2m — 4m?/n?) where the equality holds if and 
only if G is Ky, } Ke, or a strongly regular graph with two eigenvalues having absolute 
value 


F8: Let G be a graph with n vertices. Then [KoMo01], 
n 
E< 5 (/n +1), 
where the equality holds if and only if G is a strongly regular graph with parameters 


(n, (n+ Vn) /2, (n+ 2/n)/4, (n + 2Vn)/4). 


F9: Let G be a bipartite graph with n vertices and m edges. Then [KoMo03], 


E <4m/n+ V/(n— 2)(2m — 8m?2/n?). 
F10: For all sufficiently large n, there is a graph G of order n such that [Ni07] 


E> (fn — APY, 


n 
2 


13.1.3. Graph Nullity and Zero-Energy States 


DEFINITION 


D3: The nullity of a (molecular) graph, denoted by 7 = 7(G), is the algebraic multi- 
plicity of the number zero in the spectrum of the adjacency matrix of the (molecular) 
graph. 


REMARKS 


R5: An alternant unsaturated conjugated hydrocarbon with 7 = 0 is predicted to 
have a stable, closed-shell, electron configuration. Otherwise, the respective molecule is 
predicted to have an unstable, open-shell, electron configuration. 


R6: If n is even, then 7 is either zero or it is an even positive integer. 
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EXAMPLE 


E2: The molecule of 1,3-cyclobutadiene is a conjugated hydrocarbon whose molecular 
graph is the cycle graph with four nodes cy. The energy of the four molecular orbitals in 
1,3-butadiene is Ey = a— 2|6|, Ez =a+0|6|, E3 =a+0|6| and Ey = 4(a+2|6|). The 
nullity of this graph is 7 = 2 and the first orbital is occupied by a pair of electrons while 
the two zero-energy states have one electron each. The total z-energy is E, = 4(a—|§}). 


FACTS 
F1l: Let P,, Cy, and K, be the path, cycle and complete graph with n vertices, 
respectively. Then [BoGu09], 
i) n(P,) = 0 if n is even and 7(P,) = 1 if n is odd. 
ii) n(C,) = 2 if n = 0 (mod 4) or zero otherwise. 
iii) n(K1) = 1 and n(Kns1) = 0. 


F12: [CvGu72] Let M = M(G) be the size of the maximum matching of a graph, i.e., 
the maximum number of mutually non-adjacent edges of G. Let T be a tree with n > 1 
vertices. Then, 7(T) =n — 2M. 


F13: [CvGuTr72] Let G be a bipartite graph with n > vertices and no cycle of length 
4s(s = 1,2,...), then n(G) =n — 2M. 

REMARK 

R7: The nullity of benzenoid graphs, which may contain cylces of length 4s, is also 
given by 7(G) =n — 2M [Gu83, FaJoSa05). 

FACTS 


F14: [L050] Let G be a bipartite graph with incidence matrix B, n(G) = n — 2r(B), 
where r(B) is the rank of B. 


F15: [ChLi07] Let G be a graph with n vertices and at least one cycle, 
(G) = n—29g(G)+2 g(G)=0 (mod 4), 
aan 29(G) otherwise 
where g(G) is the girth (length of miminal cycle) of the graph. 
F16: [ChLi07] If there is a path of length d(p, q) between the vertices p and q of G 
n — d(p,q if d(p,q) is even, 
n(@) = 4" Ged) se 
n—d(p,q)—1 otherwise. 
F17: [ChLi07] Let G be a simple connected graph of diameter D 
n—D if D is even, 
n(G) = ; 
n—D-—1_ otherwise. 


F18: {ChLi07] Let G be a simple connected graph on n vertices having K, as a 
subgraph, where 2 < p< n. Then, 


n(G) <n—p. 
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13.1.4 Graph-Based Molecular Descriptors 


DEFINITIONS 


D4: A graph-based molecular descriptor, commonly known as topological index 
(TI), is a graph-theoretic invariant characterizing numerically the topological structure 
of a molecule [DeBa00]. 


D5: The Wiener index of a (molecular) graph is a TI defined by 
i<j 
where d,,; is the shortest-path distance between the vertices i and j [Wi47]. 


D6: The Hosoya index of a (molecular) graph is a TI defined by 


n/2 


H=)S°PG,i) 
1=0 


where P(G,i) is the number of selections of i mutually nonadjacent edges in the graph. 
By definition P(G,0) = 1 and P(G,1) = m [Ho7]]. 


D7: The Zagreb indices of a (molecular) graph are TIs defined by [GuTr72] 


D8: The Randié index of a (molecular) graph is a TI defined by [Ra75] 


rey Oe) 


tjeL 


D9: Let k =0,1,2,3,... be the number of adjacent vertices of degrees 6;,6;,67,... in 
graph G. Then [KiHaMuRa75, KiHa76], the Kier and Hall molecular connectivity 
index is defined as 


eg Se Ctipdiioade 
Ajay. 


where the summation is taken over all subgraphs of size k, and the null term is the sum 
of all the vertex degrees (the total adjacency of G). 


D10: Let s; = >> d;; be the distance sum for the vertez i in a (molecular) graph. 
j=l 
The Balaban indez is a TI defined by 


ae “ -1/2 
J= Crt Dyes) ; 


where C =m—n-+1 is the cyclomatic number of the graph [Ba82]. 
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D11: The atom-bond connectivity index of a (molecular) graph is a TI defined by 
[EsToRoGu98, ?] 


6; +4; —2 


tjeL 


D12: Let G be a connected graph with adjacency matrix A and let D be a diagonal 
matrix of vertex degrees of G. The Laplacian matrix of the graph is defined as 
L=D-A. 


D13: Let G be a connected graph with Laplacian matrix L and let p and q be two 
vertices of G. The resistance distance between p and q is defined by [KIRa93] 


_—J tj _o9f7t 
Ong = Lp, + Lg — 22, 
where Li, is the p, q-entry of the Moore-Penrose pseudo-inverse of the Laplacian matrix. 


D14: The Kirchhoff index of a (molecular) graph is a TI defined by [K]Ra93] 


Kf => 0%; 
<j 
REMARKS 


R8: The Wiener number has been modified to describe the basic topology of infinite 
polymeric macromolecules and named Wiener infinite, Wa. [BoMeKa92]: 


Woes lim an? + bn? +cen+d 


R9: The Randié index has been generalized to [BoErSa99] 
x! = . (5;05)’, 
ijeEL 
and a few mathematical results exist for the different values of t [LiSh08]. 
FACTS 
F19: Let T,, be a tree with n vertices, then [EnJaSn76, BoTr77] 


W(Sn) <W(In) < W(Pn); 
where W(S;,) = (n — 1)? and W(P,) = ("4"). 


F20: Let T,, be a tree with n vertices and let 0 = 1 < po <... < py be the eigenvalues 
of the Laplacian matrix of the tree. Then [Me90, Mo91, DoEnGu01], 


W (En) = 0 (045)* 
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F21: Let H;, be a hexagonal chain with k > 1 linearly fused hexagons, then [ShLa97] 


1 
W (Hx) = 3 (16K + 36k? + 26k + 3). 
F22: Let T be a tree on n vertices. Let for an edge e = (x,y) define ni(e) = 


l{ulveV(T), d(v, a|T) < d(v,y|T) and no(e) = fulveV(T),d(v, ylT) < d(v,z|T). Then 
[Wid7, GuPo86, DoEnGu0l], 


REMARK 


R10: This is the manner in which Wiener introduced his index in 1947. 


FACTS 


F23: Let T be a tree. Let the bipartite sets of its vertices be of cardinality |V,| and 
|Vb|. Then [BoGuPo087], W(T) is odd if and only if both |V4| and |V,| are odd. If |V,| 
or/and |Vj| is even, W(G) is even. 


F24: Let m > 2. Let T;,T7>,...,Tjm be trees with disjoint vertex sets and orders 


N1,N2,...;Mm. Leti = 1,2,...,m,wieV(T;). Let T be a tree on n > 3 vertices, obtained 
by joining a new vertex u to each of the vertices w1,W2,...,Wm-. Then [CaRoRo85, 
DoEnGu01], 


[W (Ti) + (n — mi)d(wi|Ti) — nj] + n(n - 1) 


a 
os 
l 


F25: Let T be a tree on n vertices. Let v and u be vertices on a pendant edge. Then 
[DoGu94], 


W(T) = =[n2(n- 1) - yo) — auT))?). 
4 
(u,v)eE(T) 


F26: Let T be a tree on n vertices. Let deg(v) be the degree of vertex v. Then 
[KIMiPITr92, DoGu94, Gu94], 


1 
W(T) = Z In n(n — 1) +S deg(v \T)] 
veV (T) 
F27: Let T be a tree on n vertices and u branching points. Then [DoGr77], 
n+1 
wir) = @ )- SOS sn, 
WU l<i<j<k<m 


F28: Let T be a tree on n vertices and let L(T) be its line graph. Then [Bu81], 
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F29: Let W. be the Wiener infinite index, N, and C, the number of atoms and cycles 
in the monomeric unit, and d the distance between two neighboring monomeric units 
in the polymer graph. Then [BaBaBo01], 


d 


Vea. 
3(Ni + C1) 


F30: Let N, ie. and W be the number of atoms of a polymer whose macromolecule 
contains no atomic rings, the mean-square radius of gyration of the polymer, and the 
Wiener number of the polymer graph. Let also b be the length of the covalent bond 
connecting two monomeric units, let c be the number of polymer chains in a unit volume, 
and let € be the friction coefficient. Then [BoMaDe02], 


F31: Let g be the Zimm-—Stockmayer branching ratio of a branched macromolecule 
containing no atomic rings. Let also W, Wiin, and Rj, R2 j;, be the Wiener indices and 
the mean-square radius of gyration of the branched and linear polymer graph with the 


same molecular weight. Then [BoMaDe02], 


Ee ieee 


Wlin 


F32: Let subgraphs G; cover upon a vertex u’. Let also d(ueG) and d(ujeG;) be the 
distance numbers of the common vertex u in graph G and its i‘” component G;. Then 
[PoBo86], 


W(G) = » W(G;) + nd(u € G) — doris eG). 


F33: Let I be the number of isomorphic components G’, which cover to form graph 
HT, and let each of the G’s have n’ vertices. Let also W(G’), W(H), and d(u|G’) be 
the Wiener number of G’ and H, and the distance number of vertex u in G’. Then 
[PoBo90], 


W(H) = I.W(G’) + (n' — 1).1(1-1).d(ul@’). 


F34: Let graphs G, and G2 have n; and ng vertices, and let the graphs be linked by 
a bridge {uv}. Then, 
W (Hf) = W(G,) + W (G2) + nynNs + ngd(ulG,) + n d(v|G2). 


F35: Let an edge {uv} be divided by an inserted vertex x. Let also the total distance 
of vertex x in the graph H obtained by d(xH), and the number of geodesics containing 
vertex s, which are enlarged due to the division of the edge be b(s). Then, 


W(H) = W(G) + d(2|H) + [D> 0(s)]/2. 
seG 


F36: Let the edge considered in Fact F35 be a bridge. Let also the number of vertices 
in the two subgraphs G; and G2 be n; and ng, and let weG, and veGz. Then, 


W(H) = W(G) + ning + [d(ulG) + d(v|G) + ni + ng]2. 
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F37: Let a subgraph of n; vertices be transferred from a terminal vertex u to another 
terminal vertex v. Let also the distance numbers of u and v be denoted by d(u|G) and 
d(v|G). Then [PoBo86], 

AW = n,[d(u|G) — d(v|G)). 
F38: Let a subgraph be transferred from a terminal vertex u to another terminal 
vertex uv. Let also the length of the shortest path uv be L, the position of the branches 


located between u and v be 7, and the number of vertices in these intermediate branches 
i, located symmetrically with respect to u and v, be ny; and n,,;. Then [PoBo90], 


AW = S- if(L — 24) (nui — nv,))- 
F39: Let T,, be a tree with n vertices and let F,, be the n“” Fibonacci number. Then, 


We Ae A 


where the lower bound is obtained for $,, and the upper bound is obtained for P, 
[Gu77]. 


F40: Let G be a graph with k components Gj, G2,..., 9%. Then [GuPo86], 


F41: Let G be a graph, let pqeE be an edge, and p € V be a vertex of G [GuPo86]. 
Then, 


i) Z(G) = Z(G — pq) + Z(G — {p, q}). 
ii) Z(G) =Z(G—p)+ S) Z(G — {p,q}). 


p.qek 
F42: Let G be a graph and let pgeH. Then [WaYeYal0], 
Z(G) > Z(G — pq). 


F43: Let G be a graph with |P;| paths of length 7, |P;| =m, and |C3| triangles. Then 
[BrKeMeRu05], 


i) M, = 2m+ 2|Po|, 
ii) Mz=m-+ 2| P| + | P3| + 3|C3|. 
F44: Let G be a connected graph with n vertices and m edges. Then [De98], 
2 
My sm (tna), 
n—-1 


with equality if and only if the graph is S, or Ky. 
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F45: [DaGu04] Let G be a graph with n vertices. Then 
1 
O<My< grr ay 


where the upper bound is obtained for the complete graph and the lower one for the 
empty graph. 


F46: [DaGu04] Let G be a connected graph with n vertices, m edges, and minimum 
degree O;nin- Then 


Mp2 < 2m? — (n—1)Mb min + 5 (min 1)m Ge +n 2) : 
with equality if and only if the graph is S, or Ky. 
F47: The Randié index is bounded as [CaGuHaPa03, LiSh08] 
Vn-1<x< 5 


where the lower bound is reached for the star S;,, and the upper bound is attained for 
any regular network with n nodes indistinct of its degree. 


F48: Let T,, be a chemical tree (Omin < 4) with n vertices and n; > 3 pendant vertices. 


Then, 
n 1 1 7 
Tn < | t ; 
Tn) <3 m(4 -. *) 


with equality if and only if the tree is T(3, 2) [HaMe03}. 


yh 
F49: [Es10] Let k = [671 67/7... 6:17]. Then 


x= s(n —k"Lk). 


F50: Let G be a connected graph with n vertices, m edges and let A; be the largest 
eigenvalue of the adjacency matrix of G. Then [FaMaSa93, CaHa04], 


i) d= 


lS 


¢3 


ii) x +A1 > 2V/n—1 (n > 3), 
iii) y.A1 > n—1(n > 3). 
F51: Let G be a connected graph with n vertices. Then [DoGu10] 
I(Pn) < I(G) < J(Kn), 


where 
n-1 ‘4 : : 2 : 
7 2.490 Jett GHD. Gath 
J(Pn) = (n _ 1) D_(si8i+1) / » SE 9 T 2 
and 


n?(n—1) 


IES) see tae) 
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F52: [Dal0] Let G be a connected graph with m edges and let dmaz be the maximum 
vertex degree. Then, Mon IS 
2 max 1 
ABC > si 


7 ee (Vomaz + V2) 


when equality is attained for the path graph with n vertices. 


F53: [ChGull, DaGuFull] Among graphs with n vertices the complete graph has the 
greatest ABC index and this maximal-ABC graph is unique. 


F54: [ChGull, DaGuFull] The smallest ABC index for a connected graph with n 
vertices must be a tree and this minimal-ABC tree need not be unique. 


F55: + [FuGrVu09] Among trees with n vertices, the star has the greatest ABC index 
and this maximal-ABC tree is unique. 


REMARK 


R11: The trees with vertices for which the ABC index is minimum are not known. 
FACTS 


F56: Let L(G — u) be the matrix resulting from removing the u‘” row and column 


of the Laplacian and let L(G — u — v) be the matrix resulting from removing both 
the u’” and v*” rows and columns of L. The resistance distance can be calculated as 
[BaGuXi03]: 


F57: Let U;,(u) be the u” entry of the k’” orthonormal eigenvector associated to the 
Laplacian eigenvalue yz,, which has been ordered as 0 = py < po <... < py. Then 
[XiGu03], 


F58: Let the resistance matrix { be the matrix containing the resistance distance 
between every pair of vertices in a graph. Then [GoBoSa08], 


O = |1)diag{[L + (1/n)J]~1}? + diag[L + (1/n)J]~“!(1| — 2(L + (1/n)J)~# 
where J = |1)(1| is an all-ones matrix. 


F59: Let G be a connected graph with n vertices; the Kirchhoff index is given by 


K f(G) =nTr i Ge 7 “11") dt, 


0 
where 1 is an all-ones column vector [GoBoSa08]. 


F60: Let G be a connected graph with n > 3 vertices, m edges, and let dmaz be the 
maximum vertex degree. Then [ZhTr08], 


n n(n — 2)? 
> : 
BG) est aE Omax a 2m — il <4 Omax 
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F61: Let G be a connected graph with n > 2 vertices, m edges, and let dmin and Omax 
be the minimum and maximum vertex degree, respectively. Let 0 = fy < po <... < tn 
be the eigenvalues of the Laplacian matrix. Then [ZhTr09], 


with equalities at both sides if and only if it is regular. 


F62: Let G be a connected bipartite graph with n > 2 vertices, and let dmaz be the 
maximum vertex degree. Then [ZhTr09], 


Kf(@)> n(2n — 3) 


| 
Omas 


with equality if and only if Gis Kz 2. 


2. 


13.1.5 Walk-Based Molecular Parameters 


DEFINITIONS 


D15: A walk of length k& is a sequence of (not necessarily distinct) nodes vp, v1, ..., 
UVp—1, Up Such that for each 7 = 1,2,...,k there is a link from vj;_; to v;. A walk is a 
closed walk if vo = vz. The number of edges in the walk is called the length of the 
walk. 


D16: The vector w = [i1,H2,...,Ux], where pz; is the number of closed walks of 
length j or j*” spectral moment of the adjacency matrix in the graph and k < ov, 
represents a molecular descriptor, such as a molecular property A can be expressed as 


A= oy bjuj +a, 
J 
where b; and a are empirical coefficients |GuTr72, JiTaHo84, Boki92, Es08b). 


REMARK 


R12: Every ju; can be expressed in terms of subgraphs, which allows us to express a 
molecular property as a combination of fragmental molecular contributions. 


DEFINITIONS 


D17: The weighted sum of all closed walks starting at a given node represents 
an atomic descriptor, subgraph centrality, for the corresponding atom in a molecule 
[Es00, EsRo05], 


EE» = ‘s (A*) op _ Ce). 


k! 
k=0 
where e4 is a matrix function that can be defined using the following Taylor series: 
A? A Ak 
eA =I+At+ 4+ 4...4 54... 


2! 3! k! 
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D18: The sum of subgraph centralities of all atoms in a molecule is a molecular 
descriptor called the Estrada index of the graph [Es00, EsRo05, DeGuRa07], 


EE(G) = 5) EE. 
p=l1 


D19: The subgraph centrality and Estrada index have the following spectral 
representations [Es00, EsRo05, DeGuRa07]: 


EE, = ) lej(p)e™, (13.1.1) 
EE(G) = sei: (13.1.2) 


REMARK 


R13: The Estrada index of a molecular graph in which every edge is weighted by the 
parameter 6 = (kT)~!, where T is the temperature and k is the Boltzmann constant, 


n 
represents the electronic partition function of a molecule as defined by Z. = >> e Be; 
j=l 


[EsHa07]. 


DEFINITIONS 


D20: The probability that the system is found in a particular state can be obtained 
by considering a Maxwell—Boltzmann distribution [EsHa07]: 


ePAj ers 
Pi yes ~ EEG, 8)’ 


Jj 


D21: The enthalpy H(G) and Helmholtz free energy F(G) of the graph are, 
respectively, [EsHa07] 


j=l 
F(G,8) = 6 'MEE. (13.1.4) 


FACTS 


F63: [EsHil0] The Estrada index can be obtained as EE = tr(e®“), where tr is the 
trace and oo Ak 


~ Lak 
k=0 


exp(A) 


F64: [DeGuRa07] The Estrada index of a network G of size n is bounded as 


-1 
n< EE(G) <e™14+"—, 
where the lower bound is obtained for the graph having n nodes and no links and the 


upper bound is attained for the complete graph K,,. 
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F65: Let T,, be a tree with n vertices, then [De09, DeRaGu09] 


EE(s8n) > EE(TIn) > EE(Pn), 


where FE(S,) = n— 2+ 2cosh(/n— 1), and EE(Pn) = D> e208 @rt/(+), 


r—1 


F66: [DeRaGu09] Let G be a graph with n vertices and m edges. Then 


Vn? +4m < EE(Gn) <n—-1+eV™., 


F67: [BeBol0] Let G be a graph with n vertices. Let 6; be the degree of the j‘” vertex 
and let a,b « R be such that the spectrum of A is contained in [a,b]. Then, 


6; 65 


n J n J 
b2e% +6,e° azea +5;e7% 
S J me ER < Se de 
b2 4 < = 
j=l = 


245. 
a? + 0; 


F68: [BeBol0] Let G be a graph with n vertices. Let 6; be the degree of the j‘” vertex 
and let a=1—nandb=n-—1. Then 


(n — 1)2e==1 + el-” <EE< n—-1n—1+e" 


n—-1 e n-2 


F69: [EjFiLuZo07] Let G be a regular graph with n nodes of degree d= q+ 1. Then, 


i. ps W4q— 8? 
a e 


EE(G, 8) = 
( B) n On ae (q + 1)? az s2 


ine i) 
ae oe RIG) f2 Eau) (2V 48) ; 
y k=l 


where ¥ runs over all (oriented) primitive geodesics in the network, I(7) is the length of 
y, and I,(z) is the Bessel function of the first kind 


= (z/ayn tr 
LO, Gea 


r=0 
F70: [EsHa07] The electronic parameters are bounded as: 
i) 0< S(G,) < Blnn, 
ii) —8(n— 1) < H(G,8) <0, 
iii) —B(n—1) < F(G, 8) < —#1nn. 


The lower bounds are obtained for the complete graph as n > oo and the upper bounds 
are reached for the null graph with n nodes. 
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13.1.6 Vibrational Analysis of Graphs 


DEFINITIONS 


D22: A ball-spring graph is a graph in which every node is a ball of mass m and 
every link is a spring with the spring constant mw? connecting two balls. The ball- 
spring graph is submerged into a thermal bath at the temperature T, such that the 
balls oscillate under thermal disturbances. 


D23: The coordinates chosen to describe a configuration of the system are x;,i = 
1,2,...,n, each of which indicates the fluctuation of the ball z from its equilibrium 


point xz; = 0. 


D24: The ball-spring graphs are described by any of the following Hamiltonians: 


2 2 2 
_ D; Kmw » mu 


where p; is the momentum of the node i, K is a constant satisfying K > maz;k;, and 
k, is the degree of the node i [EsHaBel2]. 


D25: A classical vibrational scenario is one in which the momenta p; and the 
coordinates x; are independent variables. A quantum vibrational scenario is one in 
which the momenta p; and the coordinates x; are not independent variables but they 
are operators that satisfy the commutation relation [x;,p;] = ihd;;, where 1 = /—1,h 
is the Dirac constant, and 6;; is the Dirac delta function [EsHaBel12]. 


FACT 


F71: The mean displacement of node 7 in the classical vibrational scenario is given by 
any of the following expressions depending on the Hamiltonian selected [EsHaBel2]: 


Ax; = (x?) = aKa [( - Api les 


where L' is the Moore-Penrose generalized inverse of the Laplacian. 


REMARK 


R14: By obviating the physical constants the mean atomic displacements in the clas- 
sical picture are given by the diagonal entries of the resolvent of the adjacency matrix 
or of the pseudoinverse of the Laplacian, respectively. The last expression was also 
investigated in [BaAtEr97, EsHa07]. 
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FACT 


F72: The mean displacement of node 7 in the quantum vibrational scenario is given 
by any of the following expressions depending on the Hamiltonian selected [EsHaBel12]: 


Bhus* 
x (x?) tim, (exo | 50 L ; 
pq 
. Bhw? 
= 14 din, Crp OagexP Pa 
where jig is the second eigenvalue of the Laplacian matrix and 0 = ,/K/mw. 


REMARK 


R15: The displacement correlation between a pair of nodes (x;x;) is given by the 
(i, j)-entry of the corresponding matrix [EsHaBel2]. 


FACTS 


F73: The resistance distance between a pair of nodes in a graph can be expressed in 
terms of the node displacements due to small vibrations/oscillations as follows [EsHa07]: 


M3 = [(Aai)? + (Avy)? — (wins) — (ajai)] = (ai — 2)”)- 


F74: The sum of resistance distances for a given node in a graph, 


R= 
j 


T t T 
(L;, + L,; -— 2L;;), 
=1 
is related to the node displacements as [EsHa07] 
R; = n(Aai)? + ) (Aa)? =n (Az)? + (Azy]. 
i=1 
F75: The potential energy of the vibrations in a graph is given by [EsHa07] 


(VB) = 2 Ri — (Ri + Ry — 20%) 
i=1 


i,jeE 
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INTRODUCTION 


The last decades brought us a new scientific area of computational biology, which 
is placed at the junction of biology (especially molecular biology), computer science 
and mathematics. Its aim is to solve real-world problems arising in biology with the 
use of mathematical models and methods, and tools from computer science. Molecular 
biology, due to its rapid progress, yields more and more experimental data, possible to 
be processed on computers only. Efficient processing and advisable analysis must be 
accompanied by well suited models and methods; the ones coming from graph theory 
frequently appeared to be most useful. Here, the most interesting and breakthrough 
approaches of computational biology tied with graph theory are characterized. 


13.2.1 Biological Primer 


Except for the very first moments, where collected data were small enough for non- 
automated treatment, processing and analysis of biological data were done on computers 
with the use of algorithms. Huge amounts of data produced by a new generation of 
specialized laboratory equipment need efficient computational approaches. Many of 
them refer to known models from graph theory. In the following subsections these 
issues are discussed. Before that, let us introduce basic notions of molecular biology 
used in this section. 

Deoxyribonucleic acid (DNA for short) carries genetic information and exists in the 
form of a double helix, i.e., two twisted chains/strands of nucleotides joined together. 
Each nucleotide is composed of a nitrogenous base, a saccharide (deoxyribose) and a 
phosphoric acid. Nucleotides differ only in their nitrogenous bases. There are four bases: 
adenine (A), cytosine (C), guanine (G), and thymine (T). Their order in a DNA chain 
codes genetic information, symbolically written as a sequence of the letters A, C, G, T. 
Bases from the two DNA strands are joined by double or triple hydrogen bonds. Against 
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adenine always thymine stands, against cytosine — guanine. This property was named 
the complementarity. The DNA strands are complementary, i.e., knowing a sequence of 
bases from a fragment of one strand, one can always determine a sequence of bases from 
the corresponding fragment of the other strand. A length of a DNA chain is expressed 
in nucleotides or in bases (in base pairs when we have double-stranded DNA). The 
length of a human genome is about 3-10° base pairs. A short fragment of a DNA chain 
(formerly up to 20 nucleotides, now understood as of length smaller than 100) is called 
an oligonucleotide. 


A transcription factor (TF) is a protein that binds to specific short DNA sequences 
(typically 5-15 bp long), thereby controlling the transcription of genetic information 
from DNA to mRNA [La97]. TFs are the largest set of regulatory proteins in mam- 
malian cells, covering about 10% of all known proteins of mammals, including humans 
[KuSiJe10]. In a mammalian genome the number of binding sites for a given TF could 
be on the order of hundreds or thousands. A TF attaches to one or more binding 
sites on the genome and helps initiate a gene transcription program—see Figure 13.2.1 
for a cartoon view. As such, TFs are vital for many important cellular processes. 
A fundamental problem, which we will discuss in Section 13.2.3, is the characterization 
and the extraction from data of the DNA binding specificities of a given transcription 


factor. 
transcription 
factor RNA 
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binding site 


Figure 13.2.1: A cartoon view of gene transcription regulation: A transcription fac- 
tor attaches to a binding site on the DNA and helps initiate a gene 
transcription program. 


DNA sequencing, which is the process of determining a sequence of bases of a DNA 
fragment, is the first stage of discovering genetic information. This process is based 
on the property of complementarity of DNA duplexes. Later, the read DNA fragments 
are combined into the whole chromosome, or genome, by assembling and mapping al- 
gorithms (Figure 13.2.2). Nowadays, next-generation sequencing approaches produce, 
at relatively low cost, huge volumes of data composed of reads, being recognized short 
DNA or RNA fragments. For these approaches, the step of mapping often is not nec- 
essary. One of the former approaches, sequencing by hybridization (SBH), became the 
inspiration for defining a series of graph classes (characterized in the next subsection); 
thus we pay more attention to it here. What is more, the algorithmic part of SBH is 
basically very similar to current algorithmic approaches solving the assembling problem. 


The aim of the SBH experiment in its standard version is to detect all oligonu- 
cleotides of a given length | (usually 8-12 nucleotides) composing a part of a DNA 
chain of a known length n (a few hundreds of nucleotides). For this purpose, the 
oligonucleotide library is generated, which consists of all possible single-stranded DNA 
fragments of length | (i.e., their number is 4!). Next, the library is exposed to the 
reaction of hybridization with many copies of the studied DNA. In order to operate on 
that great number of molecules, microarray technology is used (Figure 13.2.3). 
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Figure 13.2.2: 


Discovering genetic information as a three-stage process. DNA fragments 


read at the lowest level, in SBH or next-generation sequencing exper- 
iments, become input data for the assembling stage. At the mapping 
stage the assembled sequences are ordered within a chromosome or a 


genome. 
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Sequencing by hybridization involving microarray technology. 


(A) A microarray containing a complete library of oligonucleotides 
of length 4. Every oligonucleotide (in many copies) is immobilized 
to the microarray at known coordinates. (B) During the reaction of 
hybridization, clones of the studied DNA join oligonucleotides being 
(reversely) complementary to their subchains. Due to fluorescent la- 
beling of the clones, the image of the microarray gives the information 


about the spectrum contents. 


During the hybridization process, oligonucleotides from the library join the DNA 
chains containing subchains complementary to them (to form a duplex). As a result, one 
can select oligonucleotides composing the studied DNA, and the oligonucleotides written 
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as words of equal length over the alphabet {A,C,G,T} make a set called a spectrum (for 
details see, e.g., [BaSm88, LyFIKht88, DrLaBr*89]). The computational phase of the 
sequencing process consists in the reconstruction of an original DNA sequence on the 
basis of the spectrum. 

The hybridization experiment executed without any errors provides an ideal spec- 
trum, that is, containing only all substrings of length / of the original sequence of the 
known length n (see Example E1). However, experiments usually end with several errors 
in spectra. There are two general types of errors: negative ones, i.e., oligonucleotides 
missing in the spectrum (the words which are parts of the original sequence but are not 
present in the set), and positive ones, which are erroneous oligonucleotides (the words 
which are not parts of the original sequence but are present in the set). In the literature 
these errors are alternatively named false negatives and false positives, respectively. 


EXAMPLE 


E1: Suppose the original sequence to be found is CAGTCAGAGTA, n = 11. In the 
hybridization experiment one can use, for example, the complete library of oligonu- 
cleotides of length | = 4, composed of the following 44 = 256 oligonucleotides: {AAAA, 
AAAC, AAAG, AAAT, AACA, ..., TTTG, TTTT}. As a result of the experiment per- 
formed without errors one obtains the ideal spectrum for this sequence, containing all 
four-letter substrings of the original sequence: {AGAG, AGTA, AGTC, CAGA, CAGT, 
GAGT, GTCA, TCAG}. The reconstruction of the sequence consists in finding such 
an order of the spectrum elements, where each pair of neighboring elements overlaps on 
1—1 = 3 letters (i.e., for every pair the suffix of | — 1 letters of the predecessor is the 
same as the prefix of the successor). Two possible solutions for this instance are shown 
in Figure 13.2.4. 


CAGT CAGA 
AGTC AGAG 
GTCA GAGT 
TCAG AGTC 
CAGA GTCA 
AGAG TCAG 
GAGT CAGT 
AGTA AGTA 
CAGTCAGAGTA CAGAGTCAGTA 


Figure 13.2.4: Two possible solutions of the sequencing problem for the ideal spectrum 
{AGAG, AGTA, AGTC, CAGA, CAGT, GAGT, GTCA, TCAG}. 


In the standard approach to SBH, the complete library contains oligonucleotides 
of the same length but of compositions differing in the ratio of C/G nucleotides to 
A/T ones. However, DNA duplexes of C/G rich oligonucleotides are more stable than 
those containing more A/T nucleotides, and the hybridization conditions (e.g., tem- 
perature) should differ for these two groups. The constant temperature kept during 
the hybridization reaction (i.e., not taking into account C/G content) causes a number 
of hybridization errors in the resulting spectra. In the earliest studies a simple equa- 
tion was used to calculate melting temperatures of oligonucleotide duplexes, assuming 
4 degrees for C/G pairs and 2 degrees for A/T pairs. Although simple, this rule gives 
a useful estimation (for more advanced rules see, e.g., [XiSaBut98]). Therefore, to 
equalize melting temperatures of all oligonucleotides in the library, the ones composed 
mainly of A/T nucleotides should be longer than the ones composed mainly of C/G. 
This fact justifies a new approach to DNA sequencing by hybridization using isother- 
mic oligonucleotide libraries [BIFoKa*04]. Such a library is composed of all possible 
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oligonucleotides of a constant (for the library) melting temperature of oligonucleotide 
duplexes, but of different lengths. The hybridization with isothermic libraries should 
result in spectra with a lower number of experimental errors, and then the computa- 
tional phase of the DNA sequencing would be much more effective. But, to ensure that 
every possible DNA sequence is covered by such a library, two isothermic oligonucleotide 
libraries differing by 2 degrees must be used. 

We refer the interested reader to [Wa95, Pe00] for a broader treatment of algorithmic 
and graph-theory issues in computational biology. 


13.2.2 Graphs in Sequencing and Mapping 


Computational biology, where molecular biology and computer science meet, gave a 
fresh view on problems of both sides due to its interdisciplinary nature. The biological 
side was supplied by models and methods from computer science, which allow us to 
solve biological problems efficiently. Many of them were based on graph theory. The 
computer science side also gained a lot from this junction; among others, new classes of 
biologically inspired graphs useful for problem modeling were defined and analyzed. 

Graph theory appeared to have wide application in modeling biological problems, 
following the combinatorial nature of nucleic acids. Especially the recognition of one- 
dimensional DNA and RNA structures well fits this kind of model. Small DNA/RNA 
particles are then represented as vertices in a directed graph, overlap of strings corre- 
sponding to their nucleotide chains is expressed by arcs, and a solution path corresponds 
to a resulting sequence of nucleotides. 


Interval Graphs in DNA Mapping 


Historically the first connection between graph theory and molecular biology happened 
in 1959, when Benzer proposed a method for identifying the topology of genetic in- 
formation in a bacteriophage [Be59]. His experiment led to the detection of its linear 
structure and, as a byproduct, to the invention of interval graphs. 


DEFINITION 


D1: Interval graphs represent the relation of interval overlapping in a linear space. 
Two vertices, corresponding to intervals, are joined by an edge in an undirected graph 
if the intervals intersect. 


FACTS 


F1: Ifa graph belongs to the class of interval graphs, the problem of verifying its 
Hamiltonicity becomes polynomially solvable [Ke85]. 


F2: Combinatorial problems, which are in general NP-complete but easy in the case 
of interval graphs, include vertex coloring, independent set, and clique problem. 
REMARK 


R1: Since 1959 interval graphs were used in modeling problems from different areas, 
where a linear arrangement is looked for, for example, in computational biology for 
mapping by hybridization to unique probes. 


The problem of mapping by hybridization aims at ordering long DNA fragments without 
referring to their nucleotide sequences, unlike the sequencing or assembling stages. 
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FACT 


F3: In the process of mapping, the hybridization experiment provides a binary matrix 
of information about pairwise hybridization of longer fragments against a set of short 
(possibly unique) probes. In the case of errorless experiment, when all appearances of 
probes within fragments have been detected and there is no false information, the matrix 
possesses the consecutive 1s property in rows, i.e., the columns can be ordered in 
such a way that in every row all 1s are consecutive. 


EXAMPLE 


E2: Figure 13.2.5 visualizes exemplary hybridization data and the resulting interval 
graph. This kind of data is enough to find a solution in polynomial time, which is a 
Hamiltonian path in the graph. 
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Figure 13.2.5: Example for mapping by hybridization to unique probes. 

(A) Result of hybridization has the property, and to find the correct 
order of its columns, one can model this problem with the use of an 
interval graph. (B) Vertices correspond to fragments and edges join 
fragments having at least one probe common. Every Hamiltonian path 
in the graph represents a possible ordering. (C) One of possible order- 
ings of the columns. (D) The solution, which is an order in the set of 
fragments, can be immediately read from the rows of the matrix with 
all 1s consecutive. Two distinguished points correspond to two cells of 
the matrix in bold. 


Adjoint Graphs in DNA Sequencing 


Coming back to DNA sequencing, the story began with two papers of Lysov et al. 
[LyFlKh* 88] and Pevzner [Pe89]. The algorithms presented there had been modeled 
with the use of two fundamental problems of graph theory. The approach by Lysov and 
co-authors refered to the Hamiltonian path problem (Algorithm 13.2.1), while Pevzner 
used the problem of searching for the Eulerian trail (Algorithm 13.2.2). 
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E3: In Figure 13.2.6 graphs modeling the SBH problem in its standard version with 
constant-length oligonucleotide libraries are presented. They are constructed for the 


errorless spectrum from Example El by Algorithms 13.2.1 and 13.2.2. 


Algorithm 13.2.1: Algorithm of Lysov et al. 


Input: an ideal spectrum S' composed of words of length J. 


Build a directed graph G: 
For every 0; € S add vertex v; labeled by 0. 
For every pair of vertices vj and v; 


Add arc (v;, v;). 
Find a Hamiltonian path P in G. 
Return P. 


Output: a Hamiltonian path in G corresponding to an order within S. 


If suffix of length | — 1 of 0; is equal to prefix of length | — 1 of 0; 


Algorithm 13.2.2: Pevzner’s Algorithm 


Input: an ideal spectrum S' composed of words of length J. 
Output: an Eulerian trail in G corresponding to an order within S. 


Build a directed graph G: 
For every 0€ S 


Add arc (v;, v;). 
Find an Eulerian trail P in G. 
Return P. 


Add, if not yet present, vertices v; and v; labeled by prefix of length 
1 —1 of o and suffix of length / — 1 of 0, respectively. 


(B) 
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Figure 13.2.6: (A) The graph from the method of Lysov et al., where every Hamiltonian 
path corresponds to a possible solution. (B) The graph from Pevzner’s 


method with solutions represented by Eulerian trails. 
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REMARKS 


R2: This interesting transformation of the graph, in which the Hamiltonian path 
is looked for (the strongly NP-hard problem in general), to the graph, in which the 
Eulerian trail is searched, which changed the computational complexity of the way the 
solution is produced, was not commented on by Pevzner. 


R3: The class of graphs for which such transformation is possible (labeled graphs) 
was widely examined in [BIHeKo*99, BIFoKat 02]. The graphs built on the base of the 
ideal spectrum according to Lysov’s manner, called DNA graphs, belong to this class 
— they are labeled graphs over the four-letter alphabet. 


R4: The class of Pevzner’s graphs does not belong as a whole to the class of la- 
beled/DNA graphs. This is because there is not a 1-1 correspondence between the 
presence of arcs in such graphs and the overlaps of vertex labels (see Figure 13.2.6B, 
where arc (GAG,AGA) is not present). 


DEFINITIONS 


D2: Directed de Bruijn graphs constitute a class of labeled graphs which are 
complete with respect to the size of the alphabet and the length of labels [Br46]. For 
an alphabet of size a and labels of constant length J, de Bruijn graph B(a,1) has a! 
vertices, every one labeled by a different word over the alphabet. Arcs are present 
between all pairs of vertices u and v satisfying the identity of the (/-1)-length suffix of 
label of u and the (/-1)-length prefix of label of v. 


D3: DNA graphs are vertex-induced subgraphs of directed de Bruijn graphs with 
a=4. Pevzner’s graphs are subgraphs of DNA graphs. 


D4: As defined in [Be73] in the context of directed graphs, the adjoint G = (V, A) 
of a graph H = (U,V) is a 1-graph whose vertices represent arcs of H, and which has 
an arc from « to y if the head of the arc in H corresponding to z is the tail of the arc 
corresponding to y. 


D5: A directed line graph is an adjoint G of a 1-graph H. 


FACTS 


F4: For directed graphs, the Hamiltonian path/cycle problem is easily solvable in an 
adjoint by transforming the adjoint into its original graph and then by searching for an 
Eulerian trail/closed trail within it. The existence of an Eulerian trail/closed trail in 
the original directed graph is a necessary and sufficient condition of the existence of a 
Hamiltonian path/cycle in its adjoint [BIHeKo* 99]. 


F5: The graph constructed by the method of Lysov et al. is the directed line graph of 
the graph by Pevzner for the same spectrum. For that pair of graphs, the two problems 
of looking for a Hamiltonian path and an Eulerian trail are equivalent. 


REMARK 


R5: What is interesting, this transformation does not work for undirected graphs 
[Be81], unlike the statements present in [La76, GaJo79]. See, for example, the undirected 
graph in Figure 13.2.7A, and its line graph in Figure 13.2.7B. The first one does not 
possess any Eulerian trail while the other has a Hamiltonian path. This is why further 
work utilizing this transformation was restricted to directed graphs. 
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(A) (B) 


Figure 13.2.7: The equivalence between the problems of Eulerian trail and Hamiltonian 
path in undirected graphs and their line graphs, respectively, is not 
valid. (A) A graph without any Eulerian trail. (B) Its line graph 
having a Hamiltonian path. 


Quasi-Adjoint Graphs in DNA Sequencing 


The problem of isothermic DNA sequencing by hybridization without errors in the 
spectrum was solved in [BlKa06]. The polynomial-time algorithm presented there used 
a graph model, in which vertices represented oligonucleotides and arcs were added in 
a similar way as in DNA graphs, but with special rules concerning vertices with labels 
being substrings of others. (The substrings are possible for two isothermic libraries 
differing in the melting temperatures by two degrees, which are used together in the 
isothermic approach.) In order to make such graph a directed line graph, some arcs 
must be deleted and some “temporary” arcs must be added. After the transformation 
of this graph into an original graph, a modified algorithm searching for the Eulerian trail 
was applied. This work, which proved that the equivalence of the Hamiltonian path and 
Eulerian trail in the two kinds of graphs holds true also for non-adjoints, became strong 
motivation for looking for a wider class of graphs with this property. In [BlKaLe*08] 
such a class was defined. 


DEFINITIONS 


D6: A directed graph is a quasi-adjoint graph if, for any two vertices x and y, the 
following property holds: 


N*(x)ON*(y) #0 = N*(a)= ) 
Nt(2) CNt(y) V 
N*(y) CN*(@) 


where N*(a) is the set of immediate successors of vertex 2. 


FACTS 


F6: For the class of quasi-adjoint graphs, the Hamiltonian cycle problem (HCP) is 
solvable in polynomial-time [BlKaLet08]. The exact algorithm solving this problem 
used the transformation between graphs. 


F7: The class of quasi-adjoint graphs is a generalization of, among others, adjoints 
and the graphs modeling the problem of isothermic SBH without errors in experimental 
data. 
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REMARKS 


R6: One step toward a further extension of this class was made in [BlKal2], where 
an algorithm for removing some additional superfluous arcs from a graph was exam- 
ined. The algorithm takes as the input any directed 1-graph, and by solving a series 
of perfect matchings over its subgraphs reduces arcs, which are guaranteed to not com- 
pose any Hamiltonian cycle in the graph. The graphs on the output of the algorithm, 
named reduced-by-matching graphs, in the best case become quasi-adjoint graphs, 
but mostly they remain “hard” for the problem. However, even hard, the reduced graph 
becomes an easier instance for some exact or heuristic algorithm solving HCP. It was 
confirmed in tests, which also showed that the algorithm is especially efficient for graphs 
with average outdegree equal to 2 (HCP is NP-hard for such graphs). Those graphs af- 
ter the reduction had the mean outdegree close to 1, what made them easy (on average) 
for HCP algorithms. 


R7: In [BlKal2] a systematization of several classes of digraphs referring to directed 
line graphs was provided, together with the proof of its correctness. In Figure 13.2.8 
this relationship is shown. For most of the classes, polynomial-time algorithms solving 
HCP exist. 


exponential-time 


algorithms for HCP 


polynomial-time 
algorithms for HCP 


quasi-adjoint graphs 


Figure 13.2.8: [BlKal2] The relationship between a few digraph classes with reference 
to HCP solvability. DLG stands for directed line graphs, PDLG for 
partial directed line graphs [ApFr07], and RBM graphs for reduced- 
by-matching graphs. DNA graphs lie within the DLG area. 


Labeled Graphs in DNA Assembling 


Another field of computational biology, DNA sequence assembling, also intensively uses 
models and methods of graph theory. Sequence assembling is the next step, after se- 
quencing, in the reconstruction of a genomic fragment. The output of the sequencing 
process (e.g., SBH) becomes the input for sequence assembling and the latter is somehow 
similar to the former but on a much larger scale. Now the input sequences generally have 
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different lengths, misreadings (insertions, deletions, and substitutions of nucleotides), 
and come from any of the two strands of a fragment of a DNA helix (with no hint 
from which one). Huge amount of erroneous and incomplete data make the problem 
well known for its high complexity. It is strongly NP-hard even in the case of data 
without errors and derived from one DNA strand (compare with the shortest common 
superstring problem [GaJo79]). 


FACTS 


F8: The goal of DNA sequence assembling is to compose input fragments into one 
resulting sequence (or a series of disjoint sequences — contigs) in a proper order. This 
problem can be modeled as a graph theoretic problem, the searching for a Hamiltonian 
path (or a series of paths in disjoint components) in a certain digraph (e.g., [KeMy95]). 


F9: Unlike in sequencing, due to inexact matchings allowed between fragments, such 
graphs do not possess the useful property of being adjoints or quasi-adjoint graphs. 


REMARK 


R8: The inexact matchings are the result of sequencing errors (insertions, deletions, 
substitutions of nucleotides) and researchers sought ways to overcome this disadvantage. 
One successful idea, applied next in many assembling algorithms (e.g., [PeTaWa01]), 
was to decompose the input sequences into series of shorter overlapping constant- 
length oligonucleotides and to build the graph with only exact matchings allowed, as 
in Pevzner’s approach. Erroneous information present in the input data was then ei- 
ther corrected by some heuristic procedures or ignored. Finally, the Eulerian trail was 
searched for. Novel next-generation sequencing approaches, like 454 sequencing, Ilu- 
mina’s one or SOLID, well suit to this model thanks to a small guaranteed experimental 
error rate. 


FACT 


F10: The assembling graphs constructed with the decomposition of input sequences 
are — as Pevzner’s graphs — subgraphs (but not vertex-induced) of de Bruijn graphs 
over a four-letter alphabet. 


With the progress in sequencing technology, the form of the data change and theoretical 
models have to adjust accordingly. Paired-end sequencing protocol has been theoreti- 
cally and experimentally proved to be much more efficient for de novo sequencing than 
standard single-read approach. In paired-end sequencing, instead of a set of single frag- 
ments at the input, one gets a set of paired fragments placed close to each other in the 
original genome (within some predefined approximate distance). Theoretical models 
proposed for this kind of data usually skipped this additional information at the be- 
ginning, built the assembling graphs as defined above and used the pairs just to guide 
the process of searching for a path. They did not lead to a simplification of the graph. 
One of the latest papers brought a new concept of so-called paired de Bruijn graphs 
[MePhCh* 11], a modification of the assembling graphs incorporating the information 
about paired fragments. 
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REMARKS 


R9: Recall that graphs from [MePhCh* 11], similarly to other decomposition-based 
assembling graphs, are not in fact de Bruijn graphs due to the incompleteness of both 
the vertex and arc sets. 


R10: In [MePhCh*11] the input pairs of fragments, distant by a number of nu- 
cleotides called the insert size, are decomposed into pairs of oligonucleotides: “left” 
oligonucleotides in the pairs come from the decomposition of the “left” fragments, and 
the corresponding “right” oligonucleotide lies at the same position as the “left” one but 
within the corresponding “right” fragment. In the graph, arcs correspond to the pairs of 
oligonucleotides and vertices to pairs of their prefixes and suffixes. Such graphs, thanks 
to the more restricted rule of joining arcs, are indeed less tangled than the regular 
decomposition-based assembling graphs. See Example E4. 


EXAMPLE 


E4: For the original sequence TATTTATTACGTACG and for insert size equal to 
1 (a theoretical assumption since in fact the insert sizes differ to a large extent in 
one biological experiment), the paired fragments may be as follows: TATTT+TTACG, 
ATTTA+TACGT, TTTAT+ACGTA, TTATT+CGTAC, TATTA+GTACG. After their 
decomposition into pairs of oligonucleotides of length / = 4, one obtains: TATT+TTAC, 
ATTT+TACG, TTTA+ACGT, TTAT+CGTA, TATT+GTAC, ATTA+TACG. The 
graph from [MePhCh*11] is shown in Figure 13.2.9A. The regular decomposition-based 
assembling graph constructed without the information about pairing is presented in 
Figure 13.2.9B. 


(A) (B) ACG 
e 
TTA eet 
TAT ACG 
TTA ATT 
ATT TAC 
. TAC ATT Pin ay 
2 Se TTT “ON GTA 
( ACG [/ TTT 
TAT, TAT 
GTAX__ eee 
ITA TTA 
Cer 


Figure 13.2.9: Decomposition-based assembling graphs constructed for the sequence 
TATTTATTACGTACG. (A) The graph with paired labels from the 
method of Medvedev et al. (B) The regular decomposition-based as- 
sembling graph. 


We see that there is a number of graph classes inspired by biological problems 
concerning the reconstruction of one-dimensional DNA or RNA structures. What is 
more, these graphs are not only nice theoretical models of real-world problems, but 
— first of all — they led to polynomial-time solutions of these problems. The class 
most known beyond the area of computational biology (and the oldest) is the one of 
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interval graphs. The new class of quasi-adjoint graphs also may become significant for 
the non-biological community, since it enables polynomial-time exact solution of the 
Hamiltonian cycle problem, commonly used in modeling real-world problems. 


13.2.3. Probabilistics and Graphs 


In the previous subsections, we encountered several graphs where vertices correspond 
to short DNA sequences, and edges reflect some notion of vertex similarity (such as 
overlap of the corresponding vertex sequences). Here we will briefly describe a different 
use of graphs in biology, in particular in the context of transcription factor (TF) binding 
(see the introductory section for some background information). Here, the vertices of 
the graph correspond to single positions on the DNA, and the edges of the graph capture 
a certain notion of stochastic dependence between nucleotides at different positions. 


Graphical Models of TF Binding 


The dominant model for TF binding in the literature is the position weight matrix 
(PWM) [DuEdkrt98, St00]. This is a probabilistic model that assumes that all DNA 
positions in a binding site are mutually independent given the binding event (See Fig- 
ure 13.2.10 for a cartoon view). However, it has been recognized that more expressive 
models need to be developed, in particular models that can capture higher-order depen- 
dencies among nucleotides at different positions on the genome [BaBePh* 09, StZh10]. 


Crr E a 
- N Ce) vt 0 oo Urn 


Figure 13.2.10: A ‘logo’ of a PWM model for a binding site of length seven, where 
the height of a letter in some position of the binding site reflects 
the probability under that model that the corresponding nucleotide 
appears in that position. 


DEFINITIONS 


D7: A probabilistic graphical model is a directed or undirected graph, where each 
vertex defines a random variable and the edges define stochastic (in)dependence relations 
between these variables. 


D8: A Markov network is an undirected probabilistic graphical model. 
D9: A Markov tree is a cycle-free Markov network. 


D10: A mixture of trees is a probabilistic model that is a convex combination of a 
(typically small) set of Markov trees [BaEIFrt 03]. 


D11: An ensemble of trees is a probabilistic model that is a mixture over all 
spanning trees of the complete graph formed by all variables [MeJa06]. 
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FACTS 


F11: Ina Markov network, a local Markov property holds among variables: A vari- 
able is conditionally independent of all other variables given its neighbors in the graph 
[KoFr09]. 


F12: The probability of a vector x under a Markov network with strictly positive 
distribution is given by a log-linear function: 


p(x) = 5 exp (326 de(xe)), (13.2.1) 


cEC 


where C is a set of cliques in the graph, and ¢, is a function that maps the subset of 
variables involved in clique c to the real numbers. The model is parametrized by a set 
of parameters 6. for each clique c, and the term Z is a normalizer that ensures that 
p(x) is a proper distribution, that is, >, p(x) = 1. 


F13: The probability of a vector x under a Markov tree model T is given by 


Gj (Gin Bi) ae 
p(x|T;0) = JT 20:98; (0,) Ta zi), (13.2.2) 
egjeT 


where e;; denotes the edge that is incident to the vertices 7 and j in tree TJ’, and the 
parameters 6 = {6;,6;;} encode all single and pairwise marginals of the distribution of 
x under the model. 


F14: The probability of a vector x under a mixture of trees model is given by 
p(x) = 9° v(T) p(xIT; Or), (13.2.3) 
TET 
where 7 is a fixed set of trees, and p(T’) defines a probability distribution over the set 


of trees. 


F15: The probability of a vector x under an ensemble of trees model is given by 


p(x) = >> v(T; B) p(x|T; 4), (13.2.4) 


TES 


where S is the set of all spanning trees in the complete graph formed by the set of 
variables x;, for i = 1,...,d, and the probability of a tree p(T; 8) is given by 


Il 4: (13.2.5) 


(4,j)ET 


for parameters 8 = {(6;; > 0}, for each edge (2,7) of the graph, with 6;; = 6;;. Here, 
Z = Vires H¢.,jer Bij 18 a normalizer that ensures that p(x) sums to one over all x. 


F16: The probability p(x) in an ensemble of trees model (13.2.4), as well as the 
normalizer Z in (13.2.5), can be computed analytically via the weighted matrix tree 
theorem [Mo70]. This theorem states that the weighted sum of all spanning trees of a 
given graph equals any minor of order d — 1 of the weighted graph Laplacian L: 


_ J Big if iAg 
La ={ ier eee: (13.2.6) 


Section 13.2. Ties between Graph Theory and Biology 1573 


REMARKS 


R11: Comparing the ensemble of trees model (13.2.4) with the mixture of trees model 
(13.2.3), we note a few differences. The first is that we are now using a large (super- 
exponential) number of spanning trees (the number of spanning trees of a complete 
graph of order d is d4~?, according to Cayley’s formula). The second is that each 
spanning tree T defines a Markov tree model p(x|T; 9) that is parametrized by the same 
parameters 9; in other words, all trees share the same parameters 0 = {0;,0;;}. The 
last crucial point is that the mixing distribution p(T; 3) is parametrized by an extra set 
of parameters 3, which play a special role in this model. 


R12: In the context of TF binding, the random vector x = (x,...,%q) corresponds 
to a length-d genomic sequence (a TF binding site), a vertex of a Markov network 
corresponds to a position on the DNA site where the given TF binds to, and an edge 
captures stochastic dependencies between different nucleotides at different positions on 
the binding site. 


R13: Markov networks and Markov trees have been used as models of TF binding 
[BaElFr*03, ShLuSe08]. As shown in [BaElFr*03], a mixture of trees provides better 
predictions over a single Markov tree in general, and better predictions than the classical 
PWM model. 


EXAMPLE 


E5: Figure 13.2.11 shows a hypothetical example of a binding site of length 8 on the 
DNA, and a graphical model that can be used as a model of TF binding. In this example, 
the TF binding model is a Markov tree with 8 vertices and 7 edges. Each vertex indexes 
the corresponding DNA position on the binding site (e.g., vertex 1 denotes the left- 
most position in the binding site), and edges capture independencies of nucleotides at 
different positions on the site according to the local Markov property of the graph. For 
example, the neighborhood structure of vertex 2 in the graph implies that, for observed 
values of nucleotides at positions 3 and 5, position 2 is stochastically independent of all 


other positions. 
transcription 
factor gs ay) 


satseaesdeccsseeaie CTCGGTCA _.......ceee. 


binding site CG) Cs) <> > 


Figure 13.2.11: An example of a hypothetical TF binding site on the DNA of length 
eight (on the left), and an undirected graphical model (a Markov 
tree) that can be used as a model of TF binding (on the right). The 
particular values of the nucleotides shown on the left can be viewed 
as realizations of the corresponding random variables, one per DNA 
position. 
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Learning Graphical Models 


For any choice of TF binding model, a fundamental question is how to learn the model 
i.e., estimate its structure as well as its parameters (typically via maximum likelihood), 
from measured data. The recent technological advances in biology permit the simultane- 
ous measurement of thousands of candidate binding regions on the DNA [BaBePht 09]. 
The measured data, after some preprocessing, are in the form of short DNA sequences 
that serve as binding sites of a given TF, and from this set of sequences a model can be 
learned. 


EXAMPLES 


E6: In Figure 13.2.6 graphs modeling the SBH problem in its standard version with 
constant-length oligonucleotide libraries are presented. They are constructed for the 
errorless spectrum from Example E1 by Algorithms 13.2.1 and 13.2.2. 


E7: In Figure 13.2.6 graphs modeling the SBH problem in its standard version with 
constant-length oligonucleotide libraries are presented. They are constructed for the 
errorless spectrum from Example E1 by Algorithms 13.2.1 and 13.2.2. 


FACTS 


F17: Learning a probabilistic graphical model from data by likelihood maximization 
is in general NP-hard [KoFr09]. 


F18: A Markov tree (13.2.2) can be learned in polynomial time by casting the problem 
as a maximum weight spanning tree problem, for appropriate weights computed from 
the observed data [ChLi68]. 


F19: Learning Markov networks with higher tree-width is NP-hard [Sr03]. 


REMARKS 


R14: The NP-hardness of the problem can be understood by noticing that in the case 
of general Markov networks (13.2.1) the normalizer Z involves a summation over an 
exponentially large set of all combinations of values for each variable. In the context of 
our TF binding problem, a brute-force computation of Z would require enumerating a 
set involving O(4“) elements, for a binding set of length d. Even for modest values of 
d, this can be an expensive computation. 


R15: Learning a mixture model is also a difficult problem, but complexity results are 
scarcer [AnHsHu* 12]. The complexity of learning an ensemble of trees model (13.2.4) 
is not known, but we conjecture that the problem is NP-hard to solve to global opti- 
mality. In practice, local optimization techniques are used, such as the Expectation- 
Maximization (EM) algorithm [DeLaRu77], but these techniques offer little or no opti- 
mality guarantees. Better results can be obtained by taking a greedy approximation in 
a Hilbert space, an approach that also offers theoretical guarantees [LiBa00, VILi02]. 
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13.2.4 Hypergraphs in Biology 


In this section we briefly discuss possible applications of hypergraphs in biology. We 
mainly follow Klamt et al. [KIHaTh09], who provide a readable review on the topic. The 
need for hypergraph modeling in biology stems from the fact that biological processes 
often involve more than two participating elements. A typical example is a biochemical 
reaction such as A+ B + C+ D that involves four molecules. Hypergraphs can 
naturally capture such multi-partner relations and therefore they can model a large class 
of problems in biology, such as protein-protein networks, gene regulation, biochemical 
reaction sets, and others [KIHaTh09, Mall, FIMaYet 12]. 


DEFINITIONS 


D12: An undirected hypergraph H is a pair H = (V, E), where V is a set of vertices, 
and FE is a set of nonempty subsets of V called hyperedges [Be89]. 


D13: A directed hypergraph H is a pair H = (V,E), where V is a set of vertices, 
and FE is a set of hyperedges, where each hyperedge is assigned a direction. 


D14: A transversal of a hypergraph (V,£) is a subset of V that has nonempty 
intersection with every edge in &. A hypergraph transversal T is called a minimal 
transversal if no proper subset of T is a transversal. 


EXAMPLES 


E8: A protein-protein interaction network can be modeled by an undirected hyper- 
graph, where the vertices of the hypergraph represent proteins, and the hyperedges 
represent protein ‘complexes’ in which two or more proteins are physically connected to 
each other [KIHaTh09]. 


E9: A chemical reaction A+ B— C+D can be modeled by a (trivial) directed 
hypergraph H = (V,£), where V = {A, B,C, D} is the set of participating molecules, 
and EF consists of only one hyperedge { A, B} > {C, D}. 


E10: Figure 13.2.12 shows an example of a small reaction network and its corresponding 
directed hypergraph. If all reactions were reversible, the hypergraph would have been 
undirected. 


Reaction network Hypergraph 
R,:A—B 
Ry: A+B—-+C+D 
Ry: D—-E 


Figure 13.2.12: A reaction network can be represented by a directed hypergraph in 
which vertices represent molecules and hyperedges represent reac- 
tions. Adapted from [KIHaTh09]. 
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E11: The influence of a change of the concentration of a metabolite A to another 
metabolite B in a metabolic reaction network can be characterized by the shortest 
hyperpath between vertices A and B in the corresponding reaction hypergraph. 


REMARKS 


R16: Several tractable problems in graph theory become intractable when moving to 
hypergraphs (e.g., bipartite hypergraph testing) [GaJo79, Be89]. 


R17: In a biological context, a minimal transversal of a hypergraph could define a 
set of distinct representative molecules, for instance a set of essential proteins in a 
protein-protein interaction network, or it could characterize the most economical way 
to ‘disrupt’ a biological system such as a cancer cell: In the case of a metabolic network 
involving a set of reactions modeled via a hypergraph, a minimal transversal of the 
hypergraph would index the smallest set of molecules the perturbation of which could 
have an effect on every single reaction in the network. 


R18: A faithful model of a biological system often requires imposing additional se- 
mantics on its objects, as in the case of stoichiometry where each biochemical reaction 
includes also the molecularities of the involved species [FIMaYet 12], in which case a 
hypergraph model must be annotated with additional semantics. Accordingly, several 
classical graph concepts when generalized to hypergraphs, such as hypercircuits or hy- 
percycles, may need to be carefully redefined in a biological context [Ze00]. 
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Glossary for Chapter 13 


ABC index - atom-bond connectivity index: the sum of the square root of edge 
weights for the graph, where the edge weights are defined as the edge degree divided 
by the product of vertex degrees of the pair of vertices forming the edge. 

adjoint G = (V,A) - of a digraph H = (U,V): a directed 1-graph whose vertices 
represent arcs of H and which has an arc from «x to y if the head of the arc in H 
corresponding to x is the tail of the arc corresponding to y. 

alternant conjugate molecule: a molecule in which multiple bonds (double or triple) 
alternate with single ones. 

assembling graph: a directed graph used for modeling instances of the DNA assem- 
bling problem, constructed with decomposition of input sequences into equal-length 
subsequences. It is a subgraph (but not vertex-induced) of a de Bruijn graph over a 
four-letter alphabet. 

Balaban index: an analog of the Randi¢ index in which the vertex degrees are replaced 
by the total distances of the graph vertices, with a normalizing coefficient including 
the number of graph edges and cycles of a molecule. 

benzenoid molecule: a molecule formed by fused hexagonal rings. 

consecutive 1s property in rows: when columns of a matrix can be ordered in such 
a way that in every row all ls are consecutive. 

cycle graph: a graph in which every node has degree two. 

de Bruijn graph: a labeled graph, which is complete with respect to the size of the 
alphabet and the length of labels. For an alphabet of size a and labels of constant 
length 1, de Bruijn graph B(a,l) has a! vertices, every one labeled by a different 
word over the alphabet. 

directed hypergraph: a hypergraph where each hyperedge is assigned a direction. 

directed line graph: an adjoint of a directed 1-graph. 

displacement correlation: refers to the correlation function of the displacements of 
two atoms (vertices) in a molecule. 

DNA graph: a labeled graph with labels over a four-letter alphabet. 

edge degree: number of edges adjacent to a given edge. 

ensemble of trees: a probabilistic model that is a mixture over all spanning trees of 
the complete graph formed by a set of variables. 

Estrada index: the sum of the exponential of the eigenvalues of the adjacency matrix, 
i.e., the trace of the exponential of the adjacency matrix. 

graph diameter: the length of the largest shortest-path distance in a graph. 

graph energy: the sum of the absolute values of graph eigenvalues. 

graph invariant: a characterization of a graph which does not depend on the labeling 
of vertices or edges. 

graph nullity: the multiplicity of the zero eigenvalue of the adjacency matrix, i.e., the 
number of times eigenvalue zero occurs in the spectrum of the adjacency matrix. 
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Hosoya index: the total number of selections of k mutually nonadjacent edges with k 
(max) equal to half of the number of vertices in the graph. 

hydrocarbon: a molecule formed only by carbon and hydrogen. 

hypergraph transversal: a subset of the set of vertices V of a hypergraph (V, £) that 
has nonempty intersection with every edge in EF. 

incidence matrix: of a graph: a matrix whose rows correspond to vertices and its 
columns to edges of the graph and the i,j entry is one or zero if the i‘” vertex is 
incident with the j*” edge or not, respectively. 

interval graph: a graph that represents the relation of interval overlapping in a lin- 
ear space. Two vertices, corresponding to intervals, are joined by an edge in an 
undirected graph if the intervals intersect. 

Kirchhoff index: the sum of the resistances of the graph. 

labeled graph: a directed graph whose vertices can be labeled over an alphabet in 
such a way that labels are unique and of the same length /, and the suffix of length 
!—1 of a label of vertex x is equal to the prefix of length | — 1 of a label of vertex y 
if and only if there is the arc (a, y) in the graph. 

Laplacian matrix: a square symmetric matrix with diagonal entries equal to the 
degree of the corresponding vertex and out-diagonal equal to —1 or zero depending 
if the corresponding vertices are connected or not, respectively. 

Markov network: an undirected probabilistic graphical model. 

Markov tree: a cycle-free Markov network. 

matching - of a graph: the number of mutually non-adjacent edges in the graph. 

mean displacement - of an atom (vertex): refers to the oscillations of an atom from 
its equilibrium position due to thermal fluctuations. 

molecular descriptor: a quantitative characteristic of a molecule based on its struc- 
ture or composition. 

molecular graph: simple graph with non-hydrogen atoms as nodes and covalent bonds 
between them representing links. 

molecular Hamiltonian: the operator representing the energy of the electrons and 
atomic nuclei in a molecule. 

paired ‘de Bruijn’ graph: a directed graph being a modification of the assembling 
graph, which incorporates information about paired input sequences (see §13.2). 

path graph: a graph formed by vertices a degree two except two nodes of degree one. 

Position Weight Matrix (PWM): a probabilistic model for TF binding that assumes 
that positions in a binding site are mutually stochastically independent. 

probabilistic graphical model: a directed or undirected graph, where each vertex 
defines a random variable and the edges define stochastic (in)dependence relations 
between these variables. 

quasi-adjoint graph: a directed graph with the following property satisfied for all 
pairs of vertices « and y: either their sets of immediate successors are disjoint, or 
equal, or one of the sets is contained in the other. 

Randié index: the sum of the inverse-square root of the product of degrees of all pairs 
of vertices in the graph. 

reduced-by-matching graph: a directed graph being a product of an algorithm 
reducing arcs not composing any Hamiltonian cycle in a digraph, via a series of 
perfect matchings solved for its subgraphs (see §13.2). 

resistance distance: a distance between any pair of vertices of the graph, determined 
by the Kirchhoff rules for electrical sets. 

shortest path: a path having the least number of edges among all paths connecting 
two vertices. 


1582 Chapter 13. Natural Science & Processes 


star graph: a tree consisting of a node with degree n — 1 and n — 1 nodes with degree 
one. 

stoichiometry: a directed hypergraph that models a set of biochemical reactions, 
where each reaction corresponds to a hyperedge that is annotated by the molecular- 
ities of the involved chemicals in the reaction. 

strongly regular graph: a regular graph in which every two adjacent vertices and 
every two non-adjacent vertices have an integer number of common neighbors. 

subgraph centrality - of a vertex: the corresponding diagonal entry of the exponential 
of the adjacency matrix. 

topological index: a graph invariant characterizing numerically the topological struc- 
ture of a molecule. 

transcription factor (TF): a protein that binds to DNA and regulates the activity of 
genes. 

transcription factor binding site: a site on the DNA where a transcription factor 
binds to. 

undirected hypergraph: a pair H = (V, EF), where V is a set of vertices, and E is a 
set of non-empty subsets of V (hyperedges). 

weighted matrix tree theorem: a theorem that states that the weighted sum of all 
spanning trees of a given graph with d nodes equals any minor of order d— 1 of the 
weighted graph Laplacian. 

Wiener index: the sum of the shortest-path distances between all pairs of vertices in 
the graph. 

Zagreb indices: a pair of topological indices based on sums of squared vertex degrees 
or the product of vertex degrees of adjacent vertices. 


Index 


6-valuation of a graph, 563 
A’-phase, 1399 

é’-phase, 1401 

€-optimal, 1400 


accept a string, 64 
Achievement game, 451 
achromatic coloring, 445 
achromatic number, 445 
act doubly transitively, 592 
act freely, 793 
act on an orientable surface, 807 
act regularly, 599 
act semiregularly, 599 
act transitively, 592, 793 
action on a Cayley graph, natural, 807 
active vertex, 1385 
acyclic coloring, 442 
acyclic digraph, 180 
acyclic graph, 1318 
acyclic orientation game, 427 
adding a crosscap, 734 
adding a vertex, 15 
adding an edge, 15 
adding an orientable handle, 734 
additive bandwidth of a graph, 1139 
additive bandwidth of an edge-numbering, 
1139 
adjacency function, 135 
adjacency, simple, 3 
adjacency list, 57 
adjacency matrix, 12, 57, 169, 213, 376, 
673 

___, It-order, 683 
adjacency operator, 1338 
adjacent edges, 3 
adjacent vertices, 3 

__ in a digraph, 56 
V-adjacent, 778 
adjoint of a graph, 1566 
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admissible arc (in a flow network), 1385, 
1400 

adversary reconstruction number, 86 

affine plane of order n, 903 

agenda, 217 

agree with a circuit orientation, 659 

agree with a cut orientation, 659 

algebra of k-labeled quantum graphs, 1054 

algebraic connectivity, 377, 687 

algebraic multiplicity, 674 

algorithm, 2-opt (k-opt), 346, 347 

ally reconstruction number, 86 

almost every graph, 325 

almost regular, 501 

almost transitive group action, 603 

alphabet, 266 

alteration method of random object 
generation, 1029 

alternant conjugated hydrocarbon, 1538 

alternating path, 1409 

amalgamands, 764 

amendment procedure, 217 

ancestor of a vertex, 177, 186, 1176 

Andrasfai graphs, 962 

angular resolution in a polyline drawing, 
1247 

annulus, 732 

X-anticomplete vertex, 127 

anticomplete vertex, 127 

anticomponent, 124 

anticonnected, 124 

anti-directed path (or cycle), 204 

antifactor set, 500 

antihole, 125, 426 

__, even, 125 
__, odd, 125 

antipodal double cover, 824 

antisymmetric digraph, 366 

apex graph, 755, 921 

approximate (or approximation) algorithm, 
294, 339 
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r(n)-approximation (for chromatic 
number), 461 

qa-approximation algorithm, 1201 

arborescence, 166 

arboricity, 483 

arc, 6 

s-arc, 599 

arc-coloring lemma, 663 

arc-cut, 166, 261 

arc-disconnecting set, 166 

arc-disjoint s-t paths, 1393 

k-arc-strongly connected, 28 

arc-transitive graph, 596 

Archimedean ¢-tolerance graph, 1110 

Archimedean function, 1110 

Archimedean graph, 1110, 1113 

area of a drawing, 1247 

c-arrangeable graph, 1014 

arrow a k-tuple, 1003 

aspect ratio of a drawing, 1247 

assembling graphs, 1569 

k-assignment, 410 

assignment in a bipartite graph, 1412 

associated symmetric digraph, 238 

associated transshipment network, 1398 

asymmetric graph, 591 

Asymmetric TSP (ATSP), 337 

asymptotically almost surely (a.a.s), 504, 
982 

asymptotically normal, 983 

asymptotically Poisson with mean A, 982 

atom (k-atom), 378, 385 

atom-bond connectivity index, 1543 

attach a closed or open ear, 776 

___, serially, 776 

augmenting path, 1394, 1409 

automaton, finite, 64 

automorphism group, 1067 

automorphism of a covering graph, 
natural, 788 

automorphism of a graph, 68, 69, 590, 
591, 616, 838, 1067 

_._, bounded, 607 

automorphism of a map, 838 

automorphism of an oriented map, 838 

automorphism of a weighted graph, 1040 

availability constraint, 534 

average distance of a graph, 1458 

average genus, 776 

Avoidance game, 451 


INDEX 


away game, 549 
axiom of uniqueness, 902 
axioms of uniformity, 902 


back edge, 59, 1177 

backtracking, 546 

backtrackless, 1342 

backtrack of a closed walk, 1342 

backward arc, 1380, 1394 

bad-cut-vertex, 1089 

badness function, 533 

Balaban index of a molecular graph, 1542 

balance condition, 292 

balanced binary tree, 178 

balanced Cayley map, 846 

balanced graphs, 129 

balanced incomplete block design, 902 

balanced labeling, 572 

balanced orientation, 259 

balanced partition, 1047 

balanced vertex, 259 

ball-spring graph, 1552 

bandsize of a graph, 1136 

bandwidth of a facility, 1496 

bandwidth-k graph, 106 

bar, 13849, 1359 

bar-amalgamation, 766 

bar layout of a graph, 1361 

bar representation of a graph, 1361 

bar-visibility graph, 1349 

bar k-visibility graph, 1359 

bar-visibility number of a graph, 1361 

1-barrier, 500 

barycentric subdivision of a map, 835 

base for a matroid, 692 

base graph for a voltage assignment, 783, 
789 

base graph for recursive construction, 
101, 1291 

base surface for a lifted imbedding, 795 

basic figure in spectral graph theory, 675 

basis (of a digraph), 180 

basis for a matroid, 692 

basis for recursive graph construction, 
103 

BD (abbr. for block design), 903 

beat (in a tournament), 197 

Belyi function, 838 

bend drawing, 1264 

bend in a polyline drawing, 1241 


INDEX 


Berge graph, 125 
Berge-basic graph, 126 
Bernoulli random graph, 981 
best improvement k-opt, 347 
Betti number 6(G), 746 
betweenness centrality of a node, 1522 
BIBD, 902 
biconnected component in a graph, 1189 
biconnected graph, 1188, 1240 
bicritical, 493 
bicycle, 664 
bicycle-based tripartition, 664 
bidegreed graph, 83 
bidirectional double tracing, 268 
bifurcated flow, 1498 
binary matroid, 693 
binary tree, 177, 186, 643, 1240 
binary-search tree, 178 
binary m-vector, 651 
x-binding function (or x-bounding), 421 
binding number, 492 
binomial random graph, 981 
binomial tree, 1479 
bipancyclic, 319 
bipartite degree closure, 315 
bipartite graph, 26 

___, r-by-s, 26 
bipartite index, 243 
bipartite Ramsey number, 1019 
bipolar digraph, 1240 
Birkhoff diamond, 43 
bisection width, 915 
bisimplicial vertex, 129 
block design, 903 
block of a graph, 237, 566, 1317 
block of imprimativity, 602 
block-cutpoint graph, 566 
blocking pair, 1421 
blossom, 1417 
bond in a matroid, 697 
bond matroid, 696 
book (polygonal complex), 737 
book graph, 568 
book graph (a la Erdés), 956 
boundary (spectral theory), 681 
boundary of a 2-manifold, 731 
boundary of a set of vertices, 1337 
boundary vertices, 1214 
boundary walk, oriented, 736 
boundary-separating, 763 
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boundary-walk specification, 736 
bounded bitolerance, 1111 

bounded tolerance graph, 1107 
x-bounding function, in structural graph 


theory, 142 
bouquet B,, 4, 21, 769, 786 
d-box, 1359 


box intersection graphs, 1359 
box-representable graph, 1352 
box-visibility graph, 1358 
box-visibility representation (BR), 1358 
boxicity of a graph, 1359 
BR by unit cubes, 1358 
Braess’s paradox, 1470 
branch points, 791, 821 
branch set, 791 
branch-decomposition, 108 

__-, partial, 108 
partial fuzzy subgraph, 1316 
branched covering, 791, 821 

___ space, 791 
branching, 166 
branchwidth-k graph, 108 
branchwidth of a graph, 108 
breadth-first search, 1175 
breadth-first tree, 60, 1176 
break (in a tournament), 174 
breakpoints, 1402 
bridge, 235, 1189, 1192, 1317 
bridge component of a graph, 1189 
bridge tree, 1189 
bridgeless graph, 1189 
broadcast chromatic number, 429 
broadcast graph, 1480 
broadcast protocol, 1478 
broadcast time, 1478 
broadcasting, 1478 
bull graph, 132 
bull-free graph, 132 
k-dimensional butterfly graph, 1485 


cactus, 750 

C,,-cactus, 566 

cage, 599, 906 

(k, g)-cage, 371 

call graphs, 1457 
candidate solution, 294 
canonical factorization, 555 
canonical factors, 814 
canonical form, 69, 814 
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canonical labeling, 69 
canonizing a graph, 69 


capacitated arc routing problem (CARP), 


299 

capacitated concentrator location 
problem, 1511 

capacitated CPP, 299 

capacitated facility, 1496 

Capacitated Vehicle Routing Problem 
(CVRP), 350 

capacity function (in a flow network), 
1378, 1387 

capacity of a flow-augmenting path, 1382 

capacity of an s-t cut, 1380 

cardinality of a cluster, 1210 

cartesian product, 16, 248, 323, 594, 686, 
1094, 1435 

carving lower bound, 1203 

Catalan numbers, 644 

Catalan triangulation, 877 

categorical product, 594 

caterpillar, 564, 1351 

Cauchy sequence, 1044 

Cayley graph (or digraph), 24, 381, 615, 
784, 806, 1340, 1480 

Cayley map, 807, 843 

Cayley set, minimal generating, 622 

Cayley’s formula, 638 

cell-distribution vector (a-vector), 826 

cells, 0-, 1-, and 2-dimensional, 736, 821 

cellular imbedding, 738, 745, 861 

__., strongly, 738 

k-center, 1074 

center of a graph, 17, 1066, 1076 

central vertex, 17, 1066 

k-central vertex, 1074 

centroid, 904 

certificate for a graph, 71, 1217 

chain with boundary, 1321 

characteristic polynomial of a graph, 674 

characteristic vector, 479, 1082 

Cheeger constant of a graph, 1338, 1459 

Cheeger inequality, 1460 

Cheeger ratio, 1459 

child (in a rooted tree), 177, 186, 643, 
1431 

Chinese Postman Problem, 45, 175, 270, 
284, 286 

k-CPP, 299 

chiral map, 842 


INDEX 


choice index, 454 
choice number, 410, 454 
(a, b)-choosability, 454 
f-choosable (or (f, g)-choosable), 440 
k-choosable, 410 
chord, 649 
chordal graph, 104, 510, 1114, 1328 
k-chromatic, 409 
chromatic graph theory, 1470 
chromatic index, 421, 454 
chromatic number, 7, 124, 409, 454, 753, 
1302, 1470 
___, online, 445 
b-chromatic number, 445 
chromatic polynomial, 444 
chromatic strength, 448 
chromatic sum, 447 
chromatic surplus, 1009 
Cl-graph, 619 
circ (closed trail), 652 
circuit in a graph, 318, 649 
circuit in a matroid, 692 
circuit matrix, 659 
circuit orientation, 658 
circuit space, 662 
circuit subspace, 652 
circuit vector, 656, 659 
circulant graph, 24, 381, 615 
circular C-coloring, 446 
circular chromatic number, 446 
circular interval graph, 130 
circular Lombardi drawing, 1269, 1270 
circulation, 1391 
circumcenter, 904 
circumference, 329, 964 
citation graphs, 1457 
Class 0 graph, 1437 
Class 1 (or 2) graphs, 424 
class edge-reconstruction number, 85 
class reconstruction number, 85 
class-teacher timetabling problem, 535 
classical vibrational scenario, 1552 
claw, 130, 491 
claw-free closure, 329 
claw-free graph, 83, 130, 491 
clean triangulation, 877 
clique of a graph, 475, 13802, 1327 
s-clique, 953 
clique hypergraph, 457 
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clique number w(G), 124, 361, 412, 475, 
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clique partition number, 510 
k-clique problem, 481 
cliquewidth-k graph, 111, 1305 
clone of a node, 1464 
closed disk, 730 
closed face, 861 
closed interval, 1068 
closed neighborhood, 1081, 1083 
closed neighborhood matrix, 1082 
closed set, 694 
closed surface, 731 
closed trail, 648 
closed under duality, 696 
closed under homeomorphism, 892 
closed under a taking of minors, 869 
closed unit disk, 730 
closed walk, 10, 1549 
closeness centrality of a vertex, 1521 
closure in a matroid, 694 
closure of a graph, 964 
cluster (for TSP), 348 
cluster (in a tree), 1210, 1214 

__ of type k, 1331 
clustered graph, 1275 


clustering coefficient of a graph, 1458, 


1524 
co-tree, 751 
coalescence, 686 
cobases, 696 
cobblestone path, 769 
coboundary of a graph, 1321 
cocircuits, 696 
cocktail party graph, 675 
cocycle basis, 1322 
cocycle rank of a fuzzy graph, 1322 
cograph, 110, 1303 
cographic, 696 
coindependent sets, 696 
collaboration graph, 1457 
collapsing operation, 1440 
collinear visibility layout, 1351 
coloops, 696 
color bound, 449 
color class, 409, 549 
color cost, 447 
color-bounded hypergraphs, 454 
colorable (hypergraph), 454 
(r, s)-colorable, 969 
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c-colorable, 7, 409 

colored graph, 1264 

colored point set, 1264 

colored point-set embedding, 1264 

coloring, 7 

__, on-line, 450 
___, partial, 449 

A-coloring, 445 

b-coloring, 445 

S-coloring, 451 

c-coloring of a graph, 7, 409, 753, 1264 

H-coloring, 447 

T-coloring, 445 

coloring number, 412, 969 

combinatorial geometry, 694 

combinatorially equivalent triangulations, 
877 

commodity (in a flow network), 1387, 
1496 

communication facility, 1496 

compact schedule, 174, 553 

comparability digraph, 189 

comparable (elements in a poset), 189 

compatible (a graph and a colored point 
set), 1264 

complement (edge-complement), 123, 1328 

complement of a fuzzy set, 1315 

complement of G’ in G, 648 

complement reducible graphs, 110 

complement threshold tolerance graph, 
1111 

complementarity property (for a 
compact schedule), 554 

complementary profiles (in a schedule), 
551 

complementary slackness, 1395 

complete (in structural graph theory), 
127, 153 

X-complete, 127 

complete k-partite graph, 26 

complete m-ary tree, 186 

complete r-uniform hypergraph, 454 

complete axiom system, 907 

complete bipartite graph, 26 

complete digraph, 22, 166 

complete fuzzy graph, 1319 

complete graph, 21 

complete graph invariant, 71 

complete isomorphism invariant, 779 

complete matching, 1412 


1588 


complete metric space, 1044 
complete multipartite graphs, 26 


completeness constraint (for timetabling), 


533 
completion of a transshipment network, 
1398 
completion time of a broadcast 
protocol, 1478, 1488 
complex graph, 988 
component, 14, 124, 235, 649 
___, even, 751 
_.., odd, 751 
component factor, 490 
composition G1 [G2], 574, 882 
composition of k-terminal graphs, 110 
composition of fuzzy relations, 1315 
composition of strips, 130, 131 
concatenation, 64 
concrete, 907 
concurrent flow problem, 1387 
condensation, 165 
condensation tournament, 201 
conditional connectivity, 384 
conditional domination number, 1098 
Condorcet paradox, 215 
Condorcet winner, 215 
conductance of a graph, 1338 
n-cone (or n-point suspension), 565 
configuration (in graph pebbling), 1429 
___, positive, 1443 
___, size of, 1429 
___, smooth, 1440 
___, support of, 1435 
__, weight of, 1431 
(r, k)-configuration, 903 
3-configuration, 903 
configuration, symmetric 903 
configuration model (of a random graph), 
1461 
conflict (in a timetable), 538, 543 
conflict graph, 539 
congestion at an edge, 799 
congestion of a mapping, 799 
congestion of an immersion, 916 
congruent imbeddings, 772, 894 
connected component in a graph, 1317 
k-connected graph, 27, 235, 237, 317, 
700, 702, 745, 751, 953 
connected search strategy, 1150 
connected sum, 733 
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connected vertices, 1316 

connection matrix, kth, 1050 

connection set, 24, 615 

connectivity requirement, 1505 

consecutive 1s property, 1564 

consistent axiom system, 907 

consistent set of arcs, 208 

consistently oriented at edge e, 736 

construction heuristic, 340, 485 

containment graph, 1106 

continuous model (for cost flows), 1402 

k-contractible edge, 245, 246 

contractible curve, 734 

contractible cycle in an imbedding, 861 

contractible edge in a triangulation, 883 

contractible to another graph, 419 

contractible to another triangulation, 883 

contraction of an edge, 245, 419 

contraction of a triangulation on an edge, 
883 

contraction of matroids, 699 

convergent sequence in a metric space, 
1044 

convergent sequence of graphons, 1043 

convex graph property, 981 

convex hull, 1069 

convex sequence, 925 

convex set of vertices, 1069 

convolution of sequences, 766 

cops-and-robbers, 1154 

copying model (in complex network), 1463 

k-core of a graph, 988, 1527 

corner vertex, 1155 

corona of two graphs, 1087 

cosine similarity (of vertices), 1528 

cospanning tree, 649 

cost chromatic number, 448 

cost flow network, 175 

cost of a coloring, 447 

cost set, 447 

course timetable, 538 

cover graph, 189 

cover of a graph, 89 

covering, k-fold, 821 

covering digraph, 784 

covering graph, 784, 785 

covering projection, 791 

___, regular, 793 
covering, regular, 843 
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covering space, 791 

___, regular, 793 
covering transformation, 793 
covering walk, 175, 268 
covering with folds, 882 
cover in a poset, 189 
Coxeter complex, 839 
Coxeter regular skew polyhedra, 844 
crawl (of nodes), 1458 
Cremona-Richmond configuration, 451 
k-critical (chromatically), 414 
k-critical (crossing-critical), 920 
critical graph, 44 
critical probability, 1467 
critically k-chromatic, 28 
critically k-connected, 28, 251 
critically k-edge-connected, 28 
cross edge, 59, 1177, 1178 
crosscap distribution polynomial, 761 
crosscap distribution sequence, 761 
crosscap number, 734, 739, 761, 815 

__-_, Maximum, 739, 761 

___, minimum, 739 761 

___, symmetric, 815 
crosscap range of a graph, 761 
crosscap types, 885 
k-crossing-critical, 920 
crossing number, 914 

__, odd, 923 

___, standard, 923 

__ of a drawing, 1247 

___ of a 1-factor, 496 
crossing of edges, 1241 
crossing sequence, orientable, 924 
crystallization of a map, 835 
d-cube graph, 24, 568 
cube-connected cycles graph, 1485 
current, 797 
current assignment, regular, 797 
current graph, regular, 797 
current group, 797 
cut, 166, 650 
s-t cut, 1380 
cut condition, 274 
cut distance, 1044, 1045, 1046 
cut matrix, 659 
cut norm of a graphon, 1046 
cut orientation, 659 
cut vector, 656, 659 
cut-edge, 15, 235, 745 
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cut-set, 166 

__ in a fuzzy graph, 1320 
cut-vertex, 15, 235, 1317 
cut-width, 146 
cutpoint (articulation point), 15, 1188 
cutset, 650 

___. space, 662 

___ subspace, 654 
cutwidth of a graph, 915, 1138 
cutwidth of an edge-numbering, 1138 
CVRP tour, 350 
cycle, 10, 692, 1316 

___ basis, 1322 

_._ cover, 269 

_-__ decomposition, 259 

___ double cover, 269 

__ extendable, 320 

___ flow (or path flow), 1403 

__ graph, 22, 1318 

__ in a digraph 204 

__ in a fuzzy graph, 1328 

__ of length n, 124 

___ packing, 270 

__ rank B(G), 25, 746, 776 

___ rank of a fuzzy graph, 1322 

___ shift, 306 

___ unit, 1087 

___, kth power of, 962 
p-cycle, 1514 
Cycle Double Cover Conjecture, 270 
cyclic bandwidth of a graph, 1136 
cyclic bandwidth of an edge-numbering, 

1136 

cyclomatic number of a graph, 1542 
cylinder, 732 


dag (directed acyclic graph), 180, 1181 

dart, 834 

dart-reversing involution, 834 

de Bruijn 2-factor, 306 

de Bruijn arc, 306 

de Bruijn graph, 266, 306, 1486 
___, k-dimensional binary, 1486 
___ of order k, S-relative, 312 

de Bruijn sequence, 266, 305 

de Bruijn shift, 306 

dead end of a path, 1190 

decision tree, 217 

deck (for reconstruction), 78 
_._ polynomial, 88 
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decomposable graph, 1096 

decomposition, 548 

H-decomposition of G, 506 

decomposition tree, 101, 1292 

decreasing property, 981 

decremental of a dynamic, 1208 

deficiency €(G) of a graph, 751 

deficiency of a branch point, 810 

d-degenerate, 412 

degree of a vertex, 8, 56, 592, 629 

degree associated reconstruction 
number, 95 

degree centrality of a node, 1521 

k-degree closure, 315 

degree factor, 490 

degree lower bound, 1203 

degree sequence, 8 

degree-k graph, 1240 

degree-equivalent, 143 

Delaunay drawable (planar triangulated 
graph), 1243 

Delaunay triangulation (of a drawing), 
1243 

deleting a vertex, 15 

deleting an edge, 15 

deletion in a matroid, 699 

demand (in a flow network), 1387 

dense graph, 57 

dense imbedding, 861 

density of edges between X and Y, 1047 

dependent subset of a matroid, 692 

depth (in a rooted tree), 177, 186 

depth-first forest, 59, 1177, 1178 

depth-first search, 1177 

depth-first tree, 59, 1177 

(X,-permutation) derived digraph, 789 

derived digraph, 784 

derived graph, 784 

derived imbedding of a current graph, 
797 

derived imbedding of a voltage graph, 
795 

derived surface, 795 

descendant (in a rooted tree), 177, 186, 
1176 

dessin d’enfant, 839 

destination node, 1496 

detachment (operation) of G, 263 

dfs tree, 1177 

P-diagonal flip, 892 
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diagonal flip, 888 
diameter of a graph, 11, 361, 1065, 1075 
k-diameter, 1073 
digon, 796 
digraph, 2, 6, 56 
__, connected, 165 
__, weakly connected, 11 
___, strongly connected, 11, 649 
k-connected digraph, strongly, 28 
digraph representation of a relation, 172 
digraph, strongly connected, 632 
st-digraph, 1240 
Dijkstra’s algorithm, 171 
dimension order (in a broadcast 
protocol), 1481 
dipole, 4, 21, 769, 786 
Dirac constant, 1552 
Dirac delta function, 1552 
direct product (or conjunction), 324, 594 
direct sum, 699 
direct visibility, 1359 
directed bridge, 1199 
directed bar-visibility graph, 1351 
directed circuit, 663 
directed cutset, 663 
directed de Bruijn graphs, 1566 
directed distance from, 11 
directed distance of two vertices, 1075 
directed edge, 6 
directed facilities (in a network), 1496 
directed graph, 6, 56 
directed hypergraph, 1575 
directed line graph, 1566 
directed sweeping, 1157 
directed to (or from), 6 
directed tree, 166, 184 
directed version of CPP (DCPP), 285 
directed walk, 10, 164 
x-y directed walk, 165 
disconnected, 235 
disconnecting (vertex-)set (edge-set), 166, 
235 
(u|v)-edge-set, 387 
(u|v)-disconnecting set, 239 
discovery order, 1180 
discrete model (for convex cost flow), 
1402 
disk, 730 
___, open, 730 
dismantleable graph, 1155 
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distance between two graphs, 1072 

distance between two vertex subsets, 363 

distance between two vertices, 11, 1064 

t-distance connectivity, 387 

t-distance edge-connectivity, 387 

distance function (in a flow network), 
1385 

distance graph, 429 

distance matrix (of a TSP), 337 

distance sum for a vertex, 1542 

distance-regular, 383, 682 

Distinct Spheres Condition, 610 

distinguishing chromatic number, 608 

distinguishing coloring, 608 

distinguishing number, 608 

diverse path protection (in network 
design), 1512 

diversification (in network design), 1512 

divisible by t (property of a graph), 510 

division tree, 218 

divisor, 594 

DNA graphs, 1566 

DNA sequence, 312 

dodecahedron graph, 370 

domain (in a poset), 188 

dominance drawing of an acyclic digraph, 
1242 

dominate, 1200 

dominate or dominated (in a 
tournament), 197 

dominating circuit, 318 

dominating set (in a tournament), 212 

dominating set, 7(G)-set, 1080, 1303 

domination chain, 1085 

domination critical, k-critical, 1097 

domination graph, 212 

domination number, 212, 1080 

domination perfect graph, 1085 

dominator tree, 1200 

double jump, 987 

double ray, 591 

double tracing, 268 

double-split graph, 126 

doubled path, 767 

doubly-periodic graph, 461 

doubly-regular, 198 

drawable as a minimum spanning tree, 
1244 

6-drawing, 1244 

hv-drawing, 1242 
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drawing of a graph, 913, 1239 
__, convex, 1241 
__, e-planar 1275 
__., oc-planar, 1275 
__, planar, 1241 
dual (matroidal), 696 
dual (voltage-current), 797 
dual (Whitney), 668 
dual Archimedean, 1113 
dual feasible, 1397 
dual map (Poincaré), 821 
dual of a cost flow problem, 1397 
dual prices, 1395 
Dyck map, 844 
dynamic graph, 1207 


E-adjacent imbeddings, 778 

ear decomposition, 746 

Eberhard’s Theorem, 828 

eccentricity of a vertex, 11, 1064 

k-eccentricity, 1073 

edge, 2, 736 

edge, proper, 3 

edge amalgamations, 764 

edge attribute, 5 

edge choice number, 421 

k-edge-coloring, 1003 

3-consecutive edge coloring, 452 

edge contraction, 748 

edge deletion, 747 

edge density model, 325 

edge design, 1497 

edge difference polynomial, 412 

edge expansion constant of a graph, 1338 

edge insertion, 747 

edge search, 1146 

edge-k-connected, 28, 745, 1201 

edge-2-connected component, 745 

edge-bandwidth, 1137 

edge-chromatic number, 8, 535 

c-edge-colorable, 8 

edge-coloring, 8 

edge-complement, 15, 648 

k-edge-connected, 237, 268 

edge-connectivity, 27, 166, 236, 238, 268, 
365 

edge-contraction, 115, 822 

edge-cut, 166, 235, 261, 1505 

edge-deck, 78 

edge-deleted subgraph G — e, 78 
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edge-deletion subgraph G — F, 235 

edge-disconnecting set, 166 

k; edge-disjoint paths problem, 147 

edge-extraction, 115 

edge-group, 592 

edge-isomorphism, 592 

edge-multiplicity, 3 

edge-numbering, 1136 

edge-recognizable, 82 

edge-reconstruction, 79 

k-edge-reconstruction, 92 

edge-searchable graph, 1146 

edge-suppression, 824 

edge-symmetric (-transitive), 380 

edge-transitive, 14, 596 

edge-weights, 1039 

edge-width of a map, 822, 861 

edgesum generated by an edge-numbering, 
1136 

edgesum of a graph, 1136 

Edmonds’s Branching Theorem, 1199 

eigenvalue power law, 1460 

eigenvalues of a graph, 326, 674 

eigenvalues-diameter (lower) bound, 676 

eigenvector centrality, 1522 

elementary figure (spectral graph 
theory), 675 

elementary graph, 89 

elimination ordering of vertices, 1155 

embedding, see imbedding 

empirical coefficients, 1549 

empty graph, 21, 647 

end of that closed or open ear, 776 

end-equivalent, 603 

endoskeleton, 1246 

endpoint, 2 

endpoint visibility graph, 1363 

ends of an infinite graph, 603 

endvertex-deck, 84 

endvertex-reconstructible, 84 

ensemble of trees, 1571 

enthalpy of a graph, 1550 

equitable coloring, 413 

equitable partition, 72 

equivalent imbeddings, 894 

equivalent panel structures, 896 

equivalent sequences, 1044 

equivalent under diagonal flips, 888 

Erd6s—Rényi random graph, 981 

Erdos—Hajnal property, 140, 141 
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Erd6és—Rényi graph, 961 

Erdés—Sés conjecture, 963 

ES (algorithm), 295 

a-essential, 607 

essential curve, 734 

essential cycle in a graph imbedding, 887 

Estrada index of a molecular graph, 1550 

ET tree, 1213 

Euclidean crystallographic group, 812 

Euclidean space group, 812 

Euclidean TSP, 337 

Euler characteristic, 39, 734 

Euler genus, 498, 815, 861 

___, Symmetric, 815 

Euler polyhedral equation, 746 

Euler tour, 1212 

eulerian graph, 28, 31 

eulerian spanning subgraph, 427 

eulerian tour (graph), 175, 259, 285 

eulerian trail, 10 

eulerian triangulation, 865 

even (undirected) graph, 259, 652 

even adjacency splits, 1485 

even mixed graph, 291 

even vertex (on an alternating path), 
1417 

even-hole-free, 129 

even-symmetric strategy (ES), 295 

event (in a random graph), 981 

exact algorithm, 339 

exceptional (an ordered pair of vertices), 
1320 

excess at a vertex, 1385, 1399 

excess of a graph, 987 

excluded minor, 704 

expanded blossom, 1417 

€-expander (expander graphs), 1345 

€-expander (spectral graph theory), 682 

expander family, 1338 

expansion constant of a graph, 1338 

exponential growth, 597 

exponential time hypothesis, 481 

m-extendable, 492 

extended flow network, 1391 

extended triangle group, 838 

external degree of a cluster, 1210 

external face of a region, 1241 

extremal function, 954 

extremal graphs, 954 
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face, open, 860 
face of a map, 821 
face of a region, 1241 
face of an imbedding, 736, 738, 745 
face tracing, 741 
face-size sequence (p-sequence), 826 
face-width of a map, 822 
face-width of an imbedding, 498, 861 
facility, 1496 
facility capacity, 1496 
facility levels, 1496 
factor, 490, 550 
n-factor, 23, 318, 490, 497, 550 
1-factor, 491, 1409 
(g, f)-factor, 502 
(a, b|-factor, 501 
f-factor (for a function f), 500 
G-factor (for a graph G), 510 
G-factor recognition problem, 510 
H-factor (for a set H of graphs), 504 
F-factor, 504 
H-factor, 957 
factor-critical, 504 
d-factorizable, 550 
factorization, 490 
1-factorization, 506, 622 
d-factorization, 550 
faithful imbedding, 896 
faithfully imbeddable graph, 896 
Fano coloring, k-line, 451 
Fano matroid, F7, 704 
Fano plane, 905 
farness of a node, 1521 
farthest vertex insertion, 344 
favoring, Do-, 272 
fb-walk, 761 
feasible multicommodity flow, 1387 
feasible schedule (timetable), 533, 538, 
557 
feedback set of arcs, 207 
fiber, 791 

_.., edge, 785 

___, vertex, 785 
fiber-preserving, 793 
fiber transformation group, 843 
final vertex of a walk, 10 
finish time order of vertices, 1180 
first moment method, 1026 
fixed cost, 1496 
fixed size model, 325 
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fixed-parameter tractable, 481 

flag, 835 

flag algebras, 1054 

flat, 694 

2-flip, 867 

feasible flow, standard, 1379, 1391 

flow, 271, 1497 

k-flow, 271 

s-t flow, 1391 

flow across cut, 1381 

flow decomposition, 1403 

flow network, standard, 1390 

s-t (flow) network, 175, 1378, 1391 

flow with gains, 1405 

flow-augmenting path, 1382 

flow-over-time network, 1403 

forbidden graphs, 954 

forbidden minors, 115, 748 

forest, 17, 1318 

forward arc (in a residual network), 1394 

forward arc (of a cut), 1380 

forward edge (in a depth-first tree), 59, 

1177 

fractional bijection, 1045 

fractional chromatic number, 440 

fractional vertex-coloring, 439 

k-fragment, 385 

frame, 137 

F-free (for a family of graphs F), 128 

{Fi, Fo, sey Fi, }-free, 327 

(C4, Cs)-free, 1089 

H-free graph, 124, 953 

S-free (for a tournament S), 141 

free action, 807 

free edges, 326 

free vertex (relative to a matching), 1409 

frontier arc, 167 

frontier edge, 18 

frozen triangulation, 888 

full triangle group, 810 

fullerene, 493 

fully cycle extendable, 320 

fully dynamic, 1208 

functional graph, 187 

fundamental circuit, 653 

fundamental circuit matrix, 660 

fundamental cutset, 654, 660 

fundamental polygon, 737 

fundamental sequence of a fuzzy graph, 
1325 


1594 


fuzzy cycle set (or cocycle set), 1321 
fuzzy cycle vector, 1321 

fuzzy 1-chain with boundary, 1321 
fuzzy coboundary, 1321 

fuzzy cocycle basis (or rank), 1322 
fuzzy cut set, 1320 

fuzzy cycle (or cocycle) 1321, 1322 
fuzzy cycle rank, 1322 

fuzzy digraph, 1325 

fuzzy forest, 1318 

fuzzy graph, 1316 

fuzzy intersection graph, 1323, 1326 
fuzzy interval graph 1327 

fuzzy number, 1327 

fuzzy power set, 1315 

fuzzy relation, 1315 

fuzzy singleton, 1319 

fuzzy subgraph, 1316 


Gabriel graph, 1244 

gain factor, 1404 

gain network, 1404 

gainy arc, 1404 

game chromatic number, 451 

gas-water-electricity problem, 40 

gear graphs, 566 

general graph, 4 

generalized CPP, 299 

generalized p-cycle, 367 

Generalized Asymmetric Traveling 
Salesman Problem, 348 

generalized dicyclic group, 600 

generalized hierarchical product, 248 

generalized partial circuit (or cutset) 
ear decomposition, 669 

generalized routing problem (GRP), 299 

Generalized Symmetric Traveling 
Salesman Problem, 348 

genus, 28, 734, 738, 745, 746 

_-, Minimum, 28, 738, 746, 760 

genus distribution polynomial, 761 

genus distribution sequence, 761 

genus imbedding, 738 

genus of a finite geometry, 907 

genus of a group, 807 

__., (strong) symmetric, 807 

genus of a surface, 39, 821 

genus range of a graph, 760 

geodesic, 607, 1064 

geodetic, 607 
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s-geodetic, 388 
geodetic iteration number, 1069 
geodetic number, 1068 
geometric, 923 
geometric multiplicity, 674 
geometric protean model, 1465 
geometric realization, 827 
geometric representations, 704 
geometry, finite, 902 
___, n-point, 903 
_._ of Desargues, 904, 905 
___ of Pappus, 904 
__., abstract model of, 907 
GI (complexity class), 71 
girth, 361, 964 
girth pair (91, 92), 370 
Golomb postulates of randomness, 311 
gossip protocol, 1488 
gossip time, 1488 
gossiping, 1487 
graceful labeling, 563 
___, a-labeling, 572 
graph, 2, 741, 834 
(n, k)-graph, 251 
(a, b|-graph, 501 
st-graph, 1354 
graph amalgamations, 764 
graph automorphism, 13 
graph encoding, 835 
graph invariant, 71 
graph parameter, 1049 
___, normalized (or simple), 1049 
graph polynomial, 412 
graph process, 325 
graph product, 594 
graph property, 968 
graph union, 16 
graph-based molecular descriptor, 1542 
graph, regular, 23 
graph, simple, 4, 745 
graph, trivial, 4, 21, 647 
graphic, 693 
graphical regular representation, 600 
graphical sequence of integers, 492 
graphon, 1038, 1039 
___ corresponding to a graph G, 1039 
___, simple, 1039 
r-greedy, 1429 
Greenwell’s Theorem, 80 
grid drawing, 1241 
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Grotzsch graph, 962 

ground set, 692 

group, (p,q,r) or (p,q,r)°, 810 
growth, 597 

GRR, 600 


Grundy coloring (chromatic number), 445 


guest (in a mapping G > H), 799 


H-decomposition Problem, 510 

half degree, 250 

half-disk, 730 

half-edge, 741 

half-transitive, 597 

Halin graph, 105, 769 

Hamilton-laceable graph, 621 

hamiltonian (or Hamilton-) connected, 
315, 390, 621 

hamiltonian cycle or graph, 28, 34, 200, 
314, 320, 984 

_.., k-ordered, 320 

hamiltonian decomposition, 323 

hamiltonian partition, 506 

hamiltonian path, 200 

Hamming distance, 477 

Hamming graph, 477, 683 

handle types, 885 

HAP (home-away pattern), 551 

hard constraint, 533 

harmonious coloring, 446 

Hasse diagram, 189 

head of an arc (or directed edge), 6, 164 

heavy-tailed distributions, 1459 

Heawood formula, 832 

Heawood graph, 371, 600 

Heawood number, 442 

Heawood problem, 753 

height (of a rooted tree), 177, 186 

height of a fuzzy set, 1325 

helm graphs, 565 

Helmholtz free energy, 1550 

hereditary class of graphs, 142 

hereditary collection of subsets, 694 

hereditary property, 968 

hereditary under minors, 869 

hexagonal system, 493 

hierarchical CPP, 299 

hierarchical drawing of a graph, 1241 

hierarchical generating set, 382 

historical path, 1225 

hitting time, 326, 995 
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Hoffman polynomial, 677 
hole, 125 
___, even, 125 
__., odd, 125, 425 
home game, 549 
home-away pattern, 551 
homeomorphic, 750 
homeomorphically reduced tree, 637 
homogeneous balanced graph, 113 
homogeneous pair (structural graph 
theory), proper, 127 
homomorphic to a graph, 962 
homomorphism, 90, 447 
___ between weighted graphs, 1041 
_-_ with forbidden X, 90 
_--, normalized, 1042 
homomorphism between (oriented) maps, 
838 
homomorphism density, 1041, 1042 
homomorphisms, number of, 1041 
Hosoya index of a molecular graph, 1542 
host (in a mapping G > H), 799 
hull number of a graph, 1069 
hull set (in a graph), 1069 
Hurwitz bound, 841 
Hurwitz group, 814, 848 
Hurwitz maps, 848 
hypercube graph, k-dimensional binary, 
1480 
d-hypercube graph, 24 
hyperedges, 1575 
hypergraph, 453 
__., bi-, 454 
__., C- (or D-), 454 
__., mixed, 454 
_.., C-edges in a, 454 
___, D-edges in a, 454 
hypergraph, k-uniform, 1020 
hypergraph imbedding, 836 
hypermap, 836 
hyperplane, 694 
hypohamiltonian, 497 
hypotraceable, 497 


idle at time, 1483 

Thara zeta function, 1342 
illegitimate deck, 94 
imbeddable, 376 

imbedded (di)graph, 1241 
imbedded voltage graph, 795 
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imbedding number, orientable, 761 
imbedding of a graph, 738, 745, 913 
_., 2-cell, 738, 824 
___, circular, 861 
___, strongly noncellular, 762 
__, symmetric, 807, 815 
imbeddings, similar, 871 
immediate dominator, 1200 
immersion in a surface, 738, 916 
immersion of a subgraph, strong, 145 
imperfect graph, 124 
imprimitive group action, 602 
improvement heuristics, 340 
improvement local search, 485 
in-arcs, 261 
in-branching, 210 
in-degree, 197 
in-eccentricity, 388 
in-score, 197 
in-set, 197 
in-tree, 184, 210, 265, 290 
incenter of a triangle, 904 
incidence matrix, 57, 660 
incidence of edges and vertices, 3 
incidence set, 261, 650, 654 
incidence vector, 656 
incidence-partition system, 273 
inclusion tree, 1275 
increasable arc, 1382 
increasing graph property, 981 
incremental (dynamic graph problem), 
1208 
indegree of a vertex, 8 
independence number, 315, 409, 475, 552, 
953 
independent axioms for a geometry, 907 
independent domination number, 1083 
independent odd crossing number, 923 
independent set (of vertices), 240, 315, 
475, 539, 552, 1083, 1292 
independent subset of matroid, 692 
index of mutuality, 1523 
indirect visibility, 1359 
induced digraph (in voting), 215 
induced imbedding, 741 
induced Ramsey number, 1019 
induced subgraph on a subset of edges, 
647 
induced subgraph on a subset of 
vertices, 14, 124, 648 
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infinite connectivity, 606 
informed at time t, 1478 

___ by a gossip protocol, 1488 
initial vertex of a walk, 10 
insertion of a vertex at an arc, 343 
integer flow, 271 
integer program (in network design), 1496 
integrality gap, 1500 
interior vertex, 378 
interior of a contractible cycle, 861 
interlace of two segments in a cycle, 1194 
interlaced, 572 
intermediate growth, 598 
internal search strategy, 1150 
internal vertex of a walk (or path), 10, 

237, 1200 
internal vertex (in a rooted tree), 186 
internally-disjoint, 237 
intersection graph, 1106, 1359 
intersection of fuzzy sets, 1315 
interval dimension, 1112 
interval edge coloring, 453 
interval graphs, 1563 
intuitionistic fuzzy graph, 1334 
irreducible schedule, 553 
irreducible snark, 423 
irreducible tournament, 197, 632 
irreducible tree, 637 
irreducible triangulation, 884 
irredundance number, 1084 
irredundant set of vertices, 1083 
ISO (complexity class), 71 
isolated edge, 669 
isolated vertex, 8 
isomorphic k-labeled graphs, 1050 
isomorphic as permutation groups, 592 
isomorphic factorization of a graph, 622 
Isomorphic Factorization Problem, 510 
isomorphic graphons, 1039 
isomorphic graphs, 12 
isomorphic labeled graphs, 1050 
isomorphic maps, 821 
isomorphic matroids, 692 
isomorphic triangulations, 877 
isomorphism between two general 
(or simple) graphs, 12 

isomorphism invariant, 12 
isomorphism of graphs, 68 
isomorphism of labeled graphs, 1039 
isomorphism of weighted graphs, 1040 
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isomorphism-complete, 71 

isoperimetric constant of a graph, 1337 

isoperimetric number of a set of nodes, 
1459 

isoperimetric set (of vertices), 1337 

isotopic triangulations, 877 

iterated local transitivity model, 1464 


jackknife operation on graphs, 103, 1296 
jellyfish pseudosurface, 732 

Johnson graph, 683 

join operation, 16 

2-join, proper, 126 


k-HB (homogeneous balanced) graphs, 
1308 

Konigsberg bridges problem, 31 

«-absorption, 278 

«-detachment, 277 

«*-transformation, 278 

«-transformation, 277 

Kazhdan constant (of a finite group), 
1340 

Kempe-chain, 42 

Kepler—Poinsot regular star polyhedra, 
843 

kernel in an oriented graph, 427 

kernel for a surface, 871 

key or key unit (type of graph), 1087 

Kier and Hall molecular connectivity 
index of a molecular graph, 1542 

king, 209 

Kirchhoff current law (KCL), 797 

Kirchhoff index of a molecular graph, 
1543 

Kirchhoff matrix, 266 

Kirchhoff voltage law (KLV), 795, 882 

Kleene closure, 64 

Klein bottle, 733 

Klein map, 844 

Kneser graph, 609, 1435 

Knéodel graph, 1480 

Kronecker product, 248, 594 

Kuratowski’s Theorem, 748 


N (or A“))-optimal, 385 
labeled digraph, 632 

labeled graph, 627, 1050 
labeled graph (in biology), 1566 
labeled tree, 637 
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labeling, L(r1,...,7a)-, 445 

ladder, closed-end, 768 

ladder, Mobius, 616, 962 

language, 64 

Laplacian eigenvalues, 377 

Laplacian matrix of a graph, 377, 687, 
1543 

large edge-width imbedding, 866 

large-scale (of a complex network), 1457 

lattice graph, 680 

layered (di)graph, 1240 

layered drawing, 1241 

layered network, 668 

leaf, 186 

k-leaf-connected, 390 

left (right) child, 177, 186 

left coset, 381 

left (right) subtree, 177 

left-right (binary) tree, 643 

Lemke graph, 1435 

length function on a graph, 1175 

length of a directed walk, 165 

length of a walk, 10 

less than (in a poset), 189 

letters (of an alphabet), 266 

level (in a rooted tree), 177, 186 

level family of fuzzy subsets, 1325 

level set, 1315 

Levi graph, 906 

LEW-imbedding, 866 

lexicographic product, 324, 594 

lift of a walk, 794 

lightpath, 669 

limit of a convergent graph sequence, 

1043 

limit of a convergent sequence of graphons, 

1043 

limit of a sequence in a metric space, 

1044 

line digraph, k-iterated, 368 

line graph, 29, 126, 318, 363, 421, 678, 
1137 

___, generalized, 678 

k-linear arboricity, 506 

linear extension ordering, 190 

k-linear forest, 506 

linear mixed-integer program, 1496 

linear program (in network design), 1496 

linear programming relaxation, 1500 

lines in a geometry, 902 
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link in a triangulation, 877 

link vertex, 1087 

k-linked, 243 

list assignment, 410 

list chromatic index, 421 

list chromatic number, 410 
L-colorable, 410 

list colorable, 410 

list coloring, 454 

list critical, k-, 414 

list edge-chromatic number, 421 
list homomorphism, 447 

load at a vertex, 799 

load of a mapping, 799 

lobe of a graph, 602 

lobster, 564 

local access network, 1499 

local clustering coefficient of a vertex, 
1525 

local completion, 329 

local group of a voltage group, 788 
local improvement, 485 

local rotation, 834 

local search (LS), 485 

logarithmic density, 968 

Lombardi drawing, 1269, 1270 
long circular interval graph, 130 
loop in a graph, 3 

loop in a matroid, 694 

loopless graph, 4 

lossy arc, 1404 

low connectivity survivable network, 1505 
lower (or upper) chromatic number, 454 
Lyndon word, 309 


majority digraph, 215 

manifold, 2-, 731 

map defined by a crystallization, 835 

map determined by a rotation system, 
834 

map determined by a signed rotation 
system, 834 

map given by a graph encoding, 835 

map homomorphism, 821 

map on a surface, 821 

map sequences, g-admissible pair, 828 

map type, 826 

map, orientably regular, 842 

map, regular, 842 
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Markov chain (DTMC), discrete-time, 
170 
Markov digraph, 170 
Markov network, 1571 
Markov tree, 1571 
matched vertex, 1409 
___, mate of, 1409 
matching, 243, 287, 475, 1303, 1408 
matching lower bound, 1203 
matching number, 495 
matrix function, 1549 
matrix indexed by a set, 1051 
matrix, regular, 667 
matroid, 692 
___, regular, 693 
___, simple, 694 
___, restriction of, 699 
maximally (edge-) connected, 237, 238 
maximally t-distance (edge-) connected, 
388 
maximally distant, 664 
maximum clique, 547 
maximum crosscap imbedding, 739 
maximum density, 985 
maximum flow, 1393 
___ problem, 175, 1379 
__ with gains, 1405 
maximum s-t flow over time, 1403 
maximum genus imbedding, 738 
maximum genus of a graph, 738, 746, 
760 
maximum multicommodity flow problem, 
1387 
maximum nonorientable genus, 761 
maximum number of bends of a polyline 
drawing, 1247 
maximum-clique problem, 476 
maximum-independent-set problem, 476 
maximum-size matching, 1409 
maximum-weight bipartite matching, 1393 
maximum-weight-cycle-packing problem, 
270 
maximum-weight matching, 1409 
maximum-weight-clique problem, 476 
maximum-weight-independent-set prob- 
lem, 476 
measurable function, 1039 
measure-preserving map, 1039 
medial graph, 861 
medial map, 821 
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median of a graph, 1072 

median vertex, 1072 

meeting (in timetabling), 532 

memetic algorithms, 548 

Menger graph, 906 

merge (of two cycles), 352 

merging twins, 1040 

metric on a set, 1044 

metric ray, 607 

metric space, 1044 

Metric TSP, 337 

metric type, 607 

min-max k-CPP, 299 

minimal clean triangulation, 887 

minimal dominating set, 1083 

minimal domination imperfect graph, 1086 

minimal generating set, 381 

minimal forbidden minor, 748 

minimal polynomial, 676 

minimal transversal hypergraph, 1575 

minimal triangulation, 822 

minimally k-connected (or edge-connected), 
250, 711 

minimum broadcast graph, 1480 

minimum broadcast tree, 1479 

minimum convex cost flow, 1402 

minimum-cost circulation, 1391 

minimum-cost flow, 1391 

minimum-cost-flow problem, 175 

minimum cost maximum flow with gains, 
1405 

minimum-cost transshipment, 1393 

minimum crosscap imbedding, 739 

minimum s-t cut, 1380 

minimum degree of a digraph, 238 

minimum t-degree, 388 

minimum edge cover, 1093 

minimum edge-degree, 385 

minimum k-edge-degree, 385 

minimum genus imbedding, 738 

minimum geodetic set, 1068 

minimum geodetic subgraph, 1068 

minimum gossip, 1490 

minimum hull set, 1069 

minimum hull subgraph, 1070 

minimum spanning tree of a set of points, 
1244 

minimum triangulation, 881 

minimum weight drawable graph, 1246 

minimum weight drawing, 1246 
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minimum weight forbidden graph, 1246 

minimum weight triangulation, 1245 

minimum-vertex-cover problem, 476 

minimum-weight-cycle-cover problem, 270 

minor crossing number, 923 

minor of a graph, 115, 420, 699, 748, 
869, 970 

minor of a map, 829 

minor of a matrix, (i, 7)”, 266 

minor of a matroid, 699, 706 

___, proper, 699 

minor-closed class of graphs, 748 

minor-closed class of matroids, 704 

minor-minimal imbedded graph, 870 

mirror image of a map, 835 

mixed-cut, 261 

mixed graph, 6, 1192 

mixed hypergraph, 454 

mixed search 1151 

mixed version of CPP (MCPP), 285 

mixed-searchable graph, 1151 

mixture of trees, 1571 

Mobius band (or strip), 732 

Mobius ladders, 568 

model for an axiom system, 907 

modularity (a measure of homophily), 
1528 

molecular graph, 1538 

momentum of a node, 1552 

monadic second-order logic for a graph, 
1300 

monodromy group of a map, 838 

monogon, 796 

monotone property, 968, 981 

monotonic search strategy, 1150 

Moore bound, 372, 374 

multi-arc, 6 

multicommodity flow, 1387 

multi-edge, 3 

multigraph, 4, 421 

multi-level network design, 1502 

multipartite number of a graph, 1356, 
1358 

multiplicative, 1043, 1049 

mutually reachable, 165, 238 

Mycielski graphs, 962 


Nash equilibrium (a proper coloring), 1470 
natural projection of covering graph to 
base graph, 785, 796 
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nauty, 72 
NC (complexity class), 510 
near-regular tournament, 198 
near-planar graph, 921 
near-triangulation, 1351 
nearest vertex insertion, 344 
Nebesky nu-invariant, 751 
necklace, 309 

__ of type (r,s), 750, 776 
negative (edge-) boundary, 378 
negative (a-)fragment, 378 
negative witness proximity drawing, 1274 
neighborhood, 197, 316 
6-neighborhood, 1244 
neighborhood complex, 411 
neighbors (of two clusters), 1214 
neighbor of a vertex, 3 
k-nest, 861 
net voltage on a walk, 787, 882 
network, 1496 

_.. design problem, 1496 

_.. loading problem, 1509 

__. reliability, 267 
nil pointer, 57 
no-clashes constraint, 533 
no-conflicts constraint, 533 
node, 2 
node levels, 1505 
node search, 1151 
node-label-controlled graph, 112 
node-searchable graph, 1151 
noncollinear visibility layout, 1351, 1352 
non-deterministic finite automaton, 64 
non-Fano matroid, Ff , 704 
non-intersecting transition system or eu- 

lerian tour, 276 
non-intersecting with respect to O*(v), 
275 

non-orientable crossing sequence, 924 
non-orientable genus, 734 

___, minimum, 761, 815 

___, symmetric, 815 
non-orientable surface, 734 
non-repetitive coloring, 449 
non-revisiting path, 829 
non-saturating push operation, 1400 
nonseparable fuzzy graph, 1317 
non-separating cocircuit, 711 
non-separating curve, 734 
non-tree edges or vertices, 18, 1177 
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normal drawing, 913 

normal form of matrix, 667 

normalized Laplacian, 1460 

nowhere-zero k-flow, 443 

Nowhere-Zero 5-Flow Conjecture, 271 

nowhere-zero integer flow, 271 

null graph, 4, 21, 647 

nullity of a graph, 649 

nullity of a molecular graph, 1540 

number of bends of a polyline drawing, 
1247 

st-numbering of a graph, 1196 


octahedral graph, d-, 24 

odd-cycle property, 491 

(1, f)-odd factor, 501 

odd-hole-free, 129 

odd K-expansion, 420 

odd vertex (on an alternating path), 1417 

on-line social networks, 1457 

open ear decomposition, 1196 

open eulerian trail, 259 

open neighborhood of a set, 1083 

open neighborhood of a vertex, 1081 

open postman tour, 286 

openly-disjoint, 237 

optimal gossip protocol, 1489 

optimal gossip size, 1489 

optimal pyramid, 137 

orbit of a vertex, 14 

orbit of an edge, 14 

orbit of an automorphism group, 1040 

order of a graph, 315 

order of a tournament, 197 

order of an edge in a branch 
decomposition, 108 

order-type, 620 

ordered-pair permutation, 634 

orientable CDC, 270 

orientable surface with g handles, 734 

orientation of a graph, 1189, 1192, 1328 

orientation reversing, 734 

oriented d-coloring, 549 

Oriented Cycle Double Cover 
Conjecture, 270 

oriented graph, 168, 366 

oriented polygon, 736 

origin node, 1496 

originator (in a graph), 1478 

orthocenter of a triangle, 904 
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orthogonal (P(G)-), 273 

orthogonal complement in a vector space, 
656 

orthogonal drawing, 1241 

orthogonal representation of a drawing, 
1241 

orthogonal subspaces of a vector space, 
656 

Otter’s formula, 639 

out-arcs, 261 

out-branching, 210 

out-degree, 8, 56, 197 

out-eccentricity, 388 

out-set, 197 

out-tree, 166, 184, 210, 265 

outerplanar digraph, 1241 

outerplanar graph, 276, 441, 769, 1241 

outerplanar imbedding, 769 

overlap matrix, 772 


packing (in a graph), 1093 

P3-packing, 504 

packing chromatic number, 429 

packing coloring, 429 

packing number of a graph, 1093 

PageRank matrix, 1468 

paintability, 450 

k-paintable, 450 

painting of a directed graph, 663 

pair crossing number, 923 

pair permutation 4), 629 

paired de Bruijn graphs, 1569 

Paley graph, 616 

pancyclic graph, 319, 966 

panel in a triangulation, 896 

parallel elements, 694 

parallel operation, 103, 160, 1296 

parameter matrix, /**-order, 683 

parameterized complexity problems, 481 

parent (in a rooted tree), 177, 186, 1431 

k-parity-linked, 243 

partial (in a genus distribution), value 
of, 765 

partial order, 188 

partial k-tree, 103, 1352 

partial triple system, 451 

partially balanced incomplete block 
design, 903 

partially directed graph, 6 

partially dynamic graph problem, 1208 
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partially ordered set, 188 
k-partite graph, 26, 361 
partite sets, 26 
partition (in timetabling), 539 
partition-cut, 261 
partition into k parts, 1047 
partition refinement, 72 
partition, maximum r-path, 1429 
partitioned genus distribution, 765 
pasting topological spaces, 736 
path, 10, 57 
A-path, 240 
A-—B path, 240 
k-path, 204 
k-paths problem, 244 
T-path, 492 
T-path covering, 492 
path-decomposition, 107, 146 
path factor, 504 
path flow, 1403 
path graph, 22 
pathwidth, 107, 146 
pathwidth-k graph, 107 
PBIBD, 903 
pebbling exponent, 1437 
pebbling graph, 1434 
pebbling number, 1435 
__, cover, 1443 
___, distance, 1443 
___, domination target, 1443 
___, rooted, 1429 
__, target, 1443 
__, weighted, 1443 
(optimal) pebbling number, 1440, 1443 
__, fractional, 1443 
__, k-fold, 1443 
2-pebbling property, 1435 
pebbling step from u to v, 1429 
__, weighted, 1443 
pebbling threshold, 1441 
penalty weight, 538 
percolation, 1467 
perfect difference set, 905 
perfect graph, 124 
perfect matching, 287, 491, 550, 1189, 
1409 
peripheral vertex, 1066 
periphery, 1066 
permanent of a matrix, 495 
permutation graph, 249 
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permutation group, 592 

___, regular, 599 

___, transitive, 592 

___, order of, 592, 629 
permutation voltage group, 789 
Perron root, 482 
personalized PageRank, 1469 
Petersen graph, 24, 45 
Petrie dual of a map, 821 
Pevzner’s graphs, 1566 
Pfaffian, 496 
phase transition, 987 
pinched open disk, 731 
pinched torus, 731 
anar graph or digraph, 28, 1241 
lanar matroid, 697 
anar slope number of a graph, 1273 
lanar triangulation, 322 
anarizing collection of cycles, 869 
lanarizing curve, 763 
anarizing orders, 868 
plane graph, 441 
Platonic graph, 24 
Platonic solids, 843 
k-plex, 1526 
pointed Lombardi drawing, 1270 
points of a geometry, 902 
polygon vertex visibility graph, 1362 
polygonal complex, 736 
polyhedral imbedding, 861 
polyhedral map, 821 
polyline drawing, 1240 
polynomial growth of degree r, 597 
poset, 188 
positive boundary, edge-, 378 
positive fragment, a-, 378 
positive ordering of F,, 275 
positive semidefinite, 1051, 1052 
positive witness proximity drawing, 1274 
postman problem, 284 
postman tour, 175, 268, 284 
postorder of a dfs tree, 1180 
k-th power of a graph, 318, 1437 
power law (in degree distribution of a 

graph), 1459 
power law graph, 1459 
power law random graph, 1460 
Precoloring Extension problem (PrExt), 
449, 450 

preferential attachment model, 1462 
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preflow, 1385 

preorder of a dfs tree, 1180 

preserves orientation, 807 

price of anarchy, 1470 

prime graph (relative to graph product), 
594 

prime graph (structural graph theory), 
132, 141 

prime walk, 1342 

primitive graph, 602 

primitive group action, 602 

principal partition, 665 

principal subgraph, 665 

principal submatrix, 678 

prism, 138 

probabilistic graphical model, 1571 

probability (system in given state), 1550 

product of two graphs, 16 

production, 767 

profile of a graph, 1137 

profile of a team (in scheduling), 551 

profile of an edge-numbering, 1137 

profile width of a graph, 1137 

program flow graph, 1197 

projective plane (geometric), 903 

projective plane (topological), 732 

proper (vertex-) edge-coloring of a 
hypergraph, 453 

proper (vertex-) node-coloring, 409, 539 

proper ancestor or descendant, 177, 1176 

proper arc-coloring, 174 

proper descendant, 177 

proper edge-coloring, 421, 535 

proper Hurwitz group, 814 

proper tolerance graph, 1108 

proper total coloring, 421 

properly (p,q,r), 810 

proportional cost structure, 1503 

protein-protein interaction network, 1457 

pseudo TRVG, 1354 

2-pseudomanifold, 731 

pseudosurface, 731 

punctured surface, 891 

push(v, w) operation, 1400 

pyramid, 136, 1437 

___, near-, 1437 
pyramidal, 1437 


quadrangulate, 897 
quadrangulation, 864 
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quadratic residue tournament, 198 
quality of a timetabling solution, 545 
quantum graph, k-labeled, 1054 
quantum vibrational scenario, 1552 
quasi 4-connected, 371 

quasi-adjoint graph, 1567 

quasi-line graph, 131 

quasi-minimal connection set, 621 
quasi-order, 143 

quickly recognized class membership, 116 


RAC drawing, 1271 

radial graph, 861 

radio coloring (or L(2, 1)-labeling), 445 
radius of a graph, 11, 1065, 1075 
k-radius, 1073 

rainbow connection, 446 

Ramanujan, 1339 

Ramanujan graph, 681 

(Ramsey good) G-good graph, 1009 
Ramsey graphs, 1017 


(Ramsey minimal) (G), G2,--- ,G,)-minimal, 


1003 
(Ramsey minimal) (G, H)-minimal, 1017 
Ramsey multiplicity, 1019 
Ramsey number, 1003, 1028 
_.., connected, 1019 
___, generalized, 1003 
_--, restricted-size, 1017 
Ramsey-finite pair, 1017 
Ramsey-infinite pair, 1017 
Randié index (in a molecular graph), 
1542 
k-in, l-out random digraph Dy_-in,-out, 
325 
random graph, 504, 981 
W-random graph, 1049 
random graph process, standard, 995 
random regular graph, 1461 
random vertex insertion, 344 
randomness of an infinite binary sequence, 
311 
rank, 649, 694 
___ r(M) of a matroid, 694 
___ of an infinite matrix, 1051 
__ of an abelian group, 814 
rank-r whirl, 706 
ranking number, 448 
Rao-contained, 143 
t-rational, 510 
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ray in an infinite graph, 591 

realize a topological space, 736 

receiver (in a tournament), 197 

Recognition Problem, t-Rational, 510 

recognizable class (graph reconstruction), 
82 

N-reconstructible, 94 

reconstructible graph, 78 

reconstruction index, 93 

reconstruction number, 85 

reconstruction of a graph, 78 

rectangle intersection graphs, 1359 

rectangle of influence graph, 1244 

rectangle-visibility, 1352 

rectilinear crossing number, 923 

rectilinear drawing, 923 

recursive circulant graph, 1480 

recursively constructed graph class, 101, 
1291 

red-blue edge-coloring, 1027 

Rédei’s Theorem, 1193 

reduced cost vector, 1395 

reduced digraph, 1240 

reduced face-size sequence, 826 

reduced incidence matrix, 660 

reduced ordered-pair group gl, 634 

reduced tree, 637 

reduced vertex-degree sequence, 827 

reduced-by-matching graphs, 1568 

p-reducible, 247 

reducible arc, 1382 

reducible configuration, 43 

reducible flow graph, 1197 

reducible tournament, 197 

reduction operations, 116 

reflection positive, 1051 

reflexible map, 842 

regular graph, strongly, 684 

regular action, 616 

regular expressions, 64 

k-regular graph, 23, 44 

s-regular graph, 599 

e-regular pair of vertex sets, 1047 

regular sets, 66 

regular tournament, 198 

related vertices, 1177 

relation, 172 

relatively prime, 594 

removable circuit (of a matroid), 709 

representable over F, 693 
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k-representative, 877 
representativity, 498, 861 
reservation (in a network), 1512 
residual capacity (or network), 1382, 1394 
resistance distance of two vertices, 1543 
resource (slot) in timetabling, 532 
restricted edge-connectivity, 385 
k-restricted-edge-connectivity, 385 
restricted edge-cut, 385 
k-restricted-edge-cut, 385 
restricted multi-level partition, 1211 
restricted partition of a tree, 1210 
retract, 268 
retracing, 268 
retract-free, 268 
reverse direction on an edge, 736 
reverse of a trail, 277 
Riemann—Hurwitz equation, 810 
right child, 177, 186 
rim, 706 
ring sum, 651 
Ringel-Youngs Theorem, 753 
Rédl nibble, 1031 
root of a tree, 166, 184, 1240 
root systems, 680 
rooted graph, 764 
rooted imbedding, 866 
rooted labeled tree, 637 
rooted map, 848 
rooted oriented map, 848 
rooted tree, 166, 184, 637, 1240 
rotation, (global) 741 
rotation, generalized, 741 
rotation at a vertex, 741 
rotation of a binary string, 309 
rotation system, 741, 834 
rotation system of a map, 834 

___, Signed, 834 
rotational tournament, 198 
round-robin tournament, 549 
router graphs, 1457 
(optimal) rubbling number, 1443 
rubbling step, 1443 

__., strict, 1443 
run, 311 
run (in a binary sequence), maximal, 

311 

run-permuted sequence, 311 
rural postman problem, 286 
RVG, 1352 
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saturated arc set, 1394 

saturating an arc, 1392 

X-saturating matching, 1412 

saturating push operation, 1400 

saving (of a merge), 352 

savings digraph, 352 

scanned (from a vertex), 1179 

Schlafli graph, 130 

score, 197 

score sequence, 197 

score vector, 197 

search, 1174 

search tree, 73 

second neighborhood, 206 

seed vector, 1469 

seg (synonym for cut), 651 

segment endpoint visibility graph, 1363 

segment of a tour, 277 

segment reversal, 277 

segment visibility graph, 1363 

segments (a family of subgraphs), 1193 

self-centered graph, 1067 

self-healing rings (in a network), 1512 

self-loop, 3 

self-union, m-fold, 16 

selfish routing, 1470 

semicellular graph imbedding, 762 

semi-circle law, 1460 

semidominator, 1200 

semigirth, 367 

m-semigirth, 375 

r-semigreedy, 1429 

semiregular (or almost regular) factor, 
501 

semisymmetric, 597 

d-separated matching, 510 

separate A from B, 240 

(A|B)-separating, 240 

separating curve, 734 

separating two rays (of an infinite graph), 
603 

separation pair (in a graph), 1195 

serf, 209 

series operation, 103, 160 

series operation of graphs, 1296 

series-parallel digraph, 1241 

series-parallel graph, 103, 1241, 1295, 
1353 

set edge-reconstructible, 87 

set of lifted boundary walks, 795 
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set of minimal forbidden minors, 748 

set reconstructible, 87 

set-merging problem, 1197 

Shabat polynomial, 838 

sharp, 984 

shift operation, 306 

shortcuts, 345 

shortest-path tree, 1175 

shrunken blossom, 1417 

siblings, 186 

shuffle-exchange graph, k-dimensional 
binary, 1486 

Sidorenko Conjecture, 1055 

signature of a graph, 741 

signed boundary walk, 736 

signed edges, 736 

n-simplex, 24 

simulated annealing, 547 

simultaneous embedding, 1267 

simultaneous geometric embedding, 1267 

sincere decision, 217 

single-source shortest paths, 1393 

single-source single-sink network, 1391 

sink, 175, 180, 1378 

sink of a dag, 1181 

sink vertex of a digraph, 1240 

size of a gossip protocol, 1489 

size of a graph, 315 

size (cardinality) of a matching, 1409 

size of an r-book, 956 

size Ramsey number, 1003 

skeleton of a complex, 24, 736 

skeleton of an imbedding, 877 

skew vertex, 895 

skew-morphism, 843 

skew-partition, 126 

skew-partition, balanced, 127 

slope number of a graph, 1273 

slope of a segment, 1273 

small world phenomenon, 1458 

smallest bridgeless spanning subgraph, 
1201 

smooth covering, 821 

smooth Lombardi drawing, 1270 

smoothing, 749 

smoothing move (in graph pebbling), 1440 

snark, 423 

soft constraint, 533 

solid 1-cover, 1483 

solid h-cover, 1485 
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D-solvable, 1429 
__, weakly, 1448 
D-solvable on G, 1434 
r-solvable, 1429 
__, k-fold, 1429 

sophisticated decision, 218 

source, 175, 180, 1378 

source of a dag, 1181 

source vertex of a digraph, 1240 

span of a coloring, 445 

spanning cycle or path, 200 

spanning forest, 649 

spanning fuzzy subgraph, 1316 

spanning set, 694 

spanning subgraph, 14 

spanning tree, 18, 649 

spanning walk, 865 

sparse, 57 

sparse (complex graph), 1458 

sparse subgraph, 1217 

special-cycle, 1089 

specification of a fundamental polygon, 
737 

spectral gap, 681 

spectral gap of a regular graph, 1338 

spectral moment of an adjacency 
matrix, 1549 

spectrum of a graph, 674 

sphere (surface), 732 

n-sphere of a vertex, 610 

n-spike, 706 

spiked cycle, 212 

spindle pseudosurface, 732 

spine of a book, 737 

splice at uv, 278 

a-b split of G at v, 263 

split graph, 496, 1114 

split graph (in graph pebbling), 1434 

split graph (structural graph theory), 
144 

splitting a vertex of a weighted graph, 
1040 

splitting operation, 245, 263 

splitting-closed class of triangulations, 
892 

spoke of a wheel, 706 

square of a cycle, 962 

square of a graph, 421 

stability number, 124 

stabilizer, 596 


1606 


stable partners, 1423 
stable perfect matching, 1421 
stable (or independent) set of nodes, 124, 
539, 552 
k-stable tournament, 209 
stably bounded hypergraphs, 454 
stacked books, 568 
stacker crane problem, 286 
standard plane representation, 177 
standard simplex, 479 
standardized variable, 983 
star chromatic number, 446 
star cutset, 128 
k-star cutset, 128 
star neighborhood in a triangulation, 877 
start vertex in a flow graph, 1197 
stationary DTMC, 170 
status of a vertex, 1071 
Steiner distance (of a set of vertices), 
1073 
Steiner nodes, 1503 
Steiner tree, 1073 
__ problem, 1503 
Steiner triple system, 451, 902 
straddling edges, 669 
straight-line drawing, 1240 
straightness of a ray, 607 
r-strategy, 1431 
___, basic, 1431 
r-strategy polytope, 1431 
stratified graph, 778 
stratum, j**, 779 
strength of a path, 1316 
strength of connectedness, 1317 
strict k-coloring, 454 
strictly balanced, 985 
strip, 606 
k-strong, 28 
strong center of a digraph, 1077 
strong central vertex, 1077 
strong component, 166, 1185 
_.. graph, 1185 
Strong Cycle Double Cover Conjecture, 
270 
strong diameter of a digraph, 1077 
strong digraph, 200, 1075 
k-strong digraph, 203 
strong distance, 1077 
strong double tracing, 268 
strong eccentricity of a vertex, 1077 
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Strong Perfect Graph Theorem, 125 

strong product, 324, 594 

strong radius of a digraph, 1077 

strong sweeping, 1157 

strong visibility, 1349 

strongly connected, 165, 200, 238, 285 

strongly k-connected, 28 

strongly connected digraph, 1185 

k-strongly connected digraph, 203 

strongly geodetic, 388 

strongly graceful, 572 

strongly noncontractible curve, 763 

strongly orientable, 168 

(n, k; a, c)-strongly-regular, 383 

structure (graph reconstruction theory), 
91 

STSP, 337 

sub-tour elimination constraints, 341 

sub-tours, 341 

subdivision graph, 326 

subdivision of an edge or graph, 420 

subexponential growth in an infinite graph, 
597 

subgraph, 14, 647 

subgraph centrality of an atom, 1550 

Subgraph Recognition Problem, 
k-regular, 513 

substitution operation, 132 

successor of a substring t, 305 

k-sum, 700, 702 

k-sum G- H, 1050, 1054 

sum of subgraph centralities, 1550 

sun, 505 

sun (in graph pebbling), 1435 

super k-restricted edge connected, 385 

(e, 0)-super-uniform pair, 967 

super-k, 366 

super-A, 363, 366 

super-A*), 385 

support of a set, 1315 

supporting surface of a map, 821 

surface, 731 

___ minor of a graph imbedding, 869 

surface rotation at a vertex, 741 

surface rotation of an imbedded graph, 
(global), 741 

surface with k holes, 734 

survivable layered network, 669 

survivable network, 1505 

suspension operation, 16 
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sweep (of nodes), 1458 

sweep number of a graph, 1146 

sweep problem on a graph, 1146 

sweep strategy, 1146 

k-sweepable graph, 1146 

switch (on a node), 1496 

switch installation cost, 1496 

switch of signed rotation system at a 
vertex uv, 834 

switches in a signature, 741 

symbol set, 198 

symmetric, strongly, 807 

symmetric difference, 651 

symmetric digraph, 285 

symmetric function, 1039 

symmetric group, S,, 381, 629 

symmetric pair-group s2), 629 

Symmetric TSP, 337 

symmetric vertex, 285, 291 

symmetric-even, 296 

symmetry group of an imbedding, 894 

symmetry of a graph, 68 

symmetry of an imbedding, 894 

system of imprimitivity, 602 

k-system that dominates, 318 

Szekeres—Wilf number, 412 


tabu search, 485, 548 

tail of an arc (directed edge), 6, 164 

tail in a closed walk, 1342 

tailless closed walk, 1342 

target of a graph, 1240 

teleportation constant, 1468 

k-terminal graph G = (V,T, E), 109 

k-terminal recursively structured class, 
109 

terminal vertex of a walk, 10 

terminals in a series-parallel graph, 103, 
1295 

ternary matroid, 693 

ternary tree, 1240 

tessellation, 838 

theta graph, 138 

thickening of a long circular interval graph, 
131 

thickness of a graph, 1352 

1-thin graph, 134 

thread, 1429 

threshold function, 983 

threshold graph, 496 
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threshold tolerance graph, 1111 

Thue chromatic number, 449 

time (slot), 531 

time delay, 483 

time-constraint CPP, 299 

time-dependent CPP, 299 

timetabling problem, 531 

tolerance chain graph, 1110 

tolerance graph, 1107 

top tree, 1214 

topological bandwidth, 1138 

topological index of a molecular graph, 
1542 

topological numbering of a directed graph, 
1181 

topological realization of a graph G, 738 

topological rectangle visibility graph, 1354 

topological sort (or topsort), 190 

topology tree, 1211 

torsion subgroup, 604 

torus, 733 

g-torus, 734 

total degree, 291 

total distance of a vertex, 1071 

total edge length of a drawing, 1247 

total graph, 421 

total imbedding distribution, 772 

total weight of a weighted graph, 1039 

totally separates, 240 

totally unimodular matrix, 694 

t-tough, 319 

toughness, 319, 376, 491 

tour, 337, 348 

k-tour (in a binary string), 306 

tournament, 28, 141, 168, 196, 198, 549, 
632, 1185 

___, almost regular, 198 
___, round-robin, 632 

n-tournament, 197 

tournament matrix, 213 

tower of a of length & (extremal graph 
theory), 967 

traceable, 314 

trail, 10 

A-trail (eulerian), 276 

transformation, 870 

transit time, 1403 

transition (system), 273 

transition matrix, 170 

transition probability, 170 
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transition probability matrix, 1468 
transitive closure, 172 
transitive closure of a graph, 63 
transitive digraph, 172 
transitive edge, 1240 
s-transitive graph, 599 
transitive tournament, 141, 201 
transitively orientable graph, 1328 
translation (type of automorphism), 603 
transmission path, 1488 
transmitter, 197 
transpose of a fractional bijection, 1045 
transshipment network, 1391 
transversal hypergraph, 1575 
trapezoid graph, 1111 
trapezoid order, 1111 
Traveling Salesman Problem, 45 
traveling-tournament problem, 548 
traversable mixed graph, 1192 
tree, 17, 102, 649, 1292, 1318 

_.., ordered, 177, 186, 643, 1176 
k-tree, 103, 1352 

___, ordered, 643 

__, m-ary, 186 

__, 1-4, 1-rooted 1-4, 640 
tree distance, 664 
tree edge, 18, 59, 1177 
tree vertices, 18 
tree-decomposition, 106, 1298 
treewidth, 106, 1298 
triangle group, 810, 838 
triangle inequality, 337 
triangle-free, 133 
triangular graph, 680 
triangular imbedding, 876 
triangulate, 276 
triangulate a surface, 877 
triangulated graph (= chordal), 104 
triangulation, 441, 822, 865 

___, k-irreducible, 887 

_.., k-minimal, 822, 888 

___, of a closed surface, 876 

__., of a surface-with-boundary, 877 

_-, on a sphere, standard, 889 

___, pseudo-minimal, 890 

__., tight, 880 
P-triangulation, P-equivalent, 892 
triangulation (of a set of points), 1245 
triconnected graph, 1195, 1240 
trigraph, 135 
trivial blocks, 602 
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trivial dessin, 839 

trivial edge subset, n-, s-, 384 

trivial edge-disconnecting set, 363 

trivial necklace, 309 

TSP, 337 

Turan graph, 954 

Turan number, 954 

Turan type problem, 954 

twins, 1040 

twist, 693 

twisted product, 248 

two-terminal series-parallel (di)graphs, 
1242 

type, 815 


unavoidable minors, 708 

unavoidable set, 43 

uncapacitated facility, 1496 
uncapacitated network design model, 1500 
uncolorable, 454 

underlying cellular imbedding, 763 
underlying graph of a crystallization, 835 
underlying graph of a map, 821 
underlying graph of digraph or mixed 
graph, 6 

underlying semicellular imbedding, 763 
underlying topological space, 736 
undirected facilities, 1496 

undirected graph, 56 

undirected hypergraph, 1575 
undirected sweeping, 1157 

undirected version of CPP (UCPP), 284 
r-uniform hypergraph, 454 

uniform matroid, 692 

€-uniform pair, 967 

uniform random graph, 981 

unimodal distribution, 775 

unimodal distribution, strongly, 775 
unimodular matrix, 660 

union of fuzzy sets, 1315 

union of matroids, 698 

uniquely colorable, 454 

uniquely k-colorable, 416 

uniquely edge k-colorable, 424 
uniquely hamiltonian, 323 

uniquely imbeddable graph, 894, 896 
unit bar-visibility graph, 1351 

unit cylinder, 732 

unit disk, 730 

unit distance graph, 429 

unit graph, 1087 
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unit half-disk, 730 

unit rectangle-visibility graph, 1352 
unit RVG, 1352 

unit sphere, 732 

unit tolerance graph, 1108 
universal class of objects, 1356 
universal k-colored point set, 1264, 
universal dessin, 839 

universal extended triangle group, 838 
universal triangle group, 838 
university course timetabling, 538 
unmatched vertex, 1409 
D-unsolvable, 1429 

r-unsolvable, 1429 

r-unsolvability polytope, 1431 
unsplittable flow problem, 1388 
untight triangulation, 880 

update on a graph, 1207 

upper (or lower) chromatic number, 454 
upper domination number, 1083 
upper irredundance number, 1084 
upper-embeddable graph, 751 
upward drawing, 1241 

upward planar digraph, 1241 
utilities problem, 40 


valence, 8 
d-valent vertex, 8 
valid vertex mapping, 110 
value of a flow, 1379 
value of a corresponding partial, 765 
variable cost, 1496 
vector matroid, 692 
vector space, 651 
vector with entries indexed by LD, 1051 
Vehicle Routing Problem (VRP), 350 
vertex, 2 
_-. amalgamation, 764 
_-. attribute, 5 
_._ boundary, 1345 
___ coloring, 3-consecutive, 452 
___ cover, 475, 495, 1408 
___ independence number, 1083 
___ of a polygonal complex, 736 
__ ranking, 448 
___ set, 135, 834 
__. splitting, 883 
_.. with label 7, 1050 
__. variant, 737 
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vertex visibility graph, 1362 
vertex-coloring, 7, 409 
vertex-k-connected, 745 
vertex-connectivity, 27, 166, 236, 238, 
364, 317 

vertex-covering number, 495 
k-vertex-critical, 414 
vertex-cut, 166, 235 
vertex-degree sequence (v-sequence), 827 
vertex-deleted subgraph, 78 
vertex-deletion subgraph, 235 
vertex-disjoint paths problem, 147 
vertex-edge incidence matrix, 678 
vertex-face incidence graph, 861 
vertex-labeling, standard (0-based), 19 
vertex-splitting, 822 
k-vertex-splitting, 245 
vertex-symmetric, 380 
vertex-transitive, 380 
vertex-transitive action, 807 
vertex-transitive graph, 14, 592, 616 
vertex-weighted graph, 476 
vertex-weights, 1039 
vertex-weights, normalized, 1045 
vertices and edges of a map, 821 
z-visibility, 1356 
k-visibility band, 1359 
visibility drawing, 1241 
e-visibility layout, 1349 
visibility representation of a graph, 1356 
Vizing’s Conjecture, 1096 
voltage, 783, 789 
voltage assignment, U,,-permutation, 789 
voltage assignment, regular, 783 
voltage graph, 882 

___, regular, 783 

__, Up-permutation 789 
voltage group, 783, 882 
voltage sequence, 787 
volume of a flow, 1391, 1403 
volume of a set of nodes, 1459, 1464 
Voronoi diagram, 1243 
VR-representable graph, 1356 


walk, 10 

k-walk, 865 
___, in a voltage graph, 787 
___, open, 10 
__., trivial, 10 
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weak $-drawing, 1244 

weak isomorphism, 1323 

weak line-isomorphism, 1323 

weak vertex-isomorphism, 1323 

weak immersion, 145 

Weak Perfect Graph Theorem, 125 

weak product, 594 

weak rectangle of influence drawing, 1244 

weak sweeping, 1157 

weak visibility, 1349 

weak bar k-visibility, 1359 

weak 3D visibility representation, 1356 

weakly chordal graph 1115 

weakly connected, 165 

weakly edge-reconstructible, 82 

weakly equivalent, 896 

weakly k-linked, 243 

weakly pancyclic, 966 

weakly reconstructible, 82 

web graph, 565, 1457 

weight, 695 

weight function (in graph pebbling), 1431 

weight matrix, 337 

weight of a cycle, 270 

weight of a vertex, 1431 

weight of a vertex set, 476 

weight of an edge, 1408 

weight of an matching, 1409 

weighted graph, 1039, 1443 

weighted sum of closed walks, 1549 

Weissfeiler-Lehman refinement, 73 

well-behaved time bound, 1217 

well-covered graph, 504 

well-quasi-order, 143 

wheel graph, 245, 675, 706 

wheel-neighborhood, 861 

Whitney 2-flip, 867 

Whitney-similar, 867 

whole segment visibility graph, 1363 

width (representativity), 861 

width of a branch decomposition, 108 

width of a path decomposition, 107, 146 

width of a tree decomposition, 106—108, 
146, 1298 

Wiener index of a molecular graph, 1542 

Wiener index of a graph, 1464 

windy postman problem, 286 

word, 266 

wqo (well-quasi-ordered), 143 

wreath product, 595 
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Xuong tree, 751 


Y AY-equivalent, 870 
yield an expander family, 1343 


Zagreb indices of a molecular graph, 1542 
Zemel measure, 339 

zig-zag product, 1341 

ZPR graph, 1356 
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